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PREFACE 


The accelerating expansion of social psychology in the past two decades has 
led to an acute need for a source book more advanced than the ordinary textbook 
in the field but yet more focused than scattered periodical literature. Murchison’s 
Handbook of Social Psychology (1935), the only previous attempt to meet this 
need, is out of date and out of print. It was this state of affairs that led us to 
assemble a book that would represent the major areas of social psychology at a 
level of difficulty appropriate for graduate students. In addition to serving the 
needs of graduate instruction, we anticipate that the volumes will be useful in 
advanced undergraduate courses and as a reference book for professional psy- 
chologists. 

We first considered the possibility of preparing a Handbook three years ago. 
However, a final decision to proceed with the plan was not reached until the fall 
of 1951. During the interval we arranged an outline of topics that represented 
our convictions concerning the present state of social psychology. We then wrote 
to a large number of distinguished social psychologists asking them whether they 
felt our venture was likely to be professionally valuable and asking for criticisms 
of the outline we had prepared. The response to these letters was immensely 
gratifying — social psychologists as a group appear sufficiently altruistic to spend 
large amounts of time criticizing and commenting on a project of which they 
approve even though they may be unable to participate in it themselves. We also 
asked for specific recommendations of people who seemed best qualified to prepare 
the various chapters. After receiving answers we drastically revised our outline 
and proceeded to invite authors to prepare the various chapters. It was not until 
the spring of 1952 that we completed our list of contributors and even this list 
later underwent change. We first suggested (tongue in cheek) that the manu- 
scripts be submitted by September 15, 1952. However, as we secretly expected, we 
were forced to change this due date to January 1, 1953. This "deadline" we tried 
hard to meet. But of course we failed and shifted our aspiration to June 15, 1953. 
Again we failed, although by now we were making substantial progress. By early 
in the fall of 1953 we had all the chapters excepting two, and the first volume was 
completed and in the hands of the publishers. The last two chapters were not 
received until early in 1954, when the second volume went to press. 

Something should be said concerning the basis for the organization of the sub- 
ject matter of these volumes. It became apparent early that there are many ways 
to subdivide social psychology but very little agreement concerning just which is 
best. Although we sought the advice of others, we found for almost every com- 
pelling suggestion an equally compelling countersuggestion. Thus, in the end, 
it was necessary to make many arbitrary decisions. So much for our knowledge 
that the Handbook could have been organized in many different ways. There 
is no single scheme that would satisfy all readers. 

We early discovered that the subject matter was too voluminous to be con- 
tained in a single volume. Given this decision it seemed quite natural to present 
in one volume the chapters that dealt primarily with theoretical convictions or 
systematic positions, and also the methods and procedures commonly employed in 
social psychology. Likewise it seemed wise to present in one volume those chapters 
that focus upon the substantive findings and applications of social psychology. 
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'The decision to place the historical introduction, theory, and method chapters 
in the first volume reflects a bias in favor of investigation that begins with an 
awareness of the message of the past, an attempt at theoretical relevance, and 
finally with a full knowledge of the procedural or measurement alternatives. All 
of the content of the first volume is seen, at least by the editor, as a necessary 
preparation for good investigation. These are the things the social psychologist 
should know before he lifts a single empirical finger. The second volume, then, 
can be seen as a justification of the contents of the first volume. Here are the 
empirical fruits stemming from the theories and methods summarized in the 
first volume. 

But does this ideal scheme mirror common practice? Are the major empirical 
advances summarized in the second volume in reality a legitimate by-product of 
theoretical conceptions and sophisticated method? In fairness to science in action 
(as opposed to science on the books) we are afraid the answer is No. Social 
psychology has made its advances largely on the shoulders of random empiricists 
and naive realists. Inability to distinguish between analytic and synthetic and a 
tendency toward reification of concepts has accompanied many of the most sig- 
nificant advances in this field. Who would say that those who view an attitude 
as a "construct" created by the investigator have made more of a contribution to 
this area of psychology than those who naively view attitudes as real and concrete 
entities? Thus we sorrowfully admit the organization we have imposed upon the 
Handbook may bear little relation to the path thus far trod in the development 
of social psychology. Nevertheless, it stands as a suggestion of the manner in 
which future development may well take place and as a reminder that the power- 
ful weapon of systematic theory is now more nearly within the grasp of the wise 
psychologist than formerly. Where yesterday the theoretically oriented investi- 
gator and the random realist may have been on even terms, recent developments 
within the field may well have destroyed this equality. An approach efficient in 
the wilderness may be foolish in a more carefully mapped region. In summary, 
the precedence we give to theoretical positions reflects our conviction of the 
importance of theories as spurs to research, but may also represent a program for 
the future rather than a reflection of the past. 

It must be conceded that not all areas of social psychology are covered in these 
volumes with equal thoroughness. Some gaps are due to the blind spots of the 
editor while others are the result of contributors failing to cover an area they 
originally agreed to cover and, in a few cases, to contributors who withdrew alto- 
gether. In spite of these shortcomings, the volumes in their present state provide 
the most comprehensive picture of social psychology that exists in one place today. 

While deficiencies of the final product are my own responsibility, they exist in 
spite of a number of advisors who gave their time and energy generously through- 
out the venture. Of these collaborators none was nearly so important as Gordon 
Allport. In fairness he should be co-editor of the volume, as he contributed im- 
measurably both in matters of policy and in matters of detail. I owe a very special 
debt of gratitude to my wife Andrea for her tolerance, encouragement, and de- 
tailed assistance. Likewise of great importance is the contribution of Shirley H. 
Heinemann, who has been of constant help throughout the editorial process and 
in preparing the Index. Crucial to the success of this work were various addi- 
tional colleagues who served as referees, reading individual chapters and suggest- 
ing changes and deletions. On this score I express my gratitude to Raymond 
Bauer, Anthony Davids, Edward E. Jones, Kaspar Naegele, David Schneider, and 
Walter Weiss. In addition, many of the contributors served as referees for chap- 
ters other than their own. I am indebted to E. G. Boring, S. S. Stevens, and 
Geraldine Stone for many helpful suggestions based on their experience in arrang- 
ing the Handbook of Experimental Psychology. Mrs. Olga Crawford of Addison- 
Wesley played an indispensable role in final preparation of the manuscripts. 

We wish to thank the following publishers for permission to reproduce tables, 
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figures, and excerpts used in these volumes: Addison-Wesley Publishing Company 
for R. F. Bales, Interaction Process Analysis; Annual Reviews, Inc. for C. P. 
Stone and D. W. Taylor (Eds.) , Annual Review of Psychology; Appleton-Century- 
Crofts for E. R. Hilgard, Theories of Learning; Beacon House for L. S. Cottrell 
and Ruth Gallagher, Developments in Social Psychology; Ernest Benn, Ltd., 
London, for G. LeBon, The Crowd; Cambridge University Press for W. Mc- 
Dougall, The Group Mind; Carnegie Press for H. S. Guetzkow (Ed), Groups, 
Leadership and Men; Clark University Press for C. Murchison (Ed). Handbook 
of Social Psychology; Columbia University Press for D. Bidney, Theoretical 
Anthropology; Doubleday & Co. for M. Cowley and B. Smith (Eds.), Books That 
Changed Our Minds; The Dryden Press and Helen Service for Marie Jahoda, 
M. Deutsch, and S. W. Cook, Research Methods in Social Psychology; The Dryden 
Press and Alfred McC. Lee for A. McC. Lee and N. D. Humphrey, Race Riot; 
Follett Publishing Co. for H. Werner, Comparative Psychology of Mental De- 
velopment; Free Press for R. K. Merton, Social Theory and Social Structure; 
T. Parsons, R. F. Bales, and E. A. Shils, Working Papers in the Theory of Action; 
and Martha Wolfenstein and N. Leites, Movies: A Psychological Study; Grune 
and Stratton, Inc. for P. H. Hoch and J. Zubin (Eds.) , Psychosexual Development 
in Health and Disease; Harcourt, Brace & Co. for C. Fries, The Structure of 
English and K. Koffka, Principles of Gestalt Psychology; Harper and Bros. for: 
N. W. Ackerman and Marie Jahoda, Anti-Semitism and Emotional Disorder; 
T. W. Adorno et al., The Authoritarian Personality; R. G. Barker and H. F. 
Wright, One Boy's Day; B. Bettelheim and M. Janowitz, Dynamics of Prejudice; 
H. Cantril and G. W. Allport, The Psychology of Radio; C. S. Ford and F. A. 
Beach, Patterns of Sexual Behavior; E. R. Guthrie, The Psychology of Learning; 
P. F. Lazarsfeld and F. N. Stanton (Eds.), Communications Research, 1948-49; 
J. Lee (Ed.), The Language of Wisdom and Folly; K. Lewin, Resolving Social 
Conflicts; K. Lewin, Field Theory in Social Science; R. K. Meron, Mass Persua- 
sion; G. Murphy, Personality; J. H. Rohrer and M. Sherif, Social Psychology at 
the Crossroads; Harvard University Press for T. Parsons and E. A. Shils (Eds.) , 
Toward a General Theory of Action and F. Roethlisberger and W. J. Dickson, 
Management and the Worker; Hogarth Press for S. Freud, Group Psychology and 
the Analysis of the Ego and The Collected Papers of Sigmund Freud; Henry 
Holt & Co., Inc. for G. W. Allport, Personality; 'T. Newcomb and E. L. Hartley, 
Readings in Social Psychology; R. S. Woodworth, Experimental Psychology; and 
L. L. Whyte, The Next Development in Man; Houghton Mifflin Co. for F. H. 
Allport, Social Psychology and C. R. Rogers, Client-Centered Therapy; Inter- 
national Universities Press for G. Róheim, Psychoanalysis and Anthropology 
and G. B. Wilbur and W. Muensterberger (Eds.), Psychoanalysis and. Culture; 
Alfred A. Knopf, Inc. for C. Kluckhohn and H. A. Murray (Eds.) , Personality in 
Nature, Society, and Culture; H. D. Lasswell, Propaganda Technique in the 
World War; H. D. Lasswell and Dorothy Blumenstock, World Revolutionary 
Propaganda; and W. F. White, Rope and Faggot; Liveright Publishing Co. for 
W. Kohler, Gestalt Psychology; McGraw-Hill Book Co. for H. G. Barnett, Inno- 
vation: The Basis of Cultural Change; D. Krech and R. S. Crutchfield, Theory 
and Problems of Social Psychology; K. Lewin, Principles of Topological Psy- 
chology; and M. L. Reymert (Ed.) , Feelings and Emotions; The Macmillan Co. 
for H. Blumer and P. M. Hauser, Movies, Delinquency, and Crime; W. Lipp- 
mann, Public Opinion; R. W. Sellars, V. J. McFill, and M. Farber (Eds.), Philos- 
ophy for the Future; and F. K. Shuttleworth and M. A. May, The Social Conduct 
and Attitudes of Movie Fans; Methuen & Co., Ltd. for W. McDougall, Introduc- 
tion to Social Psychology and W. McDougall, Psychoanalysis and. Social Psychol- 
ogy; Noble and Noble, Inc. for W. D. Scott, Psychology of Public Speaking; W. W. 
Norton & Co. for E. H. Erikson, Childhood and Society; J. Piaget, Play, Dreams, 
and Imitation in Childhood; and J. Ruesch and G. Bateson, Communication: 
The Social Matrix of Psychiatry; Oliver and Boyd Ltd., Edinburgh, and Sir 
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Ronald A. Fisher for R. A. Fisher, The Design of Experiments; Oxford University 
Press, Inc. for H. Head, Studies in Neurology and H. A. Murray, Explorations in 
Personality; L. C. Page & Co. for C. Mackay, Extraordinary Popular Delusions 
and the Madness of Crowds; Philosophical Library, Inc. for P. L. Harriman 
(Ed.) , Encyclopedia of Psychology; Prentice Hall, Inc. for S. E. Asch, Social Psy- 
chology; C. Morris, Signs, Language, and Behavior; and C. P. Stone (Ed.), Com- 
parative Psychology; Princeton University Press for H. Cantril, The Invasion from 
Mars; C. I. Hovland et al., Experiments on Mass Communication; B. L. Smith, 
H. D. Lasswell, and R. D. Casey (Eds.) , Propaganda, Communication, and Public 
Opinion; and S. A. Stouffer et al., The American Soldier; G. P. Putnam's Sons 
for Frances C. MacGregor and Margaret Mead, Growth and Culture; Rinehart & 
Co., Inc. for A. L. Edwards, Experimental Design in Psychological Research; A. L. 
Edwards, Statistical Analysis for Students in Psychology and Education; E. Fromm, 
Escape from Freedom; and OSS Assessment Staff, The Assessment of Men; The 
Ronald Press Co. for R. R. Blake and G. V. Ramsey (Eds.), Perception: An 
Approach to Personality and J. McV. Hunt (Ed.), Personality and the Behavior 
Disorders; Charles Scribner's Sons for C. H. Cooley, Human Nature and the 
Social Order; Simon and Schuster, Inc. for A. Einstein and L. Infeld, The Evolu- 
tion of Physics; Society for Promoting Christian Knowledge, London, for J. C. 
Maxwell, Matter and Motion; George W. Stewart, Inc. for H. D. Lasswell and 
N. Leites, Language of Politics; University of California Press for S. M. Lipset, 
Agrarian Socialism; University of Chicago Press for A. L. Kroeber, Anthropology 
Today; S. Tax et al., An Appraisal of Anthropology Today; and D. Waples (Ed.), 
Print, Radio and Film in a Democracy; University of Minnesota Press for M. 
Deutsch and Mary E. Collins, Interracial Housing: A Psychological Evaluation 
of a Social Experiment; Watts & Co., London, for R. Firth, Elements of Social 
Organization; john Wiley & Sons, Inc. for H. Cantril, The Psychology of So- 
cial Movements; L. Carmichael (Ed.), Manual of Child Psychology; W. G. 
Cochran and Gertrude M. Cox, Experimental Designs; and S. S. Stevens (Ed.), 
Handbook of Experimental Psychology; Yale University Press for E. Cassirer, 
An Essay on Man; C. I. Hovland, I. L. Janis, and H. H. Kelley, Communication 
and Persuasion; N. E. Miller and J. Dollard, Social Learning and Imitation; and 
D. Riesman, The Lonely Crowd. We should also like to thank Albert W. Kennon, 
representative of the trustees of the McDougall estate, for permission to quote 
from W. McDougall, The Energies of Men. Thanks also go to Paul F. Lazarsfeld 
for permission to use excerpts from the following volumes of his authorship pub- 
lished by Duell, Sloan & Pearce, Inc.: Radio Research, 1941, Radio Research, 
1942-43, and The Peoples’ Choice by P. F. Lazarsfeld, B. Berelson, and Hazel 
Gaudet. 
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CHAPTER 1 


The Historical Background of 
Modern Social Psychology 


Gorpon W. ALLPORT 


Harvard University 


Is social psychology an ancient discipline? Or 
is it modern — even perhaps ultramodern? If 
we look only at the spectacular expansion of its 
researches and influence during and after World 
War II we are tempted to claim that social psy- 
chology is a creature of the mid-twentieth cen- 
tury. Like Minerva it seems to have been born 
full-grown. Its sudden popularity and produc- 
tivity tempt us to disregard the past, to brush 
aside the speculations of our intellectual ances- 
tors as so much rust clogging the gears of a new 
and vital science. Why bother with the “meta- 
physical stage” of speculation, as Comte called 
it, when a new era of positivism and progress 
has dawned? 

But there is another and wiser way to view 
the matter. Our intellectual ancestors, for all 
their fumbling, were asking precisely the same 
questions that we are asking today. How does 
one generation, they wanted to know, impose 
its culture and its thought-forms upon the next? 
What happens to the mental life of the individ- 
ual when he enters into association with others? 
And long before social psychology became a 
science political philosophers sought an answer 
to the question, What is the social nature of 
man? They well knew, as Vico (1725) observed, 
that “governments must conform to the nature 
of the men governed.” Hence interest in the 


key problem of social psychology — man’s social 
nature — is both ancient and persistent. 

It is true that our intellectual forefathers 
lacked tools of precision for empirical research 
and that they were sometimes naïve in their 
theories, yet they bequeathed to us an important 
store of observation, hypotheses, and perspec- 
tive, together with many shrewd insights that 
have stood the test of time. Even their errors 
and blind alleys are instructive. It has been well 
said that those who do not know history are 
doomed to repeat its mistakes. And the history 
of science shows that both the accomplishments 
and the blunders of one generation of scholars 
may become building stones in the hands of 
the next. 

While the argument in favor of historical 
perspective is compelling, it is not all-compell- 
ing. It does not, for example, require us to make 
a conducted tour through a museum of oddities 
and antiquities. Nor does it hold us to a series 
of dust-covered portraits. A study of the history 
of social psychology can be justified only if it 
shows the relevance of historical backgrounds 
to present-day foregrounds. To this end the 
present essay lays stress not upon curios and 
portraits, but upon those psychological topics of 
current concern that receive genuinely helpful 
illumination from the past. 
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If we ask an apparently simple question, 
“Who founded social psychology?” we run head- 
first into one of the major problems of the 
science itself — the problem of social invention. 
Can we ever say that a single person turns the 
tide of history? Or that a single person ever 
founds a science? If we favor the “great man” 
type of explanation we could single out several 
candidates for the honor of founding social 
psychology: Plato, Aristotle, Hobbes, Comte, 
Hegel, Lazarus and Steinthal, Tarde, E. A. Ross, 
and others, depending on the criteria and the 
time-perspective we choose to adopt. A good 


case could be made for each of these men. Sev- 
eral of them have actually been called “the 
father of social psychology.” But the truest 
answer to the question asserts that the roots of 
modern social psychology lie in the distinctive 
soil of western thought and civilization. Even 
more than most disciplines, social psychology 
requires a rich ground in which the tradition of 
free inquiry, a philosophy and ethics of democ- 
racy, and the prior rise of natural and biological 
science form a nourishing blend. 

While the roots of social psychology lie in the 
intellectual soil of the whole western tradition, 
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its present flowering is recognized to be charac- 
teristically an American phenomenon. One rea- 
son for the striking upsurge of social psychology 
in the U.S.A. lies in the pragmatic tradition of 
this country. National emergencies and condi- 
tions of social disruption provide special incen- 
tive to invent new techniques, and to strike out 
boldly for solutions to practical social problems. 
Social psychology began to flourish soon after 
the First World War. This event, followed by 
the spread of communism, by the great depres- 
sion of the '30's, by the rise of Hitler, the geno- 
cide of the Jews, race riots, the Second World 
War and its consequent anomie, stimulated all 
branches of social science. A special challenge 
fell to social psychology. The question was 
asked: how is it possible to preserve the values 
of freedom and individual rights under condi- 
tions of mounting social strain and regimenta- 
tion? Can science help provide an answer? This 
challenging question led to a burst of creative 
effort that added much to our understanding of 
the phenomena of leadership, public opinion, 
rumor, propaganda, prejudice, attitude change, 
morale, communication, race relations, and con- 
flicts of value. While other nations have been 
confronted with the same world-wide emergency, 
it seems that in the United States the soil of 
western thought, fortified by practical melior- 
ism, proved most fertile for the assertive growth 
of social psychology and related disciplines. 
Practical and humanitarian motives have al- 
ways played an important part in the develop- 
ment of social psychology, not only in America 
but in other lands as well. Yet there have been 
discordant and dissenting voices. In the opinion 
of Herbert Spencer in England, of Ludwig 
Gumplowicz in Austria, and of William Gra- 
ham Sumner in the United States, it is both 
futile and dangerous for men to attempt to steer 
or to speed social change. Social evolution, they 
argue, requires time and obeys laws beyond the 
contro] of man. The only practical service of 
social science is to warn men not to interfere 
with the course of nature (or society) . But these 
authors are in a minority. Most social psycholo- 


gists share with Comte an optimistic view of 
man's chances to better his way of life. Has he 
not already improved his material welfare via 
natural science, and his health via biological 
science? Why should he not better his social 
relationships via social science? For the past 
century this optimistic outlook has persisted 
even in the face of slender accomplishment to 
date. Human relations seem stubbornly set. 
Wars have not been abolished, labor troubles 
have not abated, and racial tensions are not 
relieved. Give us time and give us money for 
research, the optimists say. 

"There is a more formal way of viewing the 
recent upsurge of interest in social psychology. 
It could be explained in terms of Comte's theory 
of three stages (1830, Vol. I, Ch. 1). Comte 
would say that only recently have the social sci- 
ences left the constraints of the first two stages, 
the theological and metaphysical respectively, 
and entered fully into the third stage of positiv- 
ism. While Comte himself endeavored to inau- 
gurate the third stage, it is clear that the fruit of 
his effort was delayed for nearly a century until 
the positivistic tools of experiment, statistics, 
survey methods, and like instruments were more 
adequately developed. Hornell Hart (1949) has 
plotted convincingly the recent upswing in the 
productions of social science, and argues that 
the recent acceleration marks the delayed en- 
trance of social science into the era of positivism. 
The delay, only now ending, has resulted in a 
serious “cultural lag” (Ogburn, 1922) which is 
due to the prior entrance of the natural sciences 
into the era of positivism. Technological de- 
velopments are dangerously far in front of social 
advances. Man can change matter into energy 
but cannot yet socially control the energy he 
creates. The cultural lag manifests itself in 
many phases of social life. Anomie is wide- 
spread. 

Whether social science, under proper ethical 
guidance, can eventually reduce or eliminate the 
cultural lag may well be a question upon which 
human destiny depends, 
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No sharp boundaries demarcate social psychol- 
ogy from other social sciences. It overlaps politi- 
cal and economic science, cultural anthropology, 
and in many respects is indistinguishable from 
general psychology. Likewise, its tie with sociol- 
ogy is close. It was a sociologist, E. A. Ross, who 
wrote the first book to bear the title Social Psy- 
chology (1908). Of the dozens of textbooks 
subsequently published in the United States 
somewhat more than half have been written by 
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psychologists, somewhat less than half by sociol- 
ogists. 

In spite of this apparent lack of autonomy, 
social psychology has its own core of theory and 
data and its own special viewpoint. Its focus of 


» interest is upon the social nature of the indi- 


vidual person. By contrast, political science, 
sociology, and cultural anthropology take as 
their starting points the political, social, or 


cultural systems in which an individual person — 
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lives. It is obvious that a complete science of 
social relations, as Parsons and Shils (1951) 
point out, will embrace both the personality sys- 
tem and the many-sided social system. 

With few exceptions, social psychologists re- 
gard their discipline as an attempt to under- 
stand and explain how the thought, feeling, and 
behavior of individuals are influenced by the 
actual, imagined, or implied presence of other 
human beings. The term “implied presence” 
refers to the many activities the individual car- 
ries out because of his position (role) in a 
complex social structure and his membership in 
a cultural group. 

Sociology, anthropology, and political science 
are "higher level" disciplines; they seek inclu- 
sive laws of social structure, social change, cul- 
tural patterning. They wish to know the course 
of society with the individual extracted. What 
happens when all citizens of the United States 
are gradually replaced by other citizens The 
English language continues to be spoken; the 
form of government is not fundamentally al- 
tered; the cycles of economic behavior and the 
existence of social classes continue to be pretty 
much as they were. By contrast, social psychol- 
ogy wishes to know how any given member of a 
society is affected by all the social stimuli that 
surround him. How does he learn his native 
language? Whence come the social and political 
attitudes that he develops? What happens when 
he becomes a member of a group or of a crowd? 

Social psychology is above all else a branch of 
general psychology. Its center of emphasis is the 


same: human nature as localized in the person. 
Some writers argue that since personal mental 
life is always influenced by "the actual, imag- 
ined, or implied presence of others" then all 
psychology must be social. The point is tenable 
if we wish to pressit. But in practice it has little 
value. There are many problems of human na- 
ture that need to be solved apart from social 
considerations: problems of psychophysics, sen- 
sory processes, emotional functions, memory 
span, the nature of personality integration. So- 
cial psychology overlaps general psychology, but 
is not identical with it. 

In tracing the history of social psychology we 
are in the uncomfortable position of having to 
abstract from the total context of each author's 
thought those ideas that bear specifically on the 
social nature of the individual. We shall not, 
for example, examine the entire political philos- 
ophy of Bentham, or the entire sociology of 
Herbert Spencer. We pick out only those psy- 
chological assumptions upon which the re- 
mainder of their social theory rests. As for 
psychologists, we do not describe the total con- 
tribution of Wundt or McDougall or Freud to 
psychological science, but only those portions 
that are especially relevant to the explanation 
of social conduct. 

Thus to write a history of social psychology 
requires a special type of light-fingeredness. 
One extracts a finding here, an idea there, and 
tries to show how these threads have woven 
themselves into the fabric of present-day social 


psychology. 


SECONDARY SOURCES 


The only way the student can achieve an accu- 
rate sense of history is to dig into the original 
writings of the leading authors of the past. But 
as the shadow of history lengthens it becomes 
increasingly difficult to consult primary sources 
exclusively. Secondary sources, if used with dis- 
crimination, have their uses. Not only do they 
save time, but they help a historian check his 
interpretations against those of others. 

While there are many histories of sociological 
and political thought, there is no definitive his- 
tory of social psychology. Selected portions of 
the subject are treated by Sprowls (1927) and 
by Karpf in her American Social Psychology 
(1932) and in her more recent addendum (1952). 
Cottrell (1941) has surveyed the decade 1930-40. 
None of these authors makes a sustained dis- 
tinction between sociology and social psychol- 
ogy. The history of social psychology in Ger- 
many is reviewed by Lukasczyk (1951). Among 
the histories of sociology we sometimes find an 
antipsychological bias, as in Becker and Barnes, 


Social Thought from Lore to Science (1952). 
But in this book as well as in Sorokin (1928), 
Rice (1981), L. L. Bernard (1934), Barnes 
(1948) , House (1936), we find helpful interpre- 
tations of the thought of past authors, including 
their basic psychological assumptions. 

The history of cultural anthropology is still a 
neglected subject. Some assistance may be ob- 
tained from Haddon (1910, 1949), and even 
more from Kroeber and Kluckhohn (1952), who 
give a painstaking review of the concept of cul- 
ture, allowing due weight for its psychological 
connotations. 

Political theories have received many histori- 
cal treatments, some of which include an exten- 
sive consideration of the social nature of man. 
A useful source in this connection is G. Engel- 
mann, Political Philosophy from Plato to Jer- 
emy Bentham (1927). This volume contains a 
series of pastiches that summarize each author's 
views of "the nature of men governed" in the 
author's own style. Other sources include Sa- 
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bine (1937) and Catlin (1947). Catlin makes 
the interesting observation that while political 
philosophy has flourished for two millenia, we 
are now only at the beginnings of a political 
science, because the development of a science of 
politics would have been “too dangerous to the 
powers that be." The same reasoning may help 
to explain why it is only in recent years that the 
soil has been favorable for the growth of a rela- 
tively unimpeded science of social psychology; 
and why it is only in certain western lands that 
the growth has taken place. 

We must not forget the history of philosophy. 
Until a century ago all social psychologists were 
at the same time philosophers, and many philos- 


ophers were social psychologists. Surveys that 
aid in tracing this relationship are written by 
Windelband (1931), Baker (1947), Russell 
(1945), Vaughan (1939). Historians of intel- 
lectual movements also have much to offer, e.g., 
M. E. Curti in The Growth of American 
Thought (1943) . 

To these sources must be added historical 
treatments of general psychology, for above all 
else social psychology depends upon the basic 
orientations and methods of experimental and 
theoretical psychology. In Boring (1950) there 
is much of value. Other historians of psychology, 
such as Brett (1912-1921) and Murphy (1949), 
devote special chapters to social psychology. 


SoctaL PsvcHOLOGY IN ANTIQUITY 


From Plato to the time of Comte virtually all 
theories of the social nature of man were tied 
to, and incidental to, theories of the state. For 
this reason we may rightly say that up to a cen- 
tury ago social psychology was largely a branch 
of political philosophy. It was, of course, Plato 
and Aristotle who first aroused the curiosity of 
western man concerning his own social nature. 

Plato tells us in The Republic that states 
originate because the individual is not self- 
sufficing. He needs the help of many. The type 
of government that results is due to the pre- 
dominance of certain social sentiments. In one 
set of circumstances one of five possible forms of 
statecraft (and sentiments) gains the ascend- 
ancy, in different circumstances another: 

Aristocracy depends upon the rule of philos- 
ophers whose dominant motives are love of wis- 
dom and a passion for justice (the desire that 
each individual shall fulfill the calling for which 
by nature he is best fitted) . 

Timocracy arises when the state's guardians 
are motivated by vainglory, by love of power 
and military ambition. The resulting pattern 
is one of harshness toward subordinates and sub- 
mission toward superiors, 

Oligarchy results when through timocratic 
avarice a few become rich and succeed in estab- 
lishing social values that regard wealth as su- 
preme and poverty as evil. 

Democracy is born in revolt against the in- 
satiable greed of oligarchy. Such slogans as 
Liberty and Equality help evoke the aggressive 
protest of the masses against restraint. 

Tyranny represents the lowest form of sover- 
eignty and results from the excess of freedom 
which democracy engenders, allowing ignorant 
drones to gain power and to rule by caprice sup- 
ported by force. 

Plato felt that men formed social groups be- 
cause they needed them. Thus he held a utili- 


tarian or "social contract" view. Aristotle, on 
the other hand, saw the gregarious motive as 
something instinctive. Man is by nature a “po- 
litical animal" (Politics, 1, ii). It is our inborn 
nature that leads us to positive affiliations and 
to a collective way of living. Yet Aristotle recog- 
nized that the social instinct varies in its force. 
It is not strong enough to lead to true unity 
within the state. The unity of a family is 
stronger. Stronger still is the unity (selfhood) 
of the individual. 

Both Plato and Aristotle developed the anal- 
ogy between the individual organism and so- 
ciety, thus giving an impetus to a doctrine that 
— as we shall see — has hardy supporters down 
to the present day. 

Plato felt that the rule of the head (the 
philosopher-intellect) was essential to a good 
society. Aristotle inclined to have a higher 
opinion of diversified democracy. Since men 
differ greatly in their talents, in a democracy 
“one comprehends this, another that, and all, 
the whole.” Plato would keep the peace through 
a rule by the elite. Aristotle would, when pos- 
sible, preserve peace through an expression of 
the interests of all people. Plato reasoned de- 
ductively from his conception of an ideal so- 
ciety; Aristotle wrote inductively from the point 
of view of human nature as he found it. Plato 
was more of a rationalist; Aristotle more of an 
empiricist. 

Because of the differences between them it has 
been said that every subsequent writer, philos- 
opher, or scientist takes either a Platonic or an 
Aristotelian approach to the study of man and 
society. Perhaps the distinction is too sharp. 
However this may be, one thing is certain: Pla- 
tonic and Aristotelian strands of thought are 
found in all western theory, past and present. 
The flavor of Comte, whom we next consider, is, 
as we shall see, primarily Aristotelian. 


THE HISTORICAL BACKGROUND OF MODERN SOCIAL PSYCHOLOGY V 


Astronomy Physics 


Mathematics 


LOGICS PHYSICS 


PSYCHICS 


Biology | Sociology 


/ 
7 D 
ü D 


D ji 
M social 1 
«X | psychology ’ 
M jj 
a b 
H b 

B 1 
^ D 
` ; 
physiological 
psychology 


Fic. l. Comte's hierarchy of abstract sciences. (Adapted from DeGrange, 1930, p. 211.) 


Fic. 1. Comte's hierarchy of abstract sciences. (Adapted from DeGrange, 1930, p. 211.) 


COMTE’S Discovery or SOCIAL PSYCHOLOGY 


Two facts concerning the contributions of 
Auguste Comte, 1798-1857, are known to most 
students. First, that in his famous “law of three 
stages" he called attention to the gradual emer- 
gence of the sciences from the theological, 
through the metaphysical, into the positive 
stages of development. We have already com- 
mented on the bearing of this law upon the 
current problem of the cultural lag. Secondly, 
most students know that Comte classified the 
basic abstract sciences, and in 1839 reached the 
conclusion that sociology (the name is his in- 
vention) should be recognized as a new positive 
science, One might say that Comte christened 
sociology many years before it was born, but 
nonetheless clearly foresaw its development. 

What most students do not know is that to- 
ward the end of his life Comte was wrestling 
with a “true final science” which, if fully worked 
out, could only have been what today we call 
psychology, even though Comte preferred to 
label it the science of “la morale positive.” In 
Volume 4 of his Systéme de Politique Positive 
(1854) he states his intention to publish in 1859 
Le Systéme de morale positive, ou Traité de 
Véducation universelle. But two years before 
this projected date he died, leaving only frag- 
mentary expositions of the new science. These 
fragments have been collated and interpreted by 
De Grange (1923, 1930) . 

What is important for us today is the logic 
of Comte's argument. No writer has ever given 
such sustained attention as he to the ordering of 
the sciences. The fact that he found himself 


finally driven to postulate a "true final science" 
— and that his conception of that science paral- 
lels our present conception of modern psy- 
chology (especially social psychology) — is a 
neglected event of historical importance. If it 
were possible to designate a single deliberate 
"founder" of social psychology as a science we 
should have to nominate Comte for this honor. 

The reason why Comte refused to adopt the 
term "psychology" for his science is itself re- 
vealing (see Lévy-Bruhl, 1903, pp. 188-209). In 
his day, "psychology" was too rationalistic, too 
introspective, and too mentalistic — in short, 
too “metaphysical” for his taste. He feared that 
by preserving the name he would retard the 
development of the positive science that he was 
seeking. Just as he coined “sociology” to aid 
him in defining a new scientific field, so too he 
coined “la morale” to designate the final science 
which should follow and be built upon biology 
and sociology. 

The abstract sciences. Comte insists upon 
separating the abstract sciences from the con- 
crete. The former deal with irreducible phe- 
nomena, with elements and their analysis, with 
“events” that are fundamental and primary; the 
latter deal with composite phenomena, with 
concrete "beings" (not abstract events), and 
with applications of the abstract sciences. Fig- 
ure 1 lists the seven abstract sciences, in the 
order of their presumed emergence. All other 
sciences are concrete; among them are geology, 
meteorology, botany, zoology, and education 
(the last being an example of applied morale) . 


8 HANDBOOK OF SOCIAL PSYCHOLOGY 


The nature of these sciences is described in 
detail in two long series of volumes, the Positive 
Philosophy, 1830-1842, and Positive Polity, 1851- 
1854. The third series, the Subjective Synthesis, 
which was to include in Volume Two his full 
account of la morale, was never finished. The 
abstract sciences, he argued, can evolve only in 
a definite hierarchical order, as indicated in 
Fig. 1. Mankind can progress only from what is 
known to what is unknown, and only now is the 
vista opening for a positive science of human 
behavior. 

It was the level of Psychics that gave him the 
greatest trouble. At first he discerned only two 
irreducible classes of phenomena pertaining to 
sentient life: (1) the biological, organic, or vi- 
tal, and (2) the sociological, collective, or so- 
cietal. Gradually he felt himself forced to 
recognize a third class of phenomena: (3) the 
moral or individual, wherein the affective, in- 
tellectual, and active functions of the person 
are bound together. 

The nature of la morale. As early as the sec- 
ond volume of the Positive Polity (1852) Comte 
asked himself the momentous question: “How 
can the individual be at once cause and conse- 
quence of society?” A product of society, he is 
yet a creative agent, not to be understood in 
terms of the gross generalizations of sociology. 
While an individual is a concrete being, he re- 
quires an abstract science that will examine his 
nature from a point of view that grows out of, 
but is yet beyond the biological and the socio- 
logical points of view. Man is something more 
than a biological being, something more than a 
“cultural cumulation” with which sociology 
deals. He is a moral agent, uniquely patterned 
and deserving a special science to explore and 
interpret his nature. 


When biology is looked upon as merely out- 
lining the study of human existence in the light 
of the study of the vegetative and animal func- 
tions, then sociology alone gives a knowledge of 
our intellectual and moral attributes which be- 
come sufficiently appreciable only in their collec- 
tive upgrowth. After them, the true final science, 
ie, morale, is able to systematize the special 
knowledge of our individual nature, by virtue of 
an appropriate combination of the two points of 
view, the biological and the sociological. (1852, 
Engl. transl., p. 357) 


It took Comte nearly thirty years of constant 
labor to reach this position. His new advance 
excited him greatly. " 
La morale, he insisted, must deal with the 
individual unity of men. Casting around for a 
theory of motivation that he might employ in 
his new science he found only one to his taste, 


the phrenological system of Franz Josef Gall. 
What he liked about Gall's doctrine was its 
dynamic emphasis upon the core traits of in- 
dividual men. Some men are marked by great 
physical courage, some by ambition, others by 
an altruistic sentiment. The fact that Gall tied 
his faculties to “interior functions of the brain” 
seemed to Comte to be an added scientific ad- 
vantage. What Comte was seeking was a science 
of personality — unfortunately, some years be- 
fore such a science was possible. 

The science would, however, be dependent 
upon sound biology and sound sociology. At 
times la morale would lean more heavily upon 
its biological foundations, and would yield 
findings that today might be classified as physio- 
logical or as constitutional psychology. At other 
times, la morale would deal with the indi- 
vidual's place in a social and cultural context, 
and constitute a social psychology. 

In this line of reasoning we find three striking 
propositions: (1) A science of individuality is 
possible and necessary. (2) This science, while 
allowing for an idiographic approach to per- 
sonalities, will not shun generalizations con- 
cerning what is common to all men, It is a sci- 
ence of human nature in general, but one that 
must adapt itself to the phenomenon of in- 
dividuality. (3) La morale can be viewed — as 
modern psychology is often viewed — either 
from the biological or social point of view. A 
few years after Comte's death both Wundt and 
Tarde were insisting sharply upon this same 
dichotomy within the science of psychology. All 
psychology, they said, is either physiological or 
social, 

Comte's discovery of la morale did not weaken 
his enthusiasm for his earlier discovery of so- 
ciology. Sociology remains the science of the 
"Great Being" which encompasses individuals. 
The existence of a language, a culture, a social 
system is prior to, and in no way reducible to, 
the lives of individuals. 


"These individual perturbations do not sensibly 
alter collective existence, and they do so less as it 
is further developed. The mutual neutralization 
between various individuals leaves subsisting in 
sociology only the permanent influences of the 
truly collective attributes. (1852, Engl. transl., 
p. 438) 


Implications. The problem Comte wrestled 
with is perennial. The modern social psycholo- 
gist, no less than Comte, is haunted by the ques- 
tion: How can the individual be both a cause 
and a consequence of society? That is to say, 
How can his nature depend indisputably upon 
the prior existence of cultural designs and upon 


| 
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his role in a predetermined social structure, 
while at the same time he is clearly a unique 
person, both selecting and rejecting influences 
from his cultural surroundings, and in turn cre- 
ating new cultural forms for the guidance of 
future generations? And all the while this 
social-individual interaction is in progress, man 
is also, of course, a biological being subject to 
the laws of his species. 

Whether Comte’s final three sciences should 
stand in precisely the form in which he defined 
them a century ago is not here the issue. We 
wish merely to point to the modernity of his 
problem. In recent years new concepts have 
appeared on the horizon: Social Relations, Hu- 
man Development, Behavioral Sciences, Human 
Relations. Each of these labels, it seems, tries to 
express an integration of the biological, the so- 
cial, and the psychological approaches. Thus in 
a sense each seems, to attempt a translation of 
Comte's untranslatable term la morale. While 
he failed completely to establish this term in the 
lexicon of science, he did succeed in establishing 
his other pivotal term, sociology. 


Comte was aware of what he called a “happy 
ambiguity” in the term la morale. On the one 
side it connoted to him an abstract science of 
individuality, on the other hand, the realm of 
morality and ethics. It suggested to him the pos- 
sibility of establishing a scientific basis of human 
morality. His view was that in order to achieve 
higher morality man will first have to evolve a 
new discipline. This science will be anchored 
in biological nature, and also in the study of 
society and culture. When its development is 
far enough advanced we may then hope to apply 
its findings for the betterment of human living, 
to what Comte designates “a religion of human- 
ity." This hopeful line of reasoning has many 
adherents today. 

To us Comte’s formal and classificatory efforts 
may seem stilted and old-fashioned. Yet they 
illustrate the truth that acute problems of today 
were likewise acute problems in the past. And 
the moral is: solutions that we reach today may 
be sounder if we take into account solutions 
offered by our predecessors. 


SIMPLE AND SOVEREIGN THEORIES: THEIR SIGNIFICANCE 


Most social psychologists of the nineteenth 
century thrived on unitary explanations. Each 
tended to select and develop one simple and 
sovereign formula that seemed to him to hold 
the key to social behavior, The phrase “simple 
and sovereign” is borrowed from Henry George 
(1879) who declared that the single tax would 
be a "simple and sovereign remedy for the ills 
of mankind.” In the nineteenth century both 
social scientists and reformers sought unitary 
solutions of social riddles, Favorite among the 
psychological open sesames were pleasure-pain, 
egoism, sympathy, gregariousness, imitation, 
and suggestion. 

The year 1908 may be said to mark a turn 
away from such monistic explanations and to- 
ward pluralistic explanations. In that year the 
first two textbooks in social psychology ap- 
peared. The earlier, by Ross, continued to place 
the burden of explanation upon a single prin- 
ciple, sometimes called by him “imitation,” 
sometimes “suggestion.” The later text, Mc- 
Dougall's, called upon a plural array of instincts 
as prime movers of mankind. While the instinct 
doctrine may, in a sense, seem to represent only 
another simplicist type of causal force, still the 
instincts were regarded by McDougall as varied 
in type and diversified in the social behavior 
they provoked. 

It is easy to inveigh against the fallacy of 


unitary explanation, or the “simplicist fallacy,” 
as it is sometimes called, Few modern writers 
focus on a single motive or mechanism and 
claim it to be an allsufficient explanation of 
social behavior. Yet even today we find authors 
who favor some one predominant factor to the 
relative neglect of others. Among them are such 
favorites as conditioning, reinforcement, anxi- 
ety, sexuality, guilt, frustration, cognitive or- 
ganization, role, and social class. Thus it is 
difficult for even a modern author to keep a 
balance in his repertoire of explanations. The 
reason seems to be that every writer aspires to a 
coherent system of explanation and wishes to 
reduce the number of variables in his system to 
the minimum. 

The following pages examine in some detail 
the leading simple and sovereign conceptions of 
the nineteenth century. Our purpose is not to 
resurrect discredited theories, nor even to warn 
modern authors against their own obsessive 
tendencies in system-building. Rather we shall 
hope to demonstrate that the unitary principles 
of the past still represent live issues in contem- 
porary social psychology. While our present-day 
approach is certainly more cautious and more 
plüiralistic,-yet the phenomena described by 
earlier writers are still with us, and in some 
cases it is doubtful whether today's conceptuali- 
zations improve upon those of yesterday. 
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HEDONISM 


We take first a simple and sovereign theory 
that reaches from the time of Greek philosophy 
into the present day. Psychological hedonism 
(or as Jeremy Bentham, 1748-1832, prefers to 
call it, the Principle of Utility) maintains that 
pain and pleasure are our "sovereign masters" 
(1789, p. 1). 

While Bentham had the hedonistic doctrines 
of Epicurus, Aristippus, Hobbes, Adam Smith, 
and many others to draw on, the credit for the 
first extensive formulation of hedonism in rela- 
tion to social psychology is his. He denies the 
importance of sympathy in social life. 'The basic 
fact is that men simply act to secure pleasure 
(sometimes called happiness) and to avoid pain. 
On the occasion of every act a person is led to 
pursue that line of conduct which, according to 
his view of the case taken by him at the moment, 
will be in the highest degree contributory to his 
own happiness. John Stuart Mill (1863) agreed 
with Bentham that to desire anything except in 
proportion as the idea of it is pleasant is a 
physical impossibility. 

Starting with this basic assumption, Bentham 
evolves his “hedonistic calculus.” It maintains 
that the degree of pleasure and of pain can be 
measured if we analyze any given affective state 
with the aid of certain dimensions. Pleasures 
(and pains) have differing degrees of duration, 
intensity, certainty, propinquity (or remote- 
ness) , fecundity (will pleasure or pain follow?) , 
purity (will the pleasure be mixed with pain?) , 
extent (will other people be involved pleasur- 
ably or painfully?). Such an analytic scheme 
reminds us of Wundt's and Titchener's later 
attempts to establish the attributes of feelings, 
though the dimensions they chose were different 
from Bentham's. 

'To Bentham all pleasures were measurable. 
Their motivational power, also their utility, 
could be computed. He did not deny that pleas- 
ures had differing qualities; there were, he ad- 
mitted, pleasures of sense, wealth, skill, amity, 
good name, power, piety, benevolence, malevo- 
lence, memory, imagination, expectation, asso- 
ciation, and relief (1789, Chapter 5). But one 
was “as good as” another. A bottle of pickles is 
as good as a book of poetry if it yields equal 
pleasure. (On this point John Stuart Mill dis- 
agreed.) All that mattered to Bentham was the 
maximizing of total pleasure. To reach a ra- 
tional decision regarding a course of conduct 
— and psychological hedonism has a heavily 
rationalistic cast — one would simply compute 
the probable hedonic consequences of the pro- 
posed act in terms of the resulting duration, 


intensity, certainty, etc., of pleasure. He would 
(and should) act so as to bring himself the 
greatest hedonic good. 

At this point psychological hedonism merges 
into ethical hedonism. Not only is it true that 
men do tend to act in such a way as to maximize 
pleasure but they should so act. To be sure, 
other people's pleasure may enter into the per- 
son's calculations — in the dimension of “ex- 
tent." He should try to maximize the happiness 
of others along with his own. And the policy of 
government and of all social agencies should 
be to facilitate "the greatest good (happiness) to 
the greatest number." 

Thus utilitarian ethics is simply an extension 
of a psychological theory into the domain of 
morals. To apply this ethics one must in gen- 
eral follow a policy of laissez-faire, or "hands 
off." It was customary to ask the question “How 
much government should we have?" and to give 
the answer "As little as possible." For if men 
instinctively seek pleasure, then they need very 
little guidance or constraint. Let each serve his 
own good. While an occasional law may be 
needed, still laissez-faire is the surest way to 
achieve the greatest good for the greatest num- 
ber. 

Here is a case where a psychological assump- 
tion had far-reaching practical consequences in 
society. Its proponents — and there were many 
— fought against social legislation. Why, they 
asked, should we curtail the good resulting from 
the industrial revolution? For example, if un- 
der sweatshop conditions ten thousand men 
obtain factory-made pants at a cheaper price, 
the "greatest good" is served; for their pleasure 
in the aggregate probably exceeds the pain 
caused to the relatively few sweat-shop (or child) 
laborers involved in the process of mass pro- 
duction. 

The doctrine of the “economic man" is like- 
wise a direct outgrowth of the same hedonistic 
assumption. Man, it was held, basically works 
for monetary gain, because money is pleasure in 
abstracto. With it one buys the utilities that 
yield him maximum good according to his con- 
ception of the matter (and no one else's concep- 
tion counts). To take only one example from 
the classic economic laws founded on this as- 
sumption, it was held that "man will buy in 
the lowest market and sell in the highest mar- 
ket." For many years this "law" of behavior 
underlay economic theory. To a considerable 
extent it still does so. Yet exceptions are nu- 
merous. In times of panic, the reverse relation 
holds. Further, a man who is temperamentally 
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generous or temperamentally pessimistic may 
act contrary to the law. And Pareto tells of a 
superstitious banker who lost money by refusing 
to sit at a table with thirteen people to consum- 
mate a sale "in the highest market." Thus a 
host of noneconomic motives enter into eco- 
nomic conduct. It is true that many modern 
economists have abandoned the rigid formulas 
that derived from rationalistic hedonism. The 
"economic man” is becoming extinct. A newer, 
pluralistic conception of economic behavior is 
arising that takes into account many more fac- 
tors than are included in the hedonistic assump- 
tion (Parsons, 1949; Katona, 1951). 

Herbert Spencer, 1820-1903, gave support to 
the hedonistic theory and to laissez-faire by 
tying both to the doctrine of evolution. He 
pointed out that pleasurable activities, for the 
most part, make for survival. Pain betokens 
danger and death. It is therefore natural to seek 
pleasure and avoid pain. To the obvious ob- 
jection that some destructive activities are also 
pleasurable (drunkenness and other vices) Spen- 
cer replied that this dislocation is only a re- 
flection of the imperfect state of society. When 
social life is fully regulated by evolutionary 
principles, then pleasure and survival value will 
be identical. 

Spencer joined the opponents of social legis- 
lation, likewise on psychological grounds. The 
state must respect rugged individualism and 
allow every pleasureseeker to attain his own 
good in his own way — so far as possible. Men's 
motives, like nature herself, are often "red in 
tooth and claw," but even so a premium must 
be put on these natural impulses. To be sure, 
Spencer also recognized the affiliative (sympa- 
thetic) motives in men. But these, he thought, 
should be expressed not in the activities of the 
state, but only within the circle of the family. 
For him there was a sharp distinction between 
the "ethics of the state" and the "ethics of the 
family." On this ground he opposed the advent 
of free public education in Britain — which was 
finally achieved over his protests in 1873. Let 
the family plan for, and pay for, the education 
of its children. The state should never put a 
premium on weakness. 

We have dwelt on the relationship between 
psychological hedonism and social policy in the 
nineteenth century for two special reasons. 
First, we wish to show that here is a psychologi- 
cal hypothesis that was, in a sense, put to the 
test — and failed, In the long run the state 
could not leave men alone to seek their own 
good, relying on their instinct for pleasure and 
their reason. The cry of protest from Dickens, 
Ruskin, Kingsley, and the socialists finally pre- 


vailed. Their argument was that only one side 
of human nature and only one portion of the 
population were being favored by Utilitarian 
ethics. Human life is too complex to rely on 
the manifestly narrow doctrine of Utility. The 
tide turned toward social welfare policies — to- 
ward what today is called “welfare state." The 
claims of simon-pure hedonism, when tested in 
the crucible of social policy, proved inadequate. 

A second reason for dwelling on this matter 
is to provide illustration for another significant 
fact. The theories of social psychology are 
rarely, if ever, chaste scientific productions. 
They usually gear into the prevailing political 
and social atmosphere. Dewey (1899) has shown 
that this is so. For example, an aristocracy pro- 
duces no psychology of individual differences, 
for the individual is unimportant unless he hap- 
pens to belong to the higher classes. Dewey 
likewise points out that dualistic psychology 
flourishes best when one group holds a monop- 
oly of social power and wishes to do the thinking 
and planning, while others remain the docile, 
unthinking instruments of execution. And 
apologists for the status quo, he adds in another 
contéxt, are the ones who most readily declare 
human nature to be unalterable. 


The ultimate refuge of the standpatter in every 
field, education, religion, politics, industrial and 
domestic life, has been the notion of an alleged 
fixed structure of mind. (1917, p. 273) 


Our point is that laissez-faire was in part the 
consequence of a psychological theory, and in 
part a product of the prevailing social ethos. 
'The thinking of Bentham, Mill, Spencer, and 
other hedonists was inevitably molded by the 
prevailing practices of the era of the industrial 
revolution. Their psychological theory meshed 
into the social situation of the day, and became 
to some extent what Marx and Engels (1846) 
and Mannheim (1936) would call “ideology.” 

In recent years we have had more spectacular 
samples of the dependence of scientific theory 
upon prevailing political and social conditions, 
viz., in the surrender of German psychology to 
Nazi thought and of the Russian psychology to 
Communist thought. 

Hedonism today. Yet hedonism is more than 
a culture-bound and time-linked ideology. It is 
a basic theory of the prime motivation of men. 
In various guises it is held today as tenaciously 
as in the past. Freud, especially in his earlier 
writing, insisted that all instincts "seek pleas- 
ure." While they may be overlaid, repressed, or 
controlled, they still strive unconsciously, and 
in disguised ways, for pleasure. 

At the present time, however, the term “pleas- 


D 


12 HANDBOOK OF SOCIAL PSYCHOLOGY 


ure” is not popular with psychologists, chiefly 
because it implies a dualism in the relation of 
body and mind. (Pleasure is a conscious state, 
and it is considered unscientific to claim that 
consciousness “causes” behavior.) Hence other 
labels for current hedonism have been devised. 
For example, Dollard and Miller write: 

The principle of reinforcement has been sub- 
stituted for Freud's pleasure principle. The con- 
cept of “pleasure” has proved a difficult and 
slippery notion in the history of psychology. 
(1950, p. 9) 

As we shall see, “reinforcement” refers to past 
pleasure (or satisfaction) rather than to future 
pleasure (which the classical hedonists stressed), 
but the hedonistic flavor is still present. 

Another disguised version of hedonism is the 
prevailing dictum, found in most present-day 
psychological texts, that all motives tend to 
achieve “tension reduction.” Thus Kluckhohn 
and Murray write that “the nearest thing to an 
all-embracing principle” is the concept of “need, 
drive, or vectorial force." Such a force leads the 
individual to "re-establish equilibrium, reduce 
the tensions, appease the need" (1948, p. 14). 
This view is at bottom a variant of hedonism, 
because the act of escaping tension is also an 
escape from anxiety, from frustration, from dis- 
tention — in brief, from pain. 'T'o be sure, the 
stress here is on negative hedonism (Epicurean) 
rather than a positive pleasure-seeking hedon- 
ism (Cyrenaic) , but it is hedonism all the same. 
Since all motives are said to be of this tension- 
reducing order, the position is not unlike 
Bentham's contention that "pleasure and pain 
are our sovereign masters.” 

Hedonism, therefore, as an over-all concep- 
tualization of human motivation, is neither dead 
nor discredited. Perhaps its longevity certifies 
that it contains a core of truth. 

Critique. There are, however, many argu- 
ments raised against hedonism, some of which 
apply to modern as well as to classical formula- 
tions. To assist our survey of these criticisms, 
we shall adopt the analytical scheme proposed 
by Troland (1928). This author advises us to 
distinguish between hedonism of the present, 
of the future, and of the past. 

Hedonism of the present would hold that 
each individual now, in the present moment, is 
acting in such a way as to obtain maximum 
present pleasure. This is clearly an untenable 
position, excepting perhaps in the case of a 
vegetative, psychotic patient. Most people «lo 
submit themselves to the dentist chair; they do 
wash dishes, save money, and forego present 
pleasures. No hedonistic writer makes any con- 
trary claim. 


H 


Hedonism of the future, however, is the doc- 
trine of Bentham and Mill and, in some re- 
spects, of the “tension reduction” school of 
thought. The individual is doing now what in 
the future is designed to gain happiness or relief 
from pain. Here many criticisms are relevant. 

(1) We recall that when recast in terms of 
social policy (laissez-faire) the assumptions of 
classical hedonism failed to hold up. Men did 
not in fact act so as to maximize pleasure for 
themselves or for others. 

(2) Troublesome too is the interactionist po- 
sition implied. How can “ideas” of future pleas- 
ure motivate present conduct? 

(3) Especially serious is the charge that men 
cannot aim at pleasure in the abstract. They 
can only seek concrete goals (a job, a mate, food, 
shelter). At best the feeling-tone is a by-product 
of reaching the goal; it does not constitute the 
motive itself. In this connection, employing the 
evidence of introspection, Titchener finds that 
pleasure and pain are not the sole determinants 
of action. Indeed, he writes, “I do not consider 
that they can be numbered at ‘all among the 
conditions of action" (1908, p. 297£) . 

(4) Similar is McDougall's argument that 
pleasure and pain are nonmotivational; they are 
nothing more than signposts, guides, indicators 
that instincts are successfully or unsuccessfully 
running their course (1923). A mother cannot 
help but care for her offspring through thick 
and thin (if she has a normal parental instinct) . 
Pleasure and unpleasure merely tell her that 
the instinct is, or is not, fulfilling its basic pur- 
pose of successfully caring for her young. In- 
stincts are the prime movers; at most they are 
guided by the signals of pleasure and unpleasure 
to select instrumental activities that will fulfill 
their purposes. 

Freud, we recall, identified instincts with 
pleasure seeking. His earlier view of the 
matter was thus very different from Mc- 
Dougall’s. In later years, however, Freud modi- 
fied his hedonistic position considerably, and in 
Beyond the Pleasure Principle (1920) postu- 
lated Thanatos, a death instinct, which ulti- 
mately negates Eros and pleasure seeking. 

(5) If we appeal to common observation, 
doubts increase. While hedonism may seem a 
good "fit" to the self-seeking activities of child- 
hood and youth, it seems less appropriate to 
the activities of parents, teachers, artists, and 
above all, martyrs. In mature conduct there is 
often an overwhelming sense of duty, loyalty, 
or commitment to a way of life, and these mo- 
tives seem to take control of the personality. 
When pleasure or pain are encountered, they 
may be regarded as “twin impostors,” and the 
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motive may run its course regardless of the affect 
involved. To this line of argument the hedon- 
ists reply that duty-ridden people are after all 
merely avoiding the pain of not doing their 
duty; and martyrs too are selecting the least 
painful course of conduct, or perhaps seeking 
the pleasure of the next world. It is a question 
whether at this point hedonists are not over- 
extending their terms. Can the bomber who 
crashdives into his target be said, in any reason- 
able sense, to be seeking pleasure and avoiding 
pain? His motive seems remote from such affec- 
tive considerations. It is easy to stretch the 
meanings of pleasure and pain until they lose 
significance. 

Hedonism of the past escapes some of these 
criticisms, but is open to others. Its most suc- 
cinct statement comes from Thorndike’s dictum 
that “pleasure stamps in; pain stamps out” 
(1898) . He was referring, of course, to the learn- 
ing process, and defining what he called “the 
law of effect,” which in the present day is usu- 
ally rendered as “reinforcement theory.” This 
form of hedonism holds merely that each person 
now is acting according to the mode of conduct 


EcoisM 


Hedonism is a doctrine of self-centeredness. 
A person who seeks pleasure and avoids pain is 
inevitably serving his own affective interest. 
Eyen though, as Mill eloquently argued, he can 
sometimes gain happiness from contributing to 
the welfare of others, it is still the pursuit of his 
own pleasure that motivates him. 

Thomas Hobbes, 1588-1679, was likewise a 
hedonist, but unlike Bentham and Mill he re- 
duced pleasure seeking to a more basic passion 
of egoism. Egoistic conflict was, to his mind, so 
prominent that mankind must be regarded as in 
a state of “war of all against all.” To obtain 
pleasure it is necessary first and foremost to have 
power. For power gives one the means to obtain 
ease and sensual pleasure; it brings admiration 
and flattery from others, and it thus conduces 
to “the greatest joy of the human soul,” which 
is to have a high opinion of oneself. Therefore, 
man’s most basic motive is “the desire of power 
after power that ceaseth only in death.” 


So that in the first place, I put for a generall 
inclination of all mankind, a perpetuall and rest- 
lesse desire of Power after power, that ceaseth 
only in Death. And the cause of this, is not al- 
wayes that a man hopes for a more intensive 
delight, than he has already attained to; or that 
he cannot be content with a moderate power: but 
because he cannot assure the power and means to 
live well which he hath present, without the 
acquisition of more. (1651, p. 63) 


he has in the past found pleasurable (satisfying, 
effective, tension-reducing) . Good consequences 
have reinforced his habits, his opinions, his 
ways of seeking goals, so that now he maintains 
them. To argue this position in detail would 
take us too far afield. Suffice it to say that studies 
of memory fail to show convincingly that pleas- 
urable memories are significantly more often 
retained than unpleasurable, as the theory 
would predict (Meltzer, 1930). And the exis- 
tence of satiation, boredom, risk-taking, and 
other phenomena make trouble for the theory 
(Hoppe, 1930; Allport, 1946) . 

At the present time no conclusion seems justi- 
fied, other than that pleasure and unpleasure 
have some important relationship to motiva- 
tion. Just what it is we do not yet know. It may 
be as central as Bentham claimed, or as inci- 
dental as McDougall claimed, It may be a 
principle appropriate to childhood but less so to 
maturity. In any case, the role of affect needs 
vastly more empirical and theoretical study be- 
fore it finds its rightful place in an improved 
science of social behavior. 


(POWER) 


Since life in society would be impossible if the 
insatiable desire for power were not curbed, men 
yield to the “common power” of the state which 
he calls the Leviathan. Through a form of “so- 
cial contract” each person obtains protection 
from other power-seeking mortals, and obtains 
a chance to utilize his leisure for the arts of 
peace and for the acquisition of knowledge. 

In several respects Hobbes gives an analysis 
of the power motive that is surprisingly modern. 
The following passage indicates the extent to 
which power-holders may corrupt communica- 
tion and create an ideology favorable to them- 
selves: 


For I doubt not, but if it had been a thing con- 
trary to any man’s right of dominion, that the 
three Angles of a Triangle should be equall to two 
Angles of a Square; that doctrine should have 
been, if not disputed, yet by the burning of all 
books of Geometry, suppressed, as farre as he 
whom it concerned was able. (p. 67) 


Extravagant as the example may seem, its re- 
semblance to the deeds of Nazis and Soviets is 
obvious. 

Equally shrewd is his observation of the way 
in which each man thinks very highly of his own 
particular ego. 


For such is the nature of men, that howsoever 
they may acknowledge many others to be more 
witty, or more eloquent, or more learned; Yet they 
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will hardly believe there be many so wise as 
themselves. (p. 82) 


Since all men have a maximum feeling of self- 
worth and selfsatisfaction, then men are by 
nature equal. “For there is not ordinarily a 
greater signe of the equall distribution of any 
thing, than that every man is contented with 
his share." In many respects Hobbes foreshad- 
ows modern doctrines of self-esteem, status-seek- 
ing, and self-regard as pivotal motives, and at 
the same time perceives the sly operation of 
what today are called ego-defensive mechanisms. 

Many variants of the power doctrine can be 
found in the nineteenth and twentieth centu- 
ries. In Germany two influential proponents 
were Max Stirner and Nietzsche. Stirner's book, 
The Ego and his Own (1845), anticipated the 
works of Nietzsche. The latter in his The Will 
to Power (1912), gives a crisp statement of his 
position: 


What man wills, what every smallest part of a 
living organism wills, is a plus of power. Both 
pleasure and unpleasure are consequences of 
striving for it. (Section 702) 


To Nietzsche, as to McDougall, hedonic states 
were incidental to a more basic form of motiva- 
tion. All social behavior, according to Nietzsche, 
is a direct or disguised reflection of power seek- 
ing. Affect is a “consequence.” Even the emo- 
tion of gratitude is nothing more than a “good 
revenge” one takes to re-establish his own 
potency in a social relationship. When viewed 
in the light of man’s will to power, his alleged 
motives of altruism, love of truth, and religion 
turn out to be hollow shams. 

A generation later the doctrine received an 
empirical and clinical rendering at the hands of 
the Austrian psychiatrist Alfred Adler (1917), 
who felt that his teacher and associate, Freud, 
was neglecting the role of inferiority feelings so 
often engendered by poor physique or organic 
defects. The striving for masculinity, overcom- 
pensation, a desire “to confirm one's own value," 
loomed large in Adler's thought. Still later an- 
other psychoanalyst, deviating from Freud, de- 
clared competition to be even more important 
than sex striving in causing personality dis- 
orders (Horney, 1939), and the philosopher 
Bertrand Russell (1938) emphasizes the same 
motive, While present-day psychologists avoid a 
unitary conceptualization, much current work 
features the status motive (Williams, 1920) , the 
ego-level (Hoppe, 1930) , ego-involvement (All- 
port, 1943), self-image (Lecky, 1945). These, 
and many other writings, lay heavy emphasis 
upon self-esteem. 


The most uncompromising and exhaustive 
treatment of the ego-motive is found in L’Ego- 
isme, written by the French biologist, Felix Le 
Dantec (1918). His approach is evolutionary. 
Le Dantec argues that fairly stable levels of mat- 
ter and life have evolved: atoms, molecules, 
cells, organisms. Each tenaciously seeks to main- 
tain itself. A phagocyte in the blood stream 
does not devour disease bacteria “in order to" 
serve the organism as a whole, but simply to 
maintain itself. Similarly, each individual or- 
ganism pursues its own good, without reference 
to society. Try as hard as we may, we cannot 
evolve a social unit to supplant the individual 
ego. Each person is interested exclusively in 
himself; altruism is an illusion. “To exist at all 
is to struggle; to survive is to conquer.” All 
social varnish is superficial; at bottom the cave- 
man remains. Even the most pious Christian is 
thrilled and excited by battle and by brute vic- 
tory. 

If this stark account of human motivation is 
true, one naturally asks, “How are we to account 
for society at all, especially for the social insti- 
tutions that seem to foster cooperation and 
mutual aid?" Le Dantec answers: it is because 
of the existence of common enemies. The basis 
of all society is l'ennemi commun. Even the 
family is at bottom merely a serviceable unit for 
common protection. Parents take care of chil- 
dren in order later to benefit, e.g., gaining finan- 
cial support in their old age. 'The basis of 
friendships and of in-groups is not love so much 
as a common fear or common hate. The easiest 
way to get along with a stranger is to find some 
person, some group, or some object that both 
dislike. Allies in wartime place their own selfish 
interests in abeyance until the common enemy 
is defeated; then the allies fall out with one an- 
other. Indeed, even a single nation has genuine 
unity only in wartime, when the lives of all its 
members are threatened. For this reason gov- 
ernments provoke wars. 

Yet, Le Dantec continues, we cannot afford 
to admit that our motives are purely egoistic 
and that the only basis of social solidarity is 
common hate and distrust. To do so would 
weaken further the already precarious fabric of 
society. Hence a fantastic superstructure of 
“metaphysical absolutes" has grown up. A meta 
physical absolute is any doctrine, taught with 
the sanctity of tradition, that masks the basic 
egoism of human life. Conscience is the carrier 
of such doctrines, and is created by parents and 
teachers in children in order to preserve the 
hypocrisies that are so essential in social life. 

Take, for example, the metaphysical absolutes 
of religion. The Decalogue, so it seems to Le 
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Dantec, provides a transparent example of social 
hypocrisy. The first three Commandments re- 
quire men’s obedience to God, and this initial 
obedience provides the sanction for all remain- 
ing Commandments. Le Dantec would agree 
with Voltaire, who held that no parent could 
bring up a child without invoking divine sanc- 
tion. The postulated power of a deity is essen- 
tial to achieve social control. The fourth 
Commandment reveals the point: it says “Honor 
thy father and thy mother” — a rule of extreme 
convenience to parents. The utilitarian nature 
of this Commandment is betrayed in its addi- 
tional phrase "in order that thy days may be 
long M 

“Thou shalt not kill" is likewise an egoistic 
rule, meaning, naturally, that thou shalt not 
kill me. Similarly "thou shalt not steal" protects 
the property of the one who teaches the rule. 
"Thou shalt not bear false witness" grew out 
of the necessity for the in-group to be relatively 
truthful. Likewise, the prohibition against cov- 
eting is psychologically adroit, for the place 
to block transgressions is in the attitudes of the 
would-be transgressor before they issue into 
action. “Thou shalt not commit adultery” is 
a commandment taught with special enthusiasm 
by those who have sexual partners to those who 
have not. 

Le Dantec's view of human motivation and 
of social values may seem to an extreme degree 
to be cynical and dyspeptic. But is it wrong? 

Critique. Common to all theories of egoism 
is one precarious assumption — namely, that the 
process of social learning never results in a 
genuine transformation of motives; that is to 
say, the process of socialization is never truly 
effective. Le Dantec views socialization as a mat- 
ter of “deformations of the ego resulting from 


life in common." "Why deformations?” one 
may ask. May not social learning result in for- 
mation, in conformation, or even in re-forma- 
tion? Comte held a more optimistic view of 
socialization, expressed in his Law of Affective 
Evolution, which says that with time there 
comes a diminution in the preponderance and 
intensity of personal inclinations, and a genuine 
growth and extension of altruistic sentiments. 

May it not be that the picture of egoism is 
appropriate to the unsocialized being (includ- 
ing the child) more than to adults with social- 
ized personalities? Hobbes wrote, "the wicked 
man is the child grown strong." Unbridled self- 
assertion may be natural at an early stage in the 
development of personality, but abnormal at a 
later stage. "Theories of socialization today are 
split on this same issue: some hold that the 
primitive egoistic drives are never modified, but 
only overlaid, or channelled, or "cathected." 
Others argue for a changing ego-structure, 
wherein new interests become functionally au- 
tonomous of their origins, or where productive- 
ness and maturity effectively inhibit earlier 
instinctive aggression and pleasure seeking. 

It is difficult to strike a proper balance in this 
matter. On the one hand we cannot deny that 
hypocrisies and rationalizations of men con- 
cerning their own motives are common, On the 
other hand, who can say with certainty that the 
original nature of man does not include some 
strongly affiliative dispositions? Or that the 
course of social learning does not result in thor- 
oughgoing transformations of the character- 
structure? While self-love remains positive and 
active in every man, is it necessary that it should 
be sovereign with him? We shall resume con- 
sideration of these questions in our discussion 
of Sympathy (pp. 18-21) . 


IRRATIONALISM AND RATIONALISM IN SOCIAL Psyc HOLOGY 


In this section we shall interrupt our exposi- 
tion of simple and sovereign doctrines to discuss 
an issue that is central to all of them. The 
reader has noted, no doubt, that hedonism and 
egoism are regarded as powerful and irrational 
pushes that dominate man's entire behavior, 
making life in society at best a difficult achieve- 
ment. This unfavorable view of man’s social 
capacities has in fact predominated throughout 
the ages. From the time of antiquity writers 
have stressed the force of pleasure seeking, of 
fear, of the power drive, or other equally blind 
passions. In The Prince (completed in 1513) 
Machiavelli saw nothing to be gained in ap- 
pealing to people's intellect. The prince must 
see to it that he is beloved and feared. If he 
must choose one of the two, let him inspire 


fear, for people are at bottom ungrateful, false, 
cowardly, and interested only in their own wel- 
fare. A century and a half later Hobbes, as we 
have seen, held an equally uncomplimentary 
view of human nature. 

Relative rationalism, Yet as the Enlighten- 
ment broke over Europe the stock of the com- 
mon man went up. He was given credit for 
managing his social life with relative (though 
not complete) intellectual acumen. Locke, 
1632-1704, felt that the "social contract" was a 
rational necessity of man’s nature. In fact, he 
equated “natural law” with “the command of 
reason.” Unlike Hobbes, who thought the state 
of nature was a state of war, and unlike Rous- 
seau, who thought it was a state of happy inno- 


cence, Locke saw the state of nature to be one 
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of reasonableness, moderation, adaptive com- 
promise (1689). The doctrine of the reasonable 
decency of man, coupled with faith in his ability 
to utilize the developments of science, com- 
merce, and industry for his own ends was spread 
by such writers as Godwin (1793) and Con- 
dorcet (1795), and influenced both the Utili- 
tarians and the founders of American democ- 
racy. 

At first sight it seems paradoxical to call 
hedonism a rationalistic type of social psychol- 
ogy. The aim of men is emotionally determined; 
they cannot abstain from the pursuit of pleas- 
ure. But when this starting point is admitted, 
then their subsequent conduct is accomplished 
through rational choice—perhaps even through 
Bentham’s “hedonistic calculus.” Present-day 
forms of hedonism have completely lost their 
rationalistic cast, but Bentham and Mill had 
high regard for man’s capacity to calculate in 
advance the consequences of his conduct. 

In the United States the Founding Fathers 
likewise had marked faith in men’s ability to 
face facts, judge rightly, and vote wisely. ‘Ihe 
conviction, to be sure, was stronger in Thomas 
Jefferson and James Madison than in Alexander 
Hamilton, who feared, as did Plato, that democ- 
racy runs the danger of becoming the “rule of 
the worst.” In the Federalist Papers the is- 
sue is argued out. Checks and balances are 
devised for the Constitution which, while main- 
taining faith in the rational judgment of voters, 
yet prevent government by mob passions or by 
power-hungry demagogues. Social psychologists 
today might well ask themselves whether the 
issue of rationality versus irrationality did not 
achieve a more balanced solution in various 
historic American documents than it achieves in 
their own theories, After all, American democ- 
racy has proved viable for nearly two centuries. 
Its assumptions concerning “the nature of the 
men governed" cannot be wholly wrong. There- 
fore we may well search The Federalist (1788) 
and other writings of Jefferson, Madison, and 
their contemporaries for a well-balanced view 
of rational and irrational factors in the social 
nature of man. 

Nineteenth-century irrationalism. While the 
rationalism of the Enlightenment was not with- 
out its influence, it remains true that most social 
psychologists in both past and present have 
favored irrational interpretations. A theory is 
irrationalistic if it possesses two features: (1) It 
must claim that the prime determinants of 
human behavior are emotion, drive, instinct, or 
some form of blind will. (2) It must also square 
these motives with the apparently peaceful and 
logical structure of group living. More specifi- 


cally, it must tell what the function of intellect 
is. If the intellect has no causal properties, then 
it must at the very least be granted a role in 
inventing hypocrisies and rationalizations that 
help sustain the social edifice. 

An influential expression of irrationalism in 
the nineteenth century is Schopenhauer's essay, 
The Primacy of the Will in Self-Consciousness 
(1819) . In it he gives many reasons for believing 
that blind unrest is basic in all life. When the 
will (emotional need) is active the intellect can 
only serve it. A more modern statement of the 
same position is found in the writing of Kempf 
(1918) who holds that the autonomic nervous 
system is the leader and the central nervous sys- 
tem is the servant, in their co-functioning. 

It was the gigantic impact of Darwin, Mc- 
Dougall, and Freud, with their varying concep- 
tions of instinct, that assured the supremacy of 
irrationalism in present-day social psychology. 
The depreciation of intellect was further as- 
sisted by the rise of behaviorism, and by the 
union of social psychology with -abnormal psy- 
chology during the late ninetenth century. Only 
in recent years, as we shall soon see, has there 
been any marked sign of reaction against irra- 
tionalism. 

Rationalization. lrrationalists admit that for 
all our enslavement to instincts, to blind will, 
or to the push of environmental stimuli, men 
do perceive; they do cognize, plan, and ponder. 
And in particular they perceive, plan for, and 
conceptualize their social relationships. Irra- 
tionalism insists, however, that much of man's 
theorizing is merely a mask for the real state of 
affairs, specifically for the unconscious motives 
that are operating. Man's hypocrisies were 
clearly noted by Machiavelli, by Hobbes, and 
many other earlier writers. Bentham called them 
"fictions," which he defined as the imaginary 
entities people put in place of perceptible and 
inferential entities. Fictions have the happy gift 
of smoothing the course of social life. For ex- 
ample, instead of claiming his prerogatives as a 
powerful individual, the king resorts to double- 
talk about "the crown," thus masking his own 
selfish demands. Bentham introduced likewise 
the concept of “fallacy,” by which he sought to 
label all misuse of praise and blame in social 
control Today we speak of “name-calling,” of 
"smear words," and "purr words." This trick 
Bentham clearly spotted and named, a century 
before the days of modern semantics (Ogden, 
1932) . 

The term rationalization was first used by Er- 
nest Jones (1908) to represent Freud's finding 
that irrational acts are "justified by distorting 
the mental processes concerned and providing a 
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false explanation that has a plausible ring of 
rationality.” A few years later Robinson in his 
Mind in the Making (1921) applied the con- 
cept on a large scale to human history, defining 
it as follows: 


Rationalization is the self-exculpation which oc- 
curs when we feel ourselves or our group accused 
of misapprehension or error. (p. 44) 


Actually the term has broader meanings, reach- 
ing beyond sheer self-justification. It refers also 
to the cloaking of a motive, to defense of a value 
system, to excusing the bad behavior of our 
loved ones, and to any refusal to look at an argu- 
ment in its own right. 

Closely related is the doctrine of stereotype 
advanced by Walter Lippmann (1922). The 
“picture in our heads” serves a variety of useful 
functions because it simplifies complex social 
reality to manageable, if false, proportions. The 
dynamics of the stereotype are clearly set forth 
in the following quotation. 


If the experience contradicts the stereotype, one 
of two things happens. If the man is no longer 
plastic, or if some powerful interest makes it 
highly inconvenient to rearrange his stereotypes, 
he pooh-poohs the contradiction as an exception 
that proves the rule, discredits the witness, finds 
a flaw somewhere, and manages to forget it. But 
if he is still curious and open-minded, the novelty 
is taken into the picture, and allowed to modify it. 
Sometimes, if the incident is striking enough, and 
if he has felt a general discomfort with his estab- 
lished scheme, he may be shaken to such an extent 
as to distrust all accepted ways of looking at life, 
and to expect that normally a thing will not be 
what it is generally supposed to be. (p. 100) 


Much older, but closely related, is the doc- 
trine of ideology. Some trace it to Francis 
Bacon (1620) who pointed to the chronic error 
and distortion that lay in the prevailing "idols" 
of the tribe, of the marketplace, and of the cave. 
Marx and Engels (1846) used the term “ideol- 
ogy” to refer specifically to the elaborate beliefs 
promulgated by the capitalist class to justify its 
favored position in society. Influenced by Marx 
and Engels, the movement known as the “sociol- 
ogy of knowledge" (Mannheim, 1936) has 
concerned itself with any ideology that tended 
to derive from, and to justify, a set of values 
that was in fact determined by irrational, or 
quasi-rational, considerations. It is interesting 
to note that the present-day usage of the term is 
losing the savor of hypocrisy. Whereas Marx 
would never have called communism an “ideol- 
ogy,” today it is customary to do so. We speak 
even of democratic or Christian ideology with- 
out derogatory implication. 


Among other variants of rationalization we 
may cite Le Dantec’s metaphysical absolutes, 
previously discussed; myth as used by Sorel 
(1914) and Delaisi (1927); folklore (Arnold, 
1937) ; vital lies (Lee, 1912) ; “thobbing” (Ward, 
1926) ; and intensional orientation (Hayakawa, 
1941). 

Special attention should be called to Pareto's 
doctrine of derivations, Pareto was a versatile 
economist who wrote a vast treatise in sociology 
from the point of view of an irrationalist social 
psychology (1916). Apparently Pareto was un- 
acquainted with the writing of his intellectual 
predecessors. The results was therefore a de 
novo system of thought. Those who recognized 
that he was only saying, in different and compli- 
cated terms, what irrationalistic social psycholo- 
gists had previously said severely criticized him 
(e.g., McDougall 1935); but those who, like 
Pareto himself, were unacquainted with the his- 
tory of social psychology, praised him extrava- 
gantly, likening his methodology to Descartes’ 
and his originality to Aristotle’s (Bousquet 
1928) . 

Pareto, like all irrationalists, offers a theory 
of the real motives of men (residues) and of fic- 
titious motives (derivations). Residues bring 
men under the domination of "sentiments" 
which clever people manipulate by the use of 
“nonlogical” appeals. The resulting social con- 
duct is “explained” and prettified with the aid 
of verbal illusions (derivations). Derivations 
spread widely in society through devices that 
psychologists today recognize as principles of 
propaganda. The true motives lie in classes of 
mental processes to which Pareto gives such la- 
bels as “the instinct of combinations," “need of 
manifesting one's feelings by exterior acts," 
“sexual residues," and the like. We need not 
take time here to show how both doctrines (resi- 
dues and derivations) are handled in more or- 
derly fashion by various other systems of irra- 
tionalist psychology (e.g., Freudian, McDougal- 
lian). 

Ce rationalism. While the 
dominant trend in social psychology has cer- 
tainly been irrationalistic, counterinfluences 
have not been lacking. One might cite, for ex- 
ample, the general cast of John Dewey's 
thought. At one time, to be sure, he argued that 
“instinct” underlay all social conduct (1917), 
but he soon shifted his formulation to favor 
“habit” as the unit of behavior (1922). Habit 
in Dewey's hands is not wholly irrational, rather 
it is a learned disposition that aids man in his 
rational transactions with his environment. 
While Dewey is by no means blind to the force 


of emotions, his tenacious faith in education and 
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in democracy has, like the teaching of Thomas 
Jefferson, acted as a brake upon irrationalism in 
American social psychology. 

In this connection we should mention the in- 
vention and growth of opinion polling within 
the past fifteen years. The movement assumes 
that the conscious views and judgments of men 
have extensive influence on their conduct, and 
should play a part in the determination of pub- 
lic policy. The widespread use of the polling 
method in the United States Army during 
World War II (Stouffer et al., 1949) reflected 
far more rationalistic than irrationalistic as- 
sumptions concerning the social nature of man. 

We have said that McDougall's instinct doc- 
trine (1908) augmented Darwinian and Freud- 
ian influence. Almost immediately it found its 
way, via Graham Wallas (1908), into an in- 
terpretation of political life. A few years later, 
however, Wallas became alarmed at the one- 
sidedness of the resulting picture, and wrote a 
second treatise to counteract the irrationalism 
of his own first production. In The Great So- 
ciety (1914) he proposed in addition to the 
McDougallian list of native propensities an 
“instinct of thought.” The greatest triumphs of 
human history, he argued, were products of this 
instinct, unaffected by the turmoil engendered 
by all other instincts. The instinct of thought 
is the only instinct aroused by inanimate ob- 
jects, and thus productive curiosity is a genuine 
basis for man’s transactions with the physical 
portion of his environment. Nor is it entirely 
absent in social relationships. 

While it would take too long to argue the mat- 
ter here, it seems safe to say that post-Freudian 
psychoanalysis has taken a certain turn toward 


rationalism. The crux of the change lies in a 
revised view of the ego. Whereas Freud declared 
that the ego (by which he meant the rational 
region of personality) had no dynamic power in 
its own right, neo-Freudians perceive that the 
ego may genuinely transform the instinctual 
forces of the id until a "productive" (more ra- 
tional) personality results (cf. G. W. Allport, 
1953). Equally striking is the strong drift of 
favor in psychotherapy toward "nondirective 
counseling" (Rogers, 1942) . This movement re- 
flects confidence in the rational powers of the 
self to survey its situation and to reach a reason- 
able decision regarding its future course of con- 
duct. 

Finally, among the rationalistic trends we 
mention the growing emphasis on phenomenol- 
ogy and on cognitive organization (e.g, Mac- 
Leod, 1951; Krech and Crutchfield, 1948). While 
this movement does not deny the role of motive 
and emotion, it prefers to look at these factors 
not as the starting point of all action, but as fac- 
tors present within the total perspectival grasp 
with which the individual views his situation. 
Surveying the irrationalism of earlier social psy- 
chology, Asch has declared that it has drawn a 
caricature rather than a portrait of man (1952, 
p- 24). Asch argues that people ordinarily make 
use of their capacity to grasp accurately the ob- 
jective character of a social situation, and to act 
in keeping with its rational requirements. 
Phenomenology and Gestalt psychology center 
attention upon men's perception and cognition 
of their surroundings. This approach, as con- 
trasted with the approach through motivation 
and urge, will capture such rational factors in 
social conduct as may exist. 


SYMPATHY 


It is not only rationalists who object to hedon- 
ism and egoism as sole explanations of social 
conduct. Some irrationalists (believers in in- 
stinct, urge, inborn nature) join the protest. 
They argue that self-love is a grossly one-sided 
principle. Granted that self-love is present in 
every man, they say it is not necessarily domi- 
nant in his nature. 

Some even reverse the formula; they insist 
that the human relationship of love or symbiosis 
is primary; eg. Kropotkin (1902), Suttie 
(1935), Ashley-Montagu (1950). A basic 
groundwork of love and trust is established in 
the early mother-child relationship; it is essen- 
tial to all survival. So fundamental is this fact 
that it is often overlooked by psychologists, who 
are inclined to glue their attention upon the 
pattern of hostility and aggression which is per- 
ceptible only because it contrasts with the un- 


derlying ground of affiliation (see Sorokin, 1950; 
especially Chapter 5) . 

Other theorists do not insist that sympathy or 
affiliation is sovereign, but they agree in giving 
it a prominent place among social motives, In 
this section we shall consider the history of this 
motive whether or not it is advanced as a sole 
explanation of social conduct. 


SYMPATHY, IMITATION, SUGGESTION 


First let us note that there is good historical 
reason why we should examine three fundamen- 
tal concepts in close succession. In a sense, they 
compose the principal triumvirate of theories in 
social psychology. The reason for their close re- 
lationship goes back to Plato. As everyone 
knows, Plato, like Freud, conceived the human 
mind as made up of three faculties or “institu- 


THE HISTORICAL BACKGROUND OF MODERN SOCIAL PSYCHOLOGY 19 


tions.” To Plato the abdomen was the seat of 
emotions or feeling; the breast the seat of striv- 
ing and action; the head the seat of reason and 
thought. (Society had three parallel classes: 
slaves, warriors, philosophers.) Throughout the 
ages this Platonic trichotomy has persisted. 
Mind, it is said, is constituted of: 

Affection (feeling) 

Conation (striving) 

Cognition (thought) 
Now it is possible for a psychologist to stress 
one of these faculties at the expense of others. 
When a social psychologist does so, seeking in 
any one of these functions an exclusive (or al- 
most exclusive) explanation of social conduct, 
he is likely to end up with a simple and sover- 
eign system based on one of three principles: 

Sympathy (affective) 

Imitation (conative) 

Suggestion (cognitive) 
Ina certain sense these concepts are interchange- 
able, in that the phenomena one author would 
explain by one principle, a different author 
would explain by another. And occasionally a 
given author combines two, as when Ross (1908) 
speaks of the "suggestion-imitation" principle. 

Because of this historical linkage, we shall 

consider the three concepts seriatim, starting 
with the doctrine of sympathy in its various 
forms. 


PROPONENTS OF SYMPATHY 


Adam Smith, in his Wealth of Nations (1776), 
gave marked impetus to hedonism and to laissez- 
faire, Yet his total system of thought demanded 
an equal emphasis upon human sympathy. The 
opening sentence of his Theory of Moral Senti- 
ments (1759) reads: 


How selfish soever man may be supposed, there 
are evidently some principles in his nature which 
interest him in the fortune of others, and render 


their happiness necessary to him. . . . 


Smith distinguished two basic forms of sym- 
pathy which most subsequent writers have pre- 
served, First there is the quick, almost reflex, 
type of response. When we sce a person struck 
with a stick we cringe; when we watch a tight- 
rope walker we grow tense. In these cases we 
feel as the other person feels and do as he does. 
Some writers have regarded this mimicry as in- 
stinctive. But today we might say that the con- 
ditioned reflex formula covers it well enough: at 
first we cringe when we are hit, and later we 
cringe at the visual cues that were initially asso- 
ciated with our own cringing. 

The second type of sympathy is more intellec- 
tualized. We are able to sympathize with a per- 


son even though we do not feel as he feels. We 
may congratulate the successful and condole the 
afflicted at times when we feel neither success 
nor affliction. There are sympathetic bonds be- 
tween friends even when they are viewing a 
given situation with unlike emotions. 

Smith derives man’s sense of justice from the 
operation of sympathy. Our capacity for sym- 
pathy leads us to feel with both the aggressor 
and the aggrieved. The resulting balance of 
sympathy determines our final judgment as to 
where justice lies. Similarly, our judgments of 
propriety or good taste are the result of weigh- 
ing a person’s emotional expression against the 
exciting fact. If it seems disproportionate we 
cannot sympathize, and so judge his response to 
be improper. These instances indicate how 
Smith uses his favorite principle to explain a 
large and diversified range of social behavior. 

Herbert Spencer distinguishes the same two 
basic forms of sympathy (1870). He called 
them respectively presentative (immediate, re- 
flexive) and representative (conscious, reflec- 
tive). For a still more highly intellectualized 
sentiment (e.g., the abstract idea of Love as a 
proper way of life) he coins the term “re-repre- 
sentative sympathy.” Also, like Smith, he felt 
that the hedonism and laissez-faire he espoused 
needed softening with the aid of a principle 
that would explain affiliative urges. Society 
originates in the sex instinct. It leads to the 
institution of the family, which is the essential 
unit of survival. Within the family the prin- 
ciple of sympathy takes over; a child survives 
not because he is strong and fit but because his 
defenselessness arouses some grade of sympathy 
in others. Spencer, as we have previously seen, 
argued strongly that this instinct should find its 
expression only in the family circle and not be 
allowed to weaken the rugged fiber of the state, 
which should give its rewards to the strong and 
never to the weak. Gregarious propensities are 
a mutation in the evolutionary struggle. While 
they are essential to human survival, they must 
not lead to coddling. Spencer, as we have said, 
had no use for the welfare state; he even op- 
posed free public schools. Let education, he 
said, be a product of family sympathy, not of 
public policy. 

Peter Kropotkin. The harshness of the “red 
in tooth and claw” doctrine of evolution led to 
a marked reaction. In Mutual Aid (1902) Kro- 
potkin argued that the mutation of sympathy 
td which Spencer called attention is the primary 
fact in human evolution, It represents the 
neglected aspect of Darwinism. Kropotkin's evi- 
dence rests chiefly upon anecdotes collected both 
in time of war and in time of peace. He argues 


D 


20 HANDBOOK OF SOCIAL PSYCHOLOGY 


that the mutual aid we see around us is not 
based upon love for specific individuals, but 
rather upon a broad instinct of “human solidar- 
ity.” A more modern book that in many respects 
restates Kropotkin’s position is Ashley-Monta- 
gu's On Being Human (1950). 

Gregariousness. Many authors have postu- 
lated, as Kropotkin did, a protean social instinct. 
Sociologists especially have done so, in order 
that this broad psychological foundation might 
support their differentiated theories of social 
structure and functioning. For Giddings there 
is a basic "consciousness of kind" compounded 
of organic sympathy, the perception of re- 
semblance, conscious or reflective sympathy, af- 
fection, and the desire for recognition. “To 
trace the operation of the consciousness of kind 
through all its social manifestations is to work 
out a complete subjective interpretation of so- 
ciety" (1896, p. 19). 

During the first world war Trotter published 
his Instincts of the Herd in Peace and War 
(1916), which postulated a similar basic ex- 
planatory propensity. In it Trotter argued 
somewhat chauvinistically in favor of the Eng- 
lish variant of this instinct as against the Ger- 
man. While McDougall (1908) likewise 
postulated a gregarious instinct, he did not de- 
rive all social conduct from this single source. 
The parental instinct, the instinct of appeal, 
and other instincts, too, have social reference. 
When about 1920 a sharp reaction set in against 
the instinct doctrine, gregariousness all but dis- 
appeared as a postulate from social psychology. 
Virtually no writer today employs it as an ex- 
planatory principle. 


PRIMITIVE PASSIVE SYMPATHY 


Before leaving instinctivist explanations, it is 
necessary to call attention to McDougall's theory 
of "the sympathetic induction of emotion" or, 
as he preferred to call it, primitive passive sym- 
pathy” (1908). He states his view as follows: 


We must not say, as many authors have done, 
that sympathy is due to an instinct, but rather 
that sympathy is founded upon a special adapta- 
tion of the receptive side of each of the principal 
instinctive dispositions, an adaptation that rend- 
ers each instinct capable of being excited on the 
perception of the bodily expressions of the ex- 
citement of the same instinct in other persons. 
(14th. edition, p. 98) 

In other words, if I perceive expressions of rage 
in another this perception is likely to arouse ray 
own instinct of anger. There are two keys that 
will unlock each instinct: one is the "biologi- 
cally adequate" cause; the other is the percep- 
tion of the instinct in action in another person. 


The reader may rightly ask, Would not this for- 
mula lay all of us open to too much induced 
emotion? Should we not constantly be suffering 
instinctive seizures vicariously? McDougall es- 
capes this dilemma in an interesting fashion. 
He brings laughter into the picture. Laughter 
is a device by which we avoid excessive sym- 
pathy. Byron wrote, “If I laugh at any mortal 
thing, "tis that I may not weep.” McDougall 
heartily agrees, and adds that laughter saves us 
not only from depression and grief, but from 
all other forms of vicarious sympathy as well. 

Theodore Ribot. In his Psychology of the 
Emotions (1897) Ribot gives great prominence 
to sympathy, which he calls “the foundation of 
all social existence.” He is not content with the 
two forms which most writers recognize, but 
postulates three. The first is the primitive or 
automatic type that we have suggested might 
be regarded as an instance of conditioned re- 
sponse. The second is reflective in that the 
individual is self-conscious concerning his state 
of mind. He knows that he feels for the other 
person even if the suffering is not his. The 
third is an intellectualized sentiment of loyalty, 
tolerance, or philanthropy — far broader in 
type than any specific instance of fellow-feeling. 
His three grades are similar to, if not identical 
with, Spencer's. 

Ribot's approach is phenomenological. He is 
asking, What are the diverse forms of experi- 
ence or "intentions" that men have, to which the 
single and inadequate label sympathy is ap- 
plied? Are we dealing here with a simple unitary 
process, or with a term covering various types of 
mental acts? 

Max Scheler. 'The phenomenological ap- 
proach reaches its climax in Scheler's Wesen 
und Formen der Sympathie (1923). The Ger- 
man language is far more flexible than the 
English or French in dealing with this particu- 
lar subject. A translation of this book was pub- 
lished by the Yale Press in 1954 under the title 
The Nature of Sympathy. There are, according 
to Scheler, eight forms of sympathetic orienta- 
tion (or acts). For most of them clear English 
equivalents are lacking. 

The first three are of a low grade, really 
“pseudosympathy,” since in these cases the focus 
is upon the subject's own feeling and not upon 
his feeling with or for others. 

1. Einfiihlung is the primitive, reflex process 
mentioned by Smith, Spencer, Ribot, and others. 
The term empathy is a fair translation, provided 
it is understood to mean only elementary motor 
mimicry and is not employed in the broad sense 
of “a gift of understanding people" as is some- 
times the case today. 
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2. Miteinanderfühlung calls attention to a 
condition of "simultaneous feeling," as in the 
case of two or more people reacting in a similar 
way to the same stimulus (e.g., an attentive and 
emotional cinema audience) . 

$. Gefühlsansteckung refers to the spread of 
feeling through social induction and facilita- 
tion, as in a mob, panic, or throng. It likewise 
includes McDougall's sympathetic induction of 
emotion. Such "transpathy" is still focused up- 
on the affective experience of the individual, 
and is not truly sympathetic. 

Higher levels of sympathy are found in: 

4. Einsfühlung, wherein occurs an identifica- 
tion of feeling: the child playing with a doll 
identifies with her mother. Citizens in wartime 
feel united in suffering and in aim. There is an 
especially high degree of such “unipathy” be- 
tween lovers who feel keenly the joys and sor- 
rows of each other. 

5. Nachfiihlung is far more conscious and 
detached, We encounter it in the statement, "I 
know just how you feel.” In such a case, how- 
ever, we distinguish clearly our own feeling 
from anothers. We may even add, "I know 
how you feel, but I wouldn't act as you are act 
ing.” This state of mind is Adam Smith's “in- 
tellectualized,” or Ribot's “reflective” sympathy. 

6. Mitgefühl may best be translated as "fel- 
low-feeling." It is phenomenologically distinct 
from all states of self-reference, and denotes that 
clear and normal condition where one person 
appreciates accurately another's feelings, and 
usually (though not necessarily) values them pos- 
itively. Fellow-feeling is the primary basis of 
social living — a product of phylogenetic evolu- 
tion. 

7. Menschenliebe is the level of sentiment 
recognized explicitly by Ribot and McDougall. 
One not only senses the other person's state of 
mind, but prizes it and respects it. Altruism and 
philanthropy belong to this class. 

8. Akosmistische Person- und Gottesliebe 
(unworldly sympathy) refers to the sense of mys- 
tical sympathy that characterizes some people's 


religious orientation to life as a whole — "a 
unity of all finite spirits in God." 

Scheler's phenomenological analysis has not 
been surpassed in subtlety. Besides being valu- 
able in its own right, it illustrates the important 
fact that psychological analysis depends to a 
considerable degree upon the categories of lan- 
guage available to the analyst. 

Empirical work, Yt was not until the decade 
of the 1930's that sustained attempts were made 
to reduce the problems of sympathy to an em- 
pirical basis. Freudian theory, World War I, 
and the prevailing temper of irrationalist 
theories had focused attention earlier upon 
aggressive behavior. But during the business 
depression interest in cooperative conduct rose. 
A summary of work prior to the late 1930's is 
found in May and Doob (1937) , and pertinent 
anthropological evidence is presented by Mead 
(1937) . 

Employing the method of observation, Lois 
Murphy (1937) analyzed five thousand episodes 
of behavior in a nursery school. While aggres- 
sive acts were much more numerous than sym- 
pathetic acts, she found that the latter occurred 
with considerable frequency, chiefly among chil- 
dren who felt psychologically secure, who them- 
selves had had an experience similar to that 
evoking their sympathy, and among children 
who came from certain types of cultural sur- 
roundings. 

In recent years considerable relevant work 
has been added, although the concept of sym- 
pathy or even of cooperation is seldom used, 
probably because their flavor is moralistic rather 
than scientific. But much of the current re- 
search in group dynamics, in industrial rela- 
tions, in the psychological aspects of group 
tensions, and in psychotherapy represents an 
empirically action-oriented extension of the 
movement we have here surveyed. The in- 
terested reader will find guidance in Leighton 
(1945), Pear (1950), Chase (1951), Maier 
(1952) , Sorokin (1950). 


IMITATION 


Social psychologists are required to explain 
the overwhelming fact of social conformity in 
human behavior. Parents set models to which 
children conform; fashions are models to which 
adults conform; and culture itself is a model to 
which everyone (or nearly everyone) conforms. 
No problem in social psychology is more in- 
sistent. The term “imitation” designates the 
problem — but does not solve it. 

Certain nineteenth-century writers, however, 
saw in imitation a simple and sovereign prin- 
ciple suited to explain all conformity. One 


"t 


single force was thought to provide both the 
motive and the means. Walter Bagehot (1873) 
represents this point of view: 


At first a sort of "chance predominance” made 
a model, and then invincible attraction, the neces- 
sity which rules all but the strongest men to imi- 
tate what is before their eyes, and to be what 
they are expected to be, moulded men by that 
model. (1875, p. 36) 


Imitation is thus the conserving agency in so- 
ciety accounting for the “cake of custom” that 
e 


22 HANDBOOK OF SOCIAL PSYCHOLOGY 


binds all but the strongest men. It accounts 
for the “sameness” in savage societies, for the 
mimicry of children, for the gregariousness of 
species. Since conformity behavior is the result 
of “invincible attraction” the really critical 
problem lies in invention and novelty. Inven- 
tive departures from conformity Bagehot ac- 
counted for by the activity of "strong men,” 
whom he also called “nation builders.” 

More extended is the treatment given by the 
French author Tarde, 1843-1904. For him, as 
for his British contemporary, imitation was “the 
key to the social mystery." Enthusiastically he 
asserts "Society is imitation" (1903, p. 74). 
Among Tarde's famous laws of imitation are the 
following: (a) The law of descent maintains 
that superior classes are imitated by the socially 
inferior. Fads, we might note today, start in 
Paris or on Park Avenue and die on the counters 
of the dime stores. (b) The law of geometrical 
progression calls attention to the rapid dissemi- 
nation of a fashion, rumor, or craze from the 
point of origin. (c) The law of the internal 
before the exotic helps to account for the fact 
that one's own culture is imitated in preference 
to foreign cultures. About the process of imi- 
tation itself Tarde has little to say. He likens 
it to a hypnotic dream state — to somnambu- 
lism, which runs its course under the dominance 
of photographic images formed by the model in 
people's minds. This formulation runs parallel 
to the concept of "suggestion," equally promi- 
nent in French social psychology of the same 
period (Charcot, LeBon) . 

Novelty and invention, according to Tarde, 
are the result of conflicting imitations. All op- 
position is a matter of clash between contrary 
models. Competition, discussion, war are the 
three great forms of social opposition; they 
arise because contrary models are being imi- 
tated. 

Unlike his countryman Durkheim, Tarde was 
an individualist. He once wrote to Baldwin in 
America, “your point of arrival is my point of 
departure,” by which he meant that Baldwin's 
treatment of imitation as a specific mechanism 
in each individual person is the foundation up- 
on which Tarde himself would build his more 
sweeping sociology of imitation. 

Baldwin found in imitation the key to mental 
development in the child (1895, 1897). His 
account calls attention to the fact that there 
seem to be at least two forms of imitation — 


nondeliberate and deliberate (we might say to-, 


day conditioned reflex and insightful). The 
process involves three stages. In the projective 
stage the child receives impressions of a model 
as a photographic plate receives an image. (The 


meaning of “projective” here differs from the 
modern use of the term.) In the subjective 
stage the child tends to assume the movements, 
strains, attitudes of the model. He is “a veritable 
copying machine” and cannot help doing so. 
It should be noted that at this crucial stage in 
the process Baldwin seems to rely on the little 
understood tendency to elementary motor 
mimicry. He also calls attention to the large 
amount of self-imitation in infancy — circular 
reflex activity — which makes for the repeti- 
tiousness of the child’s behavior. Finally, in the 
ejective stage the child reaches a comprehension 
of the model; he may recognize that he is acting 
like the other; he knows how the other feels. 
Here imitation is an avenue to knowledge of the 
alter. 

Mead (1934) saw clearly the deeper implica- 
tions of Baldwin's ejective stage. It helps to ex- 
plain how the individual comes to take the role 
of other people. We do more than imitate. We 
perceive what the other is doing (through ejec- 
tive imitation) but we also perceive our own 
response to it. What occurs is an interweaving 
process. Each act is the resultant both of our 
role assumption and of our self-perceptions. 
There follows, therefore, a "conversation of ges- 
tures” from which emerge mutual understand- 
ing and a continual accommodation and ad- 
justment of two people’s sets of interests. In 
the process, language is of the highest im- 
portance, and "significant symbols" become a 
substitute for gross motor adjustment. Mead's 
theory of socialization is not exclusively “imi- 
tationist"; yet his point of departure lies in the 
doctrine as expounded by Baldwin. 

Reference should be made to the tendency of 
many writers to include an “instinct of imita- 
tion” among the potpourri of motives postu- 
lated in man’s original nature. William James 
did so (1890). Bernard (1924) shows that the 
acceptance of an instinct of imitation as one 
innate motive among many others, has been a 
common but sterile practice in the history of 
social and psychological science. 

The writers we have mentioned, with the 
possible exception of Mead, regard imitation 
as having motivational force. Later writers 
tended to shift their attention away from the 
"why" of imitative behavior to the "how." Imi- 
tation is regarded merely as a means by which 
other motives reach their goal. McDougall per- 
ceived this issue clearly. He felt that, unlike 
true instincts, imitation had little or no urge 
character, and no specific goal. It seemed to 
him to be rather a “nonspecific innate tendency” 
(1908, Chapter 4). In this respect it should be 
classed with play, and also with “primitive pas- 
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sive sympathy” which we have previously dis- 
cussed. Yet McDougall was much troubled by 
the process of precision which enables a child 
(or a parrot) to accommodate his vocal sounds 
to a model; as well as by the manifest tendency 
of spectators to assume the postural strains of 
the dancers or athletes they are watching; also 
by the child’s tendency to take over the tensions 
of an adult into his own motor (and emotional) 
conduct. Blanton and Blanton (1927) empha- 
size the importance of postural tensions in the 
mother which tend to be assumed even by an 
infant in arms, so that a mother’s fear becomes 
the child’s fear. 

This process of empathy remains a riddle in 
social psychology. It would seem to be geneti- 
cally and conceptually basic to social learning 
and to lie at the heart of any theory of imitation. 
Some motor mimicry, as we have said, seems 
reducible to previous conditioning, but in other 
cases it appears to precede and to be a precondi- 
tion of learning. The nature of the mechanism 
is not yet understood. It is obvious that the 
process of perception itself entails motor adjust- 
ments (e.g. of the eyes) that are “imitative” of 
certain properties of the perceived object. This 
fact may give the clue we need. The problem 
is discussed further by G. W. Allport (1937, pp. 
530-533) and by Lipps (1907) . 

Most accounts of the “how” of imitation do 
not reach back to this simple empathic phe- 
nomenon. Many authors lean on the “ideo- 
motor theory,” which Cooley states as follows: 


It is a doctrine now generally taught by psy- 
chologists that the idea of an action is itself a 
motive to that action and tends intrinsically to 
produce it unless something intervenes to prevent. 
This being the case, it would appear that we 
must always have some impulse to do what we 
see done, provided it is something we understand 
sufficiently to be able to form a definite idea of 
doing it. (1902, pp. 62 f) 


The ideo-motor theory is no longer “generally 
taught." But its potential value as an explana- 
tion of imitation (if it were true) is great. It 
merely says that all ideas press toward expres- 
sion, Since many of our "ideas" come while we 
are observing other people (or things) it follows 
that we tend to act out (imitate) our perception 
of the stimulus pattern. While we cannot say 
that this theory is without merit, it is coarse and 
needs considerable refinement and moderniza- 
tion. 

When idea-psychology declined and behavior- 
ism arose an explanation of imitative behavior 
was sought in the properties of the conditioned 
reflex, Classical (Pavlovian) conditioning is 
represented in the theories of Humphrey (1921), 


F. H. Allport (1924), and Holt (1931). Let us 
take Holt's statement of the matter in his for- 
mulation of what he calls the "echo principle." 


A child will learn to echo back any action of 
another provided that another's performance of 
the act stimulates any of the child's sense organs 
at a moment when the child is engaged in a (ran- 
dom) performance of the same act. (1931, p. 
112) 


To illustrate, let us take the infant’s early game 
of pat-a-cake. The theory assumes that first the 
child is moving his hands randomly in a clap- 
ping manner. The fact that he does so repet- 
itively is an important initial fact, for the 
circular reflex involved provides a longer base 
in time for the conditioning to take hold. Per- 
ceiving this action, the parent likewise claps his 
hands (note that the parent is the first “imita- 
tor") and says "pata-cake." Thereafter the 
child tends to clap his hands when he sees the 
action, or hears the words “‘pat-a-cake.” 

While trying to keep the simplicity of a stimu- 
lus-response view of the matter, later authors 
regard the formula of classical conditioning as 
inadequate, and prefer to look at the phenome- 
non in terms of instrumental conditioning or 
"reinforcement." According to this approach, 
imitation is learned because it brings rewards 
and satisfactions. The young boy, for example, 
will develop a generalized tendency to imitate 
his brother when he finds that candy, comfort, 
and other desirable goals follows his (at first 
accidental) imitation of his brother's behavior. 
This “matched-dependent” relationship is re- 
garded by Miller and Dollard (1941) as ac- 
counting for most forms of imitation. It 
encounters some difficulties when applied to the 
process of deliberate conscious copying, and 
does not seem to apply to the elementary phe- 
nomenon of motor mimicry in infants (prior 
to reward or satisfaction) . 

Gestalt psychologists (Kóhler, 1927; Asch, 
1952, Ch. 16) doubt that a conditioned imitative 
tendency can be established even with the aid 
of reward. Apes as well as human beings seem 
normally to require a comprehension of the 
means-end relationship represented by the 
model before they adopt it. An organism di- 
rected to a goal will reproduce in the motor 
sphere what he sees in the model only if he has 
familiarity with the situation and sees the rele- 
vance of the model for his goal. In this type 
of theory we are dealing with conscious and 
deliberate copying, which to some psychologists 
presumes “insight,” or some other “cognitive” 
principle of learning. 1 

Turning from the cognitive to the emotional 
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level, we encounter the familiar concept "'iden- 
tification.” The term obviously refers to affec- 
tive imitation. The boy identifies with the 
father, the bobby soxer with an actress, the 
Negro with his race, and the humanitarian with 
mankind. This broad term is the key concept 
in psychoanalytic explanations of socialization 
(conformity) of the child. He is said to learn his 
morality and basic character-structure through 
emotional imitation of the parent and other au- 
thority figures. Freud (1921) makes the in- 
teresting suggestion that people who have no 
particular emotional significance for us are 
understood through empathy, but those who are 
of emotional value to us are understood through 
identification. 

Finally, we call attention to the importance 
of imitation in cultural anthropology. Wissler 
has said that without assuming some imitative 
process it would be hard to imagine how men 
could possess a culture, “for it must be perpetu- 
ated by the imitation of the older by the younger 
members of the group" (1923, p. 206). The 
"diffusionists" in anthropology are accused of 
exaggerating the extent of imitative adoption 
of culture traits. Anthropologists who argue for 
"multiple origin" say that much apparent imi- 
tation is specious. They say that similarly con- 
structed people with similar needs would make 
the same inventions and do the same things — 
without imitation being involved. The same 
caution can extend to everyday behavior. Chil- 
dren and adults may do the same thing under 
similar provocation without in reality imitating 
at all. 

Summary. The reader who finds this survey 
uncomfortably condensed will profit from read- 
ing the second Appendix to Miller and Dollard, 
Social Learning and Imitation (1941) . 

Imitation is a protean concept. Its definition 
must inevitably remain exceedingly broad. The 
term refers to any occasion where a stimulus 
situation gives rise to motor activity of a sort 
that resembles the stimulus situation. Since 


various types of processes may have this broad 
result, we may well suspect that there is no 
single entity. Also we may suspect that no one 
“learning theory" will account for the various 
forms of imitation that grow up in human con- 
duct. Investigators and writers have called at- 
tention to at least five apparently distinct 
mechanisms that may be involved: 

(1) Motor mimicry, empathy — basically a 
perceptual motor reaction at present not 
fully understood (Lipps, McDougall, 
Blanton and Blanton) . 

(2) Classical conditioning, including the 
echo principle (Holt, Humphrey, F. H. 
Allport) . 

(3) Instrumental conditioning, emphasizing 
the rewards involved in engendering 
either specific or generalized imitative 
habits (Miller and Dollard) . 

(4) Cognitive structuring, including all in- 
stances of deliberate copying, and in- 
sightful reproductions (Kóhler, Asch) . 

(5) Identification — having at present vari- 
ous meanings, but emphasizing the emo- 
tional disposition of the whole person to 
resemble some model (Freud) . 

While it is not impossible that these processes 
may in time be successfully reduced to a single 
formula, the prospect seems most unlikely. 

As for earlier writers on the subject, we may 
declare that their interest lay primarily in the 
societal consequences of imitation. Though they 
tended to postulate a sovereign motive and 
mechanism, their real concern was with the so- 
cial results of conformity (Bagehot, Tarde, and, 
to some extent, Baldwin, Mead, Cooley) . Even- 
tually, of course, the sociology of conformity 
will receive illumination from the mechanisms 
of imitation when these are more fully under- 
stood, but for the time being it is well to view 
the problems as distinct. The sociology of con- 
formity must wait for an improved psychology 
of the processes that result in similar or same 
behavior. 


SUGGESTION 


As we have pointed out, sympathy, imitation, 
and suggestion are the Big Three. Between 
them these concepts have dominated much of 
the systematic social psychology of the past. The 
differences among them, as we have said, reflect 
the preference of authors for emphasizing the 
functions of affection, conation, or cognition, 
respectively. ; 

The cognitive flavor of the term "suggestion" 
stands out clearly in McDougall's definition: 


Suggestion is a process of communication result- 


ing in the acceptance with conviction of the com- 
municated proposition in the absence of logically 
adequate grounds for its acceptance. (1908, p. 
100) 


McDougall held that the dynamics of the proc- 
ess lay in men's instinct of submission; this 
instinct is aroused by any person or symbol hav- 
ing prestige, and accounts for our yielding to 
communicated propositions “in the absence of 
logically adequate grounds." 

Historically considered, suggestion is the most 
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important of the three concepts. This is true 
even though its impact on social psychology was 
not felt until about 1890. But so persuasive at 
that time was the work of Charcot, LeBon, 
Sighele, Sidis, and others, that nearly all the 
problems in social psychology became formu- 
lated both for scientists and for laymen in terms 
of suggestion. Even today the layman inclines 
to think of the whole field of social behavior as 
a matter of "mass mind," "mob hysteria," 
“power of suggestion.” The concept brought 
about a marriage between abnormal and social 
psychology. While the union has cooled, it has 
not entirely dissolved. 

The process of suggestion has been explained 
in various ways, chiefly in terms of: 


animal magnetism (mesmerism) 

ideo-motor response 

dissociation of consciousness 

association of ideas, redintegration, condition- 
ing 

reduction of determining tendencies 

identification 

cognitive restructuring 


These explanations pertain to both suggestion 
and hypnotism, for most writers regard hyp- 
notism as nothing more than an extreme in- 
stance of the suggestive process. The early 
history of these concepts is well reviewed by 
Boring (1929) . 

Animal magnetism was the vague formula 
offered by Mesmer (1779). By “magnetizing” a 
wand or a tub Mesmer found that when subjects 
in his seances touched these instruments they 
fell into trances and became obedient to his 
voice. Even in his day Mesmer's magnetic ex- 
planation was severely questioned. Yet his sub- 
jects did go into trances, and cures of neurotic 
ills did result. Mesmerism was a fact, however 
queer its trappings and theory. 

Ideo-motor response. Braid (1843) rejected 
Mesmer, and invented the term "hypnotism" to 
describe the phenomenon under dispute. He 
noted the importance of sensory fixation. Hyp- 
notism must start with a limitation of the field 
of consciousness. Sensory fixation is followed by 
a restriction of ideas, a process he called mon- 
oideism. An idea firmly planted in the patient's 
mind takes root and issues into motor behavior. 
Surgeons of the day were greatly interested, for 
in the era prior to anaesthetics it seemed a hope- 
ful possibility that ideas planted in a patient's 
mind through hypnosis might dull the pain of 
the knife. 

It is important to recall here that much nine- 
teenth-century psychology was dominated by the 
concept of “idea.” One heard of the mechanics 


of ideas, of the association of ideas, and of ideo- 
motor action. Herbart, Kant's successor at 
Kónigsberg, was a dominating figure. His in- 
fluence on educational practice was immense: 
nineteenth-century educational psychology be- 
came largely a matter of planting the right ideas 
(e.g., via copybooks) in children's minds, hoping 
that such ideas would issue into right actions. 
In this period Michael Faraday (1853) tried to 
improve "the morbid condition of public 
thought" by pointing out that table tipping and 
other spiritualistic phenomena could easily be 
explained by the subtle effects of the idea upon 
physical movement. The direct issuing of ideas 
of movement into action became known as 
dynamogenesis, and was regarded as a special 
case of the ideo-motor theory. The high repute 
of the theory as late as the time of William 
James is seen in his enthusiastic endorsement: 


We may then lay it down for certain that every 
representation [idea] of a movement awakens in 
some degree the actual movement which is its 
object. (1890, II, p. 526) 


Liébeault (1866) published an important 
work in which he explicitly introduced the con- 
cept of suggestion, insisting that the process was 
identical with hypnotism, and that all people 
could under favorable circumstances be hypno- 
tized. To him suggestion was merely the acting 
out of an idea planted with rapport in a field 
of limited attention. This emphasis on rapport 
anticipates McDougall's claim that the submis- 
sive instinct is aroused in all cases of suggesti- 
bility, and also Freud's still heavier stress on 
identification as essential to the state of rapport 
that makes suggestible behavior possible. 

Bernheim (1884) followed Liébeault in his 
thinking, and througn his direction of the 
Nancy school of psychiatry became influential 
in establishing suggestive psychotherapy based 
on the view that suggestion is an essentially nor- 
mal ideo-motor process. 

Dissociation of consciousness. Opposition to 
the theories of the Nancy school sprang up in 
Paris at the school of Salpétriére, headed by 
jean Martin Charcot (Oeuvres complétes, 1888- 
1894). Asa persuasive and dramatic clinician, 
Charcot demonstrated to gaping audiences the 
phenomena of the hysteric splitting of personal- 
ity under hypnosis. Unlike the Nancy school, 
he maintained that only hysteric personalities 
could be hypnotized (hence hypnosis and nor- 
mal suggestion do not form a continuum) . 
Some of Charcot's students, e.g., Pierre Janet 
and Boris Sidis, modified this extreme position. 
Sidis (1898) , for example, regarded hypnotism 
as “abnormal suggestibility,” marked by deep 
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dissociation and disaggregation of conscious- 
ness. By contrast “normal suggestibility" falls 
short of true hypnotism, but is also a matter of 
splitting off (to a slighter extent) the waking, 
guiding, controlling, guardian consciousness 
from the automatic, reflex, subconscious self. 
Abnormal suggestibility can be induced through 
direct commands ("You are falling asleep"), 
whereas the normal person tends to resist direct 
command and to respond only to subtle indirect 
suggestions introduced at "the psychological 
moment" (Tommy, it’s bedtime”). Sidis sums 
up his views under three laws: 

1. Suggestibility varies as the amount of dis- 
sociation and inversely as the unification 
of consciousness. ) 

2. Normal suggestibility varies with indirect 
suggestion and inversely with direct sug- 
gestion. 

3. Abnormal suggestibility varies with direct 
and inversely with indirect suggestion. 

Charcot's influence on clinical psychiatry was, 
of course, very great, both through students who 
followed his teaching closely (e.g., Morton 
Prince) and those who departed from it, or re- 
acted against it (Janet, Sidis, Binet, Freud). 
Equally great was his influence upon social psy- 
chologists, who found in the doctrine of disso- 
ciation an explanation for crowd phenomena, 
mobs, mass hysteria, demagogic leadership. Per- 
haps the most influential book ever written in 
social psychology is LeBon's The Crowd (1895) . 
It is a direct product of Charcot’s teaching. 
Later we shall speak of it more fully. 

This school of thought took a grim view of 
man’s subconscious nature. Whether released 
through hypnosis, hysteric seizure, or mob con- 
ditions, the automatic, subconscious self is an 
antisocial actor. In 1886 Robert Louis Steven- 
son immortalized the Charcotian view in The 
Strange Case of Dr. Jekyll and Mr. Hyde, clear 
evidence that the dramatic teachings of the Sal- 
pétriére school had by that time become inter- 
national public property. A decade or two later, 
in spite of Freud's rejection of Charcot's formu- 
lations, his concept of the unconscious Id took 
over the same essential function as Charcot's 
split-off subconscious self. This concept likewise 
became public property, and has helped main- 
tain the supremacy of irrationalism in clinical, 
academic, and popular psychology to the present 
day. 

dëse about 1890, many writers, as we have 
said, enthusiastically applied the concept of dis- 
sociation to social behavior, thus bringing about 
the marriage between abnormal and social psy- 
chology. Charcot's pupil, Morton Prince, found 
it entirely natural to enlarge the scope of the 
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Journal of Abnormal Psychology which he had 
established in 1908, and to rechristen it in 1922 
as the Journal of Abnormal and Social Psychol- 
ogy. He regarded the two fields as indissolubly 
linked. 

Association, redintegration, conditioning. A 
more conventional type of theorizing keeps sug- 
gestion safely within the confines of simple asso- 
ciationism. A stimulus "suggests," i.e., touches 
off, a particular idea (determining tendency, 
brain path) that leads to a particular response. 
For a fisherman a sunny day in springtime “sug- 
gests” fishing. The Scottish psychologist, 
Thomas Brown (1820), employed the term 
"suggestion" as wholly equivalent to ‘“associa- 
tion.” Bernheim (1884) of the Nancy school, 
while recognizing the importance of rapport 
and of ideo-motor response, also agreed that 
every impression, every mental picture, every 
association was a suggestion. Only slightly dif- 
ferent is Titchener’s statement that, “A sugges- 
tion is any stimulus, external or internal, 
accompanied or unaccompanied by conscious- 
ness, which touches off a determining tendency” 
(1916, p. 450) . 

A variant of this view makes suggestion a mat- 
ter of redintegration. A previously formed dis- 
position of a highly complex order can be 
rearoused as a whole by an associated cue. The 
sunny springtime day is the cue: “going fishing" 
is the complex activated response. Hamilton 
(1859) first called attention to this part-arous- 
ing-the-whole relationship in association. More 
recently Hollingworth (1920) used it exten- 
sively to account for suggestibility in both nor- 
mal and abnormal subjects. 

The concept of the conditioned response is 
closely related to this line of thought. Sugges- 
tions are conditioned cues, and the process 
involved is simply the reactivation of a motor 
pathway to which the cue has been conditioned. 
“Tom, go to bed” (direct suggestion) or “Tom, 
it is bedtime” (indirect suggestion) are cues 
intended to reinstate "Tom's nocturnal habit 
systems. This view is found in all behavioristic 
treatments of social phenomena, such as those 
of F. H. Allport (1924), Bechterew (1932) , Hull 
(1934) . 

The manifest difficulty with these treatments 
is that they fail to distinguish suggestion from 
any other process of association of ideas, or 
from any other stimulus-response sequence 
While it undoubtedly is true that suggestion 
, should properly be viewed as a subclass of asso- 
ciative phenomena, still to have distinctive 
value as a concept there must be some differen- 
tia specifica to give it its peculiar complexion 
among such phenomena. 
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Reduction of determining tendencies. The 
differentiation required is stated in Warren's 
Dictionary (1934): 


A suggestion is a stimulus, usually verbal in 
nature, by which one individual seeks to arouse 
activity in another by circumventing the critical, 
integrative functions. 


We have already cited McDougall’s similar defi- 
nition, which refers to the acceptance of a 
proposition without logically adequate grounds. 
In these and in similar definitions, attention is 
called to some arbitrary restriction in the de- 
terminants of behavior. The individual is not 
employing all relevant ideas, nor his full intelli- 
gence. He is for the moment acting without full 
selfdetermination. Granted that suggestion 
proceeds according to the laws of association 
(conditioning) , still we must allow also for the 
blocking of normal associations, so that the end 
result in behavior is due to a selected field of 
determinants. Braid, we have noted, labeled 
this restriction "monoideism." When we speak 
of "reduction of determining tendencies" we are 
not, of course, referring to the sheer numbers 
of associations present, but to a functional re- 
striction of the brain field, so that the “critical, 
integrative" functions are circumvented. 

Let us take the case of a man who buys a suit 
of clothes under the persuasion of a silver- 
tongued salesman. The seller points out its vir- 
tues in terms of fashion, flattering fit, and com- 
fort. Bombarded by these arguments the 
customer yields to the persuasion. But the 
buyer has not given due weight to equally rele- 
vant considerations that might have blocked the 
purchase: expense, specific wardrobe needs, 
and personal preference. Even though the sales- 
man may have introduced more determining 
tendencies than the buyer initially possessed, 
yet functionally speaking, the salesman has re: 
duced the effective determinants. "The buyer's 
purchase is, therefore, partly, if not altogether, 
the result of suggestion. 

Around the turn of the century a great deal 
of psychological thought and writing was or- 
dered to the concept of suggestion. Many books 
and articles dealt with automatisms, dissocia- 
tion, hysteria, and like matters. Against this 
background Binet (1900) made two significant 
contributions. He reacted against hypnosis, 
agreeing with Wundt that its use was immoral, 
and sought instead for “inoffensive” methods by 
which to explore "normal suggestibility.” In 
so doing he leaned away from Charcot and to- 
ward Bernheim. His second contribution had 
to do with his invention of experimental meth- 
ods. He presented children with lines, with 


drawings, with questionnaires in such a way 
that some strong “idées directrices" were planted 
in their minds. Later they were asked to repro- 
duce the stimulus, with the result that specific 
distortions occurred in the direction of the im- 
planted idea. In group behavior he observed the 
effects of prestige among children. The leaders 
in his experimental groups of children proved 
to be less suggestible than the followers. Owing 
largely to Binet's efforts, suggestion became a 
standard problem for academic psychology as 
well as for clinical. 

One favorite problem had to do with the con- 
ditions of suggestibility. Who is suggestible? 
The answers given, as summarized by Ross 
(1908) include several dubious assertions. Cer- 
tain species of animals (e.g., sheep) are more 
suggestible than others; certain "races" (French- 
men and Slavs) are more suggestible than others 
(Anglo-Saxons) ; children more so than adults; 
women more so than men; people are sug- 
gestible if they are temperamentally absent- 
minded or in a state of emotion, such as 
jealousy; they are vulnerable toward sources 
possessing prestige; and are especially sugges- 
tible in crowds, or toward often repeated asser- 
tions. 

There is something rigid about this approach. 
Krech and Crutchfield (1948, p. 333), for ex- 
ample, argue that suggestibility is not a general 
trait. No woman is uniformly more suggestible 
than a man. Indeed, in areas where she is well 
informed (e.g, in shopping or child rearing) 
she is probably less suggestible than her hus- 
band. 

Cantril has clarified the matter. He states 
that suggestion occurs especially when the in- 
dividual is confronted by a “critical situation” 
where he cannot readily make a decision. He 
will accept the suggested proposition for belief 
(and action) : 

(1) if he has no adequate mental context of 
his own for the interpretation of the event; or 
(2) when his mental context is so rigidly fixed 
that the suggestion automatically arouses this 
context (e.g, a prejudice) and the person fails 
to examine the situation in its own right. The 
first condition results from bewilderment; the sec- 
ond from the “will to believe." (1941, pp. 64 f.) 


To paraphrase this view, people will act un- 
critically (with a reduction of determining 
tendencies) if they are relatively unfamiliar 
with a topic, unaccustomed or unable to check 
up on the suggestion offered to them — in short, 
if their mental organization is unstructured. 
They are likewise prone to accept the suggestion 
if it fits into their rigidly structured stereotypes, 


thus nourishing and calling into action preex- 
. 
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isting attitudes, prejudices, beliefs. An “un- 
stuck” mind and an “overstuck” mind both 
favor suggestibility. 
It would not be hard to cast the entire psy- 
chology of crowd control, demagoguery, and 
propaganda into this framework of thought. 
The devices used are often of a type to “un- 
structure” people’s minds. Demagogues call 
attention to the bad state of affairs, to the mal- 
feasance of government officials, or to the 
threats of poverty, ill health, unpopularity. 
When people are sufficiently unsettled by this 
barrage, the propagandist states his remedy. It 
is then likely to be accepted as a way out of 
the confusion. Or, still more frequently, dema- 
gogues appeal directly to people’s “overstuck” 
convictions. The propagandist, orator, or ad- 
vertiser simply associates his product (or mes- 
sage) with pre-existing idols (sentiments, preju- 
dices, convictions) and thus has it accepted 
because it is congruent with the person's pre- 
existing desires and aversions. George Bernard 
Shaw once defined propaganda as “the organi- 
zation of idolatory” (the organization of the 
propagandist's message with an old idol). 
Identification. McDougall, as we have seen, 
invoked the submissive instinct as a necessary 
link in the suggestion process. The Nancy 
school felt that good rapport was necessary be- 
tween suggestionneur and suggestionné. Freud 
felt both these views to be pallid. Thoroughly 
familiar with the theories and therapy of Nancy 
and Salpétriére, he rejected completely their 
concepts of suggestion and hypnosis. "There are 
deeper reasons, he said, why a leader attracts 
followers and is able to mold their behavior. 
It is because they find in him a love object, one 
who relates himself to their thwarted sex needs 
(1921) . In the case of male groups and crowds 
(e.g., Nazi storm troopers) a deeply repressed 
homosexual interest is revealed through their 
passivity toward, and identification with, the 
leader. What has been called suggestion, espe- 
cially in mass situations, is in reality a conse- 
quence of identification. The theory has been 
more fully explained, with special reference to 
the leader-follower relation in wartime, by 
Waelder (1939). Its merit lies in calling atten- 
tion to the importance of emotional attachments 
in bringing about obedient behavior. It helps 
also to account for the fact that members of 
groups, through a common focus of identifica- 
tion, also identify with one another. It used to 
be said in Germany that there is no such thing 
as a “single Nazi.” Only with the support of a 
group does the peculiar subservience to the 
leader and his ideology take possession of the 
individual. 


As cognitive restructuring. In recent years 
considerable skepticism has been expressed con- 
cerning the utility, and even the validity, of 
the concept of suggestion. Asch, for example, 
has declared flatly that “the case for the doctrine 
of suggestion has not been proven” (1948, p. 
251). 

The argument turns on the question whether 
there is any real difference between conduct 
called suggestible and conduct that is said to 
show critical ability or full self-determination. 
Another way to state the issue is in terms of 
“rationality.” Is the person who seems to us to 
be suggestible, in reality behaving any less ra- 
tionally than the person who seems to us to be 
nonsuggestible? From their own points of view, 
both are probably acting in an intelligent way, 
making the best use they can of the totality of 
information that confronts them. 

One experimental example will suffice to etch 
this new point of view (which derives from 
phenomenology and Gestalt psychology). Sub- 
jects are asked to think about the following 
passage, and are told that its author is Thomas 
Jefferson: 


I hold it that a little rebellion, now and then, 
is a good thing, and as necessary in the political 
world as storms are in the physical. 


When asked whether they agree with the senti- 
ment expressed, and what it really means to 
them, subjects generally approve it, and in- 
terpret the word “rebellion” to mean somewhat 
minor agitation, But when subjects are told 
that the author is Lenin, they ordinarily re- 
pudiate the statement, and interpret the word 
“rebellion” to mean violent revolution (Asch 
1952, pp. 419-425). 

Just what is happening here? The doctrine 
of suggestion would say that people hold previ- 
ous emotional attitudes toward Jefferson and 
Lenin, the former being "good," the latter 
"bad." Hence the statement when ascribed to 
Lenin is rejected, for it is not viewed critically 
in its own right, but is responded to only in 
terms of one's prejudice against Lenin. ‘The 
selfsame statement is accepted when it “redinte- 
grates" one's veneration for Jefferson. Thus the 
person is suggestible, reacting not to the state- 
ment in its own right, but on the basis of pre- 
formed attitudes toward its presumed author. 

The cognitive theorist, however, points out 
that the author's name altered the whole mean- 
ing of the stimulus. This fact is shown by the 
subject's tendency to interpret "rebellion" in 
different ways. If Jefferson wrote the passage 
he could only have meant by it mild agitation; 
if Lenin, he surely had in mind violent revolu- 
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tion, Hence it is not the “selfsame” passage, 
evaluated differently because of the suggestive 
force of the authors names. Rather, two dis- 
similar stimuli are evaluated rationally Oe, 
according to the true meaning of the stimulus in 
context). As Asch puts it, it is not the judgment 
of the object that is altered (through sugges- 
tion), but the object of judgment is itself al- 
tered, and is interpreted objectively and ration- 
ally. 

Whether this line of argument can be ex- 
tended so far as to eliminate the need for the 
distinctive concept of suggestion is doubtful. 
Even in the case cited it seems clear that 
people’s prejudices for Jefferson or against 
Lenin are active factors in the process of in- 
terpretation, and that there is a tendency there- 
fore to react to the statement not with all one’s 
critical associations, but primarily on the basis 
of the pattern of reduced determining tenden- 
cies activated by the author’s name. 

The term “cognitive restructuring” does in- 
deed describe (phenomenologically) what seems 
to happen in all cases of suggestion. We per- 
ceive and interpret matters differently in dif- 
ferent circumstances. But whether “cognitive 
restructuring” is an explanatory concept is a 
question. It serves to warn us that much be- 
havior that seems suggestible and irrational to 
others is not so from the actor’s point of view. 
He is doing what he can in the given circum- 
stances, Yet the fact remains that in certain 
circumstances his behavior is far more critical 
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and self-determined than in other circumstances. 
Hence, we still need the concept of suggestion 
to designate the distinction in behavior that re- 
sults from a narrow, rather than a full, set of 
determinants. à 

Summary. Most, if not all, of the views here 
surveyed contain some truth. We start with the 
simple observation that in certain circumstances 
people act with less critical capacity than in 
others. When their critical sense is wholly or 
partly in abeyance we are likely to find a high 
degree of dominance of one idea or image 
(monoideism). Dynamogenic tendencies may 
also be present. Redintegration and condition- 
ing help touch off the decisive determiners of 
conduct. In extreme cases there may be a hys- 
teric splitting of the personality. In all cases a 
cognitive restructuring takes place, in the sense 
that the individual organizes the determinants 
that are acting upon him. This organization 
can often be controlled from outside if the 
proper conditions are present (e.g., an unstuck 
or overstuck mind), and if leaders or manipu- 
lators use adroit appeals, aided perhaps by 
bright trappings (sensory fixation), and elo- 
quence. Rapport, identification, a submissive 
attitude, and habits of obedience greatly facili- 
tate the process. 

'Thus suggestion is a complex phenomenon. 
We need all the insights that history has be- 
queathed us to construct an adequately compre- 
hensive theory. 


Crowp 


Plato feared democracy because he feared the 
irrationality of men. In relative solitude they 
may deliberate wisely, but taken herdwise their 
reasoning is defective. "Had every Athenian 
citizen been a Socrates, every Athenian assembly 
would still have been a mob.” While this sus- 
picion of human collectivities is ancient and 
recurrent, it was not until the 1890's that a sys- 
tematic theory of crowd behavior became avail- 
able. 

Le Bon's reliance on suggestion, considered as 
a hysteric splitting of personality, is seen in the 
following quotation. In a crowd, he writes, 


the individual may be brought into such a con- 
dition that, having entirely lost his conscious 
personality, he obeys all the suggestions of the 
operator who has deprived him of it, and commits 
acts in utter contradiction with his character and 
habits. (1896, p. 34) 


Crowd conditions release deep prejudices, racial 
tradition, and brute instinct. The crowd man 


is no longer conscious of his acts. In his case, as 


in the case of the hypnotized subject, at the same 
time that certain faculties are destroyed, others 
may be brought to a high degree of exaltation. 
Under the influence of a suggestion, he will un- 
dertake the accomplishment of certain acts with 
irresistible impetuosity. (p. 35) 


People in crowds and crowd leaders are given to 
action, never to critical thought. When a man 
joins a crowd he “descends several rungs in the 
ladder of civilization. Isolated, he may be a 
cultivated individual; in a crowd, he is a bar- 
barian, that is, a creature acting by instinct” (p. 
36). Since Le Bon believed in the excessive 
suggestibility of women and of certain races, 
he adds, “Crowds are everywhere distinguished 
by feminine characteristics, but Latin crowds 
are the most feminine of all" (p. 44). 

Having applied the theory of dissociation to 
«rowd member, Le Bon next considers the 
attributes of the crowd leader. He is a person 
who knows how to invoke pointed and clear 
images "freed from all accessory explanation." 
Crowds act only on the basis of "image-like 
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ideas.” While these may be accidental and pass- 
ing — such as an auctioneer or barker might 
employ to make a sale — more often they are 
fundamental, deep-lying convictions of an in- 
stinctive or quasi-religious order. But in any 
event the leader will deal only in image-like 
ideas that point directly to action (ideo-motor 
theory). The leaders immediate tool in the 
control of ideas is the well-chosen word. “The 
word is merely the button of an electric bell that 
calls them up” (p. 118). The word evokes the 
image, the image evokes a sentiment, the senti- 
ment leads to action. Crowds “are not influ- 
enced by reasoning, and can only comprehend 
rough-and-ready associations of ideas. . . . The 
laws of logic have no action on crowds” (p. 
128). The clever leader will obey the principles 
of affirmation and repetition. He will assert, not 
argue; and above all he will, as Hitler later also 
advised, give his proposition thousandfold repe- 
tition. In discussing the elementary principles 
of crowd control Le Bon set the stage for mod- 
ern work on demagogic leadership (cf. Lowen- 
thal and Guterman, 1949) and on the rules of 
propaganda (cf. Doob, 1935). 

It is obvious that Le Bon had a low opinion 
of the crowd-man. He saw crowds as a menace 
to established institutions. Crowds flourish as 
civilization declines. He wrote “The age we are 
about to enter will in truth be the era of 
crowds.” Masses are bent on destroying society 
as we know it and harking back to “primitive 
communism,” Was Le Bon thinking propheti- 
cally of the fascist and communist upsurges of 
the twentieth century? Yet for all his apprehen- 
sion and conservatism Le Bon was unwilling to 
condemn democracy. While he deplores the 
irrationality of electoral crowds, of juries, of 
parliaments, he thinks it well to risk their errors, 
for in their way crowds do express the basic 
aspirations of the race, and at times act with a 
rough heroism. 

Sighele’s evaluation differed. This Italian 
writer — whose basic theories were so much like 
Le Bon's that they fell into a great dispute over 
priority — said that democracy, because of its 
crowdish tendency, was an evil. As an Appendix 
to his Psychologies des Sectes (1895) he pub- 
lished an essay Contre le Parliamentairisme 
which laid one of the cornerstones for later 
fascistic ideology. Parliaments were crowdish — 
away with them! Curiously enough Sidis (1898), 
an American, drew opposite conclusions from 
similar premises. Like Le Bon and Sighele, he 
held suggestion to be a basic evil, but he felt 
that democracy could and should combat its 
ravages through proper education, so that 
eventually the decisions of the electorate and of 


parliaments might be reached on a more ra- 
tional basis. 

There is a systematic character in Sighele's 
work that merits attention. His three volumes 
form a logical series: Le crime à deux (1893), 
La foule criminelle (1891) , and Psychologie des 
sectes (1898). All are saturated with the evils 
of suggestion. Man’s subconscious nature is 
bestial and criminal. Hence suggestion and 
criminality are inevitably linked. Like Lom- 
broso and many of his countrymen, Sighele was 
captivated by the study of crime and other 
abnormal manifestations of social life. But 
what is noteworthy about this author is his or- 
derly conception of social psychology. He deals 
with the dyadic relationship — two individuals 
bound together as suggestionneur and sugges- 
tionné, also with the larger transitory collective 
of the crowd, and eventually with more or- 
ganized and enduring groups or sectes. No other 
social psychologist seems to have focused his 
work at the basic level of the dyad, and then 
tested his views in relation to progressively 
larger groupings. The idea is a good one. The 
dyadic relationship has, however, received ex- 
tensive treatment by Simmel (1950). 

Both Le Bon and Sighele maintain that crowd 
phenomena do not necessarily require physical 
proximity. While most crowds are congregate, 
they may (especially in these days of radio com- 
munication) be consociate. Even in his day Le 
Bon observed that 


Thousands of isolated individuals may acquire 
at certain moments, and under the influence of 
certain violent emotions — such, for example, as 
a great national event — the characteristics of a 
psychological crowd. (1896, p. 27) 


While it is certainly true that mass phenomena 
are ordinarily found only in congregate crowds, 
we must allow for exceptions. There are on rec- 
ord at least two cases of a consociate panic — the 
result of Orson Welles’ broadcast The War of 
the Worlds in the United States in October, 1938 
(Cantril, 1940), and in Equador in February, 
1949 (Britt, 1950, Section 31) . 

The combined influence of Charcot, Le Bon, 
Sidis, Sighele, Tarde, blew like a gale upon the 
American sociologist, E. A. Ross. His Social 
Psychology (1908) brought to a climax this 
whole vigorous movement. While Ross deals 
with many of the consequences of suggestion 
(fashions, crazes, conformity, custom), he re- 
gards crowds as the crowning mass phenome- 
non. In them the strength of “multiplied 
suggestion" is at its maximum. The individual 
is "helpless to control his position or mave- 
ments.” Ross's account of crowds is strictly in 
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the tradition we have described, and shares its 
atmosphere of melodramatic exaggeration. 

Turning our attention for a moment from 
congregate crowds to the cognate phenomenon 
of “crazes,” let us list briefly Ross’s laws (1908, 
p. 76): 


l. A craze takes time to develop to its height. 

2. The more extensive its ravages, the 
stronger the type of intellect that will 
fall prey. 

3. The greater its height the more absurd the 
propositions that will be believed. 

4. One craze is frequently followed by an- 
other. 

5. Dynamic society (e.g, American) is more 
craze-ridden than one in the ruts of cus- 
tom. 

6. The higher the craze the sharper the re- 
action against it. 

7. Ethnic or mental homogeneity is favorable 
to the craze. 


Let the reader ask himself the following ques- 
tions, What would be a good example to illus- 
trate each of these various laws? On what basis 
did Ross devise them? Are they well-framed 


scientific propositions? Why, or why not? If 
the reader will take the time and trouble to 
carry through this exercise, he will then be in a 
position to evaluate properly the influential sug- 
gestion-imitation school of thought that we have 
been discussing. 

E. D. Martin's The Behavior of Crowds (1920) 
leans on both the Le Bon-Ross tradition and on 
Freud. From the former the author derives the 
conviction that "the crowd mind is a phenome- 
non which should best be classed with dreams, 
delusions, and the various forms of automatic 
behavior" (p. 19). From the latter he derives 
his belief that crowds are agents for the release 
of repressed impulses. There are hunger crowds, 
fear crowds (panics) , aggressive crowds (mobs) , 
and other emotional masses, each one corres- 
ponding to some one long-standing repression 
that is struggling for an outlet. Under favorable 
conditions crowds offer a channel for their re- 
lease. In this sense, therefore, crowds are a safe 
way for men to "go crazy together. While 
Freud (1921) would agree in part, we have seen 
that his theory of suggestion and crowd requires 
the added attribute of "identification" between 
the tnembers and their leader. 


Tue Group MiND 


Many authors, past and present, would agree 
that the interaction of individual minds pro- 
duces a common manner of thinking, feeling, 
and willing, different from that of single minds 
in isolation, or from the mere summation of 
minds. 

Yet the phrase “a common manner of think- 
ing, feeling, and willing" is susceptible of vari- 
ous interpretations. In the period 1850-1930, 
roughly delimited, these interpretations were 
regarded as so many attempts to solve the prob- 
lem of the "group mind." Solutions were offered 
by sociologists, philosophers, and by anthro- 
pologists, as well as by psychologists. In recent 
years the label "group mind" has fallen into 
disuse, but the same issues remain in new guises. 
For instance, what are today called “social insti- 
tutions” are manifestly common manners of 
thinking, feeling, willing; so too is a “culture,” 
also a “nation.” The problem of the group and 
the individual, of the one and the many, is still 
with us. And the solutions of the past, often only 
slightly disguised, may be found in current 
theorizing. 

A modern example. A team of five contempo- 
rary authors set out to discover “the functional 
prerequisites of a society” (Aberle, Cohen, Da- 
vis, Levy, Sutton, 1950). They defined society as 


a group of human beings sharing a self-sufficient 


system of action which is capable of existing 
longer than the life span of an individual, the 
group being recruited at least in part by the 
sexual reproduction of the members. (p. 101) 


A nation or tribe is an excellent example of a 
society thus defined. Such a society, the authors 
argue, cannot endure unless the following con- 
ditions are met: 


1. There must be provision for adequate re- 
lationship to the environment and for 
sexual recruitment of new members. 

2. There must be role differentiation and 
role assignment (i.e., a division of labor to 
carry out the intricacies of the social or- 
ganization) . 

3. Means of communication must exist. 

4. The members must share a body of “cog- 
nitive orientations" (i.e, have common 
modes of thinking) . 

5. They must share certain goals (common 
modes of feeling and willing). 

6. There must be a normative regulation of 

means (e.g., laws, ceremonials, customs) . 

Affective states must be regulated (because 

"the ungoverned expression of lust and 

rage leads to the disruption of relation- 

ships and ultimately to the war of all 

against all") . 
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8. Socialization (ie, new members must 
learn the accepted modes of behavior) . 

9. There must be effective control of disrup- 
tive forces (crime, fraud, promiscuity) . 


These authors write in an ahistorical vein; they 
make no mention of the venerable problem of 
the group mind. Yet the issues they treat are 
precisely those that have led many of their 
predecessors to argue for the existence of a 
group mind. 

A close parallel may be seen between this list 
of functional prerequisites for a society and the 
criteria established for the group mind by Mc- 
Dougall (1920). (Numbers in parentheses call 
attention to the items in the above list that 
parallel McDougall’s criteria.) 


A. Continuity of existence, with the indi- 
vidual replaceable (1). 

B. In the minds of members there must be 
an idea of the group (4,5) . 

C. Interaction with other groups, especially 
rivalry and conflict, promotes a group self- 
sentiment (implied in 3,4,5) . 

D. Existence of a body of traditions and cus- 
toms in minds of members (implied in 
3,4,5,6,7,8,9) . 

E. Specialization of functions and division of 
labor (2). 


Whether we have made an exact paralleling for 
these two sets of criteria is not important. Our 
point is simply that historical formulations of 
the group-mind problem are still relevant to 
current theorizing. Earlier writers struggled to 
place the individual and his group in proper 
perspective — so do modern authors, Earlier 
writers sensed a continuity in the ideas, motives, 
habits of the group that was not dependent up- 
on the ideas, motives, and habits of any par- 
ticular individual. So do modern authors, 

The social psychologist is not particularly 
concerned with the vast amount of ethnological 
and sociological evidence that may be adduced 
in support of the social mind. The existence of 
language, religion, tradition, nationalism is 
enough to convince him that a problem exists. 
What he wants to know is just how the social 
mind is supposed to operate. "Mind" is his 
province, and if in any sense at all there is a 
social or group mind he wants to know its na- 
ture and functioning. What then are the lead- 
ing historical views from which he may make his 
selection — or upon which he must improve? 

While any classification of theories is some- 
what arbitrary, it seems helpful to distinguish 
seven types of emphasis. Occasionally a single 
author will be found to embrace more than one 
of these conceptions. 


. Analogical doctrines 

. Collective unconscious 

. Objective mind 

Folk mind 

. Collective representations 
. Cultural determinism 

. Common-segment doctrines 


MO Como 


Analogical doctrines. Throughout the ages 
men have likened society to an organism. We 
have already called attention to Plato’s psycho- 
social analogy: 


the head — cognition — the ruling class 
the breast — conation — the warrior class 
the abdomen — affection — the slave class 


The first edition of Hobbes’ Leviathan (1651) 
carried as its frontispiece a picture of a gigantic 
human figure made up of smaller men (Fig. 2) . 
This simple imagery serves to arouse in our 
minds the problem that has troubled writers 
from Plato to the present day. How are we little . 
men related to the Great Being of society? The 
first relationship that strikes us is one of simi- 
larity. Little men have thoughts, feelings, wills; 
so too in a sense does the Great Being. 

Herbert Spencer (1876) pressed the analogy 
much further. The nutritive system of society 
includes agriculture and the productive indus- 
tries; the circulatory system includes traffic, com- 
munication, the stock exchange; the regulating 
“brain” consists of the organs of government. 
With what we may suspect to be wry humor, 
Spencer ascribed to the House of Commons the 
functions of the cortex, to the House of Lords 
the functions of the medulla oblongata. Such 
analogies were vastly popular among nineteenth 
century social theorists, 

Some writers, Spencer among them, pointed 
to limitations in the analogy: society lacks the 
specific external form that every organism pos- 
sesses; its parts or "organs" are not in contact 
with one another and show greater freedom of 
mobility. Yet more convincing, these writers 
felt, are the similarities: the social organism, 
like the individual, has a continuous life history 
from birth to death; an increasing complexity 
marks its growth, along with an increasing spe- 
cialization and differentiation of parts. What is 
most important of all, one cell (person) may 
die without disrupting the continuity of the 
whole. Individuals are replaceable; one perishes 
and one is born, but the common mental con- 
tent and functioning of the group continue. 

Analogical thinking is by no means a thing 
of the past. The organismic model still attracts 
many types of writers. One often reads such 
statements as the following: "Religion is the 
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902 Power of A Connon’; 
Mars EecresrastIcar4 


and CIVIL 


By Tomas HOBBES; 
b. of MALMESBVRY: di 


Fic. 2. “By Art is created the great Leviathan called a Commonwealth or State (in Latin Civitas) which is 
but an artificial Man, though of greater stature and strength than the Natural, for whose protection and de- 
fense it was intended.” (From the Frontispiece to the First Edition of Hobbes’ Leviathan, 1651) . 
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heart in the organism of the group mind. If this 
heart ceases to beat, decay and decomposition 
set in.” Mere metaphor — but helpful in mak- 
ing a point. A more subtle example of current 
analogical doctrine is seen in Wiener's concep- 
tion of Cybernetics (1948). The initial model 
employed is modern mechanics. The “feed- 
back" machines (e.g. thermostat) have servo- 
mechanisms like those of the human body. Both 
may, for example, become “jammed” through 
overloading or through faulty clearing. Simi- 
larly in society, systems of communication can 
be overloaded and clog, resulting in “noise.” 
And just as neuroses and compulsions may ob- 
tain a monopoly on the nervous system of the 
individual, so may selfish tycoons buy up and 
dominate the communication channels of so- 
ciety. Enthusiasts for cybernetics might say that 
Wiener is really stating universal laws and thus 
forging a "unity of science"; the same laws of 
servomechanisms, communication, and noise 
apply to machines, to man, and to society. But 
others will see in cybernetics nothing much 
more than an artful revival of Spencerian 
thinking. 

Collective unconsciousness. In a sense, all psy- 
chologists hold that our conscious mental life 
is continuous with a subconscious mental life. 
A few (not many) assert that this subconscious 
mental life reaches out to embrace more than 
our own personal experiences. Each person's 
mind is like an island. While it seems to stand 
alone, there are subterranean connections via 
the ocean floor with other islands. The ocean 
floor can be conceived in a variety of ways. 
Perhaps it is a "racial unconscious," as Jung 
would say (1922) ; perhaps it is the atman (the 
universal soul) of Hindu psychology. Perhaps 
it is "objective mind" — the objektiver Geist of 
Hegel, the island formations representing so 
many individuations of one universal psychic 
system. 

William James, though certainly not a He- 
gelian, employed a similar postulate to account 
for certain religious experiences, especially the 
apparent inrush of religious conviction and 
comfort during moments of prayer or medita- 
tion. 


The further limits of our being plunge, it seems 
to me, into an altogether other dimension of ex- 
istence from the sensible and merely “under- 
standable" world. Name it the mystical region, 
or the supernatural region, whichever you choose. 


(1902, p. 506) 
To James this metaphysical assumption did not 
seem far-fetched. He wrote at a time when the 
subconscious self had become an accredited 
phenomenon. Who could tell the limits to its 


subliminal reach? He felt that the concept was 
"exactly the mediating term required" to relate 
the scientific study of the individual to the 
superindividual verities of religion. In terms of 
consciousness we are single individuals, but in 
terms of subconsciousness each of us has a wider 
self than he knows. 

The metaphysical view here represented is 
not popular in present-day social psychology. 
Yet there are those who argue that without some 
similar doctrine it is hard to explain certain 
common ideas that occur in the race. Jung's 
contention, for example, is that certain symbols 
recur, especially in dreams, regardless of specific 
contact between cultures (1922). For instance, 
an individual entirely ignorant of Egyptian lore 
may in his dreams or fantasy employ the scarab 
as a symbol of immortality. In so doing he dips 
into a racial unconscious. 

Objective mind. We need to look more closely 
at the Hegelian conception of the objektiver 
Geist, for it has seeped into much sociological 
and psychological thinking. Brett, in A History 
of Psychology (1921, Vol. 3) , argues that we owe 
to Hegelian philosophy the beginning of social 
psychology. For Hegel's system is the true ante- 
cedent of all theories which pass beyond indi- 
viduals and claim that in some sense the social 
mind has unity. 

According to Hegel's idealistic philosophy 
there is only one Mind (1807). It is absolute, 
all-embracing, divine. It works itself out in the 
course of history, Individual men are but its 
agents. Its principal focus is in the state, which 
is therefore the chief agent of divine life on 
earth, Each state has, in fact it is, a group mind. 
It has its own laws of growth and development 
(the dialectic) and while it makes much use of 
individuals, it is by no means reducible to their 
transitory mental life. Marx, as well as Hitler, 
was among the sinister spiritual children of 
Hegel. Like him, they equated personal free- 
dom with obedience to the group, morality with 
discipline, personal growth with the prosperity 
of the party, class, or state. Du bist nichts: dein 
Volk ist alles was the Nazi rallying cry. 

We can trace Hegel's psychological influence 
in several directions. As we have said, it under- 
lay Karl Marx's exaltation of social class as a 
superindividual entity, All contemporary Soviet 
psychology reflects this view (Bauer, 1952). In 
Britain, Bosanquet (1899) and Green (1900) 
were among the political philosophers who, fol- 
lowing Hegel, viewed the state as an organic 
mind transcending the component minds of 
individuals, and demanding “sober daily loy- 
alty.” It is hardly necessary to point out that 
psychological apologists for racism and national- 
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ism (eg. Jaensch, 1938) tend no less than 
Hegel to apotheosize the group mind, as repre- 
sented by the state, race, folk, or Kultur. 
Schaeffle (1878), while approaching the matter 
by way of the organic analogy, completed this 
analogy by accepting the nation as an organism 
in the fullest sense of the word, as capable of 
having its own goals and purposes, its own pleas- 
ures and pains, its own consciousness, and its 
own right to demand obedience from the in- 
dividual. 

Even without subscribing to the moral aspects 
of Hegelian philosophy, many social psycholo- 
gists in Germany have taken up his concept of 
Volksgeist. To them it seems self-evident that a 
people has a mental unity. Hence the goal of 
social psychology is the discovery of "what must 
be added to the sum of individuals to account 
for the creation of a unity out of multiplicity, 
and what aspects of individual action conform 
to the action of others so that a harmony of 
activity results" (Lukasczyk, 1951, p. 51) . In the 
following section we shall consider Vólkerpsy- 
chologie, which especially busied itself with this 
problem. 

In France, Espinas (1877) set forth a theory 
of an objective group mind, apparently uninflu- 
enced by Hegel. Individual minds, he held, fuse 
into a “collective consciousness.” They do so 
because presentations (ideas) pass rapidly from 
mind to mind by means of language and ges- 
tures; impulses (emotions) likewise are con- 
tagious. Thus it is from the normal processes of 
communication that a group mind is generated. 
While individual minds are multiple, yet there 
is engendered a social consciousness capable of 
knowing itself, and in which each individual 
participates. 

Folk psychology. In addition to Hegel, sev- 
eral writers, such as Fichte and von Humbolt, 
laid the philosophical ground for the concept of 
Volk which gradually emerged in the course of 
Germany's struggle for unification. By 1860 the 
concept was ripe for psychological exploitation. 
At that time three programs for research and 
theory were laid down, all of them aiming to 
combine ethnological study with psychological 
interpretations. Bastian (1860) introduced his 
concept of Elementargedanken — thought-forms 
so basic to the life of a folk that they would 
supply the unit of analysis needed for a study of 
any form of human society. Lazarus and Stein- 
thal (1860) inaugurated the Zeitschrift für 
Vólkerpsychologie und Sprachwissenschaft, 
which had an influential life of thirty years, 
ceasing publication in 1890. Wundt (1862) 
wrote his remarkable Beitráge, in which he laid 
down a program of writing and research for 
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himself that should (and did) occupy him for 
sixty years, reaching its culmination in the pub- 
lication of ten volumes of his Vólkerpsychologie 
(1910-1920). While these authors worked from 
the armchair (or at "the green table," as the 
Germans say) , the movement they represented 
had a lasting influence on social science. In 
Germany, Thurnwald (1925-1935), and in 
America Judd (1926) and Goldenweiser (1933), 
are at least in part representatives of the tradi- 
tion. 

Some years later England picked up the 
thread with empirically skillful fingers. The 
conviction that anthropology needs psychology 
found its first expression in authentic collabora- 
tive field work. In 1898 the Cambridge Anthro- 
pological Expedition to Torres Straits enlisted 
in its field staff three trained experimental psy- 
chologists, W. H. R. Rivers, W. McDougall, and 
C. S. Myers. Here for the first time a study with 
laboratory equipment was made of preliterate 
peoples under their ordinary conditions of life. 
The expedition was a landmark in the collabo- 
ration of social scientists. The spirit of this 
historic research is still alive and expanding. 
Today ethnologists often borrow both theory 
and methods from psychology, and psychologists 
in turn are growing more and more aware of 
the need for testing their hypotheses cross- 
culturally. In this connection one thinks of 
the work of Freud (1913), Bartlett (1923), 
Róheim (1925), Rivers (1926), Kluckhohn 
(1944) , Malinowski (1944), to mention only a 
few. It is not difficult to account for the inter- 
dependence of social psychology and ethnology 
(cultural anthropology). Taken by itself, eth- 
nology lacks a theory of the motives and capaci- 
ties of men; whereas social psychology by itself 
cannot state the cultural models to which men's 
motives and capacities tend to conform. 

Lazarus and Steinthal were a collaborative 
pair, an anthropologist and a philologist, pri- 
marily interested in a comparative characterol- 
ogy of peoples. Each people, they held, had its 
own Volksgeist. This Geist they defined in the 
opening volume of their Journal (1860) as "a 
similar consciousness of many individuals, plus 
an awareness of this similarity, arising through 
similar descent and spatial proximity." In this 
definition we may, if we wish, see merely a mild 
nominalism. The reality of the group mind is 
not transcendental, it is only a matter of simi- 
larities of individual minds plus a recognition 
of these similarities. But, unfortunately, this 
easily intelligible concept is not consistently 
maintained. In most of their discourse Lazarus 
and Steinthal speak of the Volksgeist in true 
Hegelian fashion as a metaphysical entity stand- 
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ing over against the individual mind. This type 
of double-talk was apparently encouraged by the 
influence of Herbart, whom the authors fol- 
lowed. Herbart had written an essay proposing 
that the static and dynamic conditions of the 
individual mind had parallels in the statics and 
dynamics of the state (1821). Just as ideas 
might rise above, or sink below, the threshold 
of an individual's consciousness, so too they 
may grow active or fall into disuse in society. 
Just as ideas may conflict in an individual's 
mind, so too may there be a war of ideas in 
society. Strictly speaking, Herbart's own posi- 
tion is merely analogical, but it was interpreted 
as establishing an independent self-active Geist 
of the people. 

The embodiment of the folk mind lies in 
speech, mythology, religion, folklore, art, litera- 
ture, morality, custom, and law. Hence the 
appropriate methods for its study are those 
familiar to ethnologists and to philologists. 
With this starting point, it is not surprising that 
little progress was made in developing a social 
psychology, as distinct from traditional eth- 
nology. In spite of programmatic fanfare, psy- 
chological analysis did not advance. When, 
on rare occasions, psychology was invoked it 
was invariably the idea-psychology of Herbart. 

Wilhelm Wundt (1832-1921), in spite of his 
controversies with Lazarus and Steinthal, was 
much like them in his conception of Vélkerpsy- 
chologie. The chief systematic difference be- 
tween them was that Wundt was not a 
Herbartian but a Wundtian. Having evolved 
his own fundamental concepts in general psy- 
chology, he drew upon them freely in his social 
psychology. Wundt decided early in his career 
(1862) that there were two branches of psychol- 
ogy — physiological and social (folk). He 
planned to devote the first part of his life to 
the former subject, and the second part to the 
latter. He kept to his plan, writing prodigiously 
— an average of two and two-tenths pages per 
day for a span of sixty-eight years, 53,735 pages 
in all (Boring, 1950, p. 345) . 

The most significant fact about Wundt is his 
insistence that the study of all higher mental 
processes falls in the province of Vélkerpsy- 
chologie. He did not believe that individual 
psychology, especially as pursued in the psycho- 
logical laboratory, could account for men's 
thought. Thinking is heavily conditioned by 
language, by custom, and by myth, which to him 
were the three primary problem areas of Völker- 
psychologie. The argument runs as follows: 
when men receive sensations from the outer 
world these commence to combine according to 
the laws of association, but association depends 


upon the absorption of an impression into the 
individual's apperceptive mass. Impressions en- 
ter into a creative relation with memories and 
contexts stored in this apperceptive mass. But 
this apperceptive mass is itself largely a product 
of culture. The apperceptive mass is furnished 
with linguistic habits, moral ideas, and ideologi- 
cal convictions that compose the Volksseele. 
Wundt would have felt at home in modern-day 
discussions of “social perception." 

He preferred the term Volksseele to Volks- 
geist which, he insisted, was too objective, im- 
plying an entity entirely apart from, and over 
and above, individuals. Semantically at least, 
Wundt leaned somewhat away from the sub- 
stantive group mind implied by Hegel and by 
Lazarus and Steinthal. Yet, like all group-mind 
theorists, Wundt seems to temporize. He says 
that while it is true that the folk soul presup- 
poses component individuals, yet it is more than 
the sum of their mental activities. Interaction 
brings about new qualities. A language, for ex- 
ample, is a psychic product and a determining 
force that cannot possibly be accounted for ex- 
cepting in terms of interaction. Josiah Royce 
was following Wundt when he wrote: 


The creator of the English speech is the English 
people. Hence the English people is itself some 
sort of mental unit with a mind of its own. (1913, 


p. 27) 


The case for the folk soul rests to a large 
extent upon Wundt’s concept of psychic actual- 
ity. Although in a broad way Wundt subscribed 
to psycho-physical parallelism, yet he believed 
that mental events can be regarded as lying in a 
causal chain. Men's experience is not identical 
with the activity of nerves. One state of con- 
sciousness may lead directly to another, and one 
man's conscious life may interact with an- 
other's. Thus by employing a somewhat loose 
hypothesis of the body-mind relationship 
Wundt is able, without serious difficulty, to 
claim a kind of autonomy for group mental life. 

We have said that, to Wundt, the primary 
ingredients of the Volksseele were language, 
myth, and custom. By the time he had exhausted 
these topics in his ten-volume Vólkerpsychologie 
he had, however, embraced art, religion, law, 
and social organization. But he omits altogether 
the material and technological aspects of cul- 
ture. He found no use for the skulls, tools, or 
potsherds that fascinate anthropologists. Why? 
Because as a psychologist Wundt identified the 
mind of a folk only with its immaterial culture, 
i.e., with the habits and ideas that lead to activ- 
ity, not with the products of activity. 

Summary. 'The movement we have been dis- 
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cussing is a curious blend of Hegelian idealism, 
Herbartian idea-psychology, Wundtian con- 
cepts, and library ethnology. While its central 
intent was to discover the nature and laws of 
the group mind, the actual accomplishment in 
theory was meager and confused. The move- 
ment did, however, succeed in preparing the 
ground for fruitful collaboration between psy- 
chology and social anthropology. Today this 
tide of collaboration is still rising, and has not 
yet reached its crest. An important clarification 
of concepts has occurred, as Kroeber and Kluck- 
hohn (1952) point out. What was formerly 
ascribed to the mind of the Volk is now largely 
subsumed under the current concept of culture. 
“Personality in culture” promises to be a far 
more productive concept than “the mind of the 
group” has been. 

Collective representations. Emile Durkheim, 
1858-1917, was a French contemporary of 
Wundt. In many ways the views of the two 
authors were similar. Both felt keenly “the real- 
ity of the social.” Both insisted that individual 
minds were in some sense compounded into a 
collective mind. Each had a favorite example of 
the force that this collective mind may exert 
upon the individual: Wundt chose language, 
Durkheim religion. Both tended toward intel- 
lectualism, regarding the social mind as 
furnished with ideas, representations, appercep- 
tions (rather than with emotions, passions, voli- 
tions). Of the two, however, Wundt was the 
more voluntaristic. 

In one respect, however, Durkheim was more 
extreme than Wundt. He rejected the paral- 
lelistic view that men's bodies and men's minds 
are somehow closely bound together. Wundt, 
with his concept of “psychic actuality,” had 
taken liberties with this doctrine perhaps more 
than he knew. But Durkheim repudiated it al- 
together, To him events in the nervous system 
were one thing, events in the individual mind 
something entirely different, and events in the 
social mind a third distinctive type of reality. 

His argument, as set forth in the Révue de 
métaphysique (1898) is trenchant. He first sets 
out to demolish the epiphenomenal view that 
images, memories, thoughts are merely the ex- 
crescence of brain activity. William James had 
identified memory with brain traces, and denied 
that memory was a fact of the mental order at 
all. Huxley had likened thought to the shadow 
cast by the machinery of the brain. Durkheim 
ridicules the claim. If mental states are only an 
echo of the physical, why do we have conscious 
mental states at all? And this was a question 
that perplexed William James as well, and 
drove him, ultimately and inconsistently, to ar- 
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gue as Durkheim did for a certain independence 
of the psyche. 

Durkheim was willing to admit that only sen- 
sations, the primary elements of individual 
states of mind, are referable directly to cerebral 
states. But once existing, we find that sensations 
compound themselves by the laws of association 
into images and ideas. These are no longer 
referable to brain states. Why not? For one 
thing the basic law of association is similarity 
(similar ideas form larger clusters or concepts), 
and there is no mechanism in the brain for pro- 
ducing similarity. While the point is too tech- 
nical to discuss here, Durkheim's attack on the 
identical-element theory of similarity is still 
cogent. It is only whole ideas, he argues, that 
are similar to one another; they are not decom- 
posable into elements or reducible to brain 
states. 

Having established to his satisfaction that in- 
dividual representations (ideas) are not redu- 
cible to nervous operation, he then continues 
his case for emergence. 


Just as you cannot specify the contribution of 
each brain cell to an image, so you cannot specify 
the contribution of an individual to the collective 
representation. (1898, p. 296) 


Thus having leaped the physiological-psycho- 
logical gulf, he then leaps the psychological- 
social gulf. While individuals contribute to the 
conscience collective, this final emergent is ex- 
terior to, and independent of, any specified in- 
dividual: 


When we have said that social facts are in a 
sense independent of individuals and exterior to 
individual consciousness, we have only affirmed 
in the social realm what we have established in 
the psychic realm. (p. 293) 


While higher levels emerge from lower, there 
is a new autonomy achieved at each stage. 


brain cells 3* sensations -> images —> indi- 
vidual representations — collective represen- 
tations. 


Not only is Durkheim a dualist in his separa- 
tion of mind and body, but from the point of 
view of monists, he compounds the felony by 
arguing for the independence of the social and 
the individual. There are two consciousnesses 
in each of us, he insists: one pertains to our own 
private experience, the other to all portions of 
experience attributable to the fact of human 
association. This association of human beings 
produces thoughts and manners and practices 
that are external to, and independent of, any 
single human being. Religion (1912) is a collec- 
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tive representation to which the individual in 
his course of growth is exposed. The division 
of labor (1902) is similarly a fact of the social 
mind, not of the individual. Social disruption 
(anomie), even the causes of suicide (1897) , are 
exterior to any one member of the group. 

Besides having the property of exteriority, 
collective representations have likewise the 
property of constraint. There is a compelling 
force in society that requires the individual to 
behave according to its rules. Often, of course, 
we cooperate and conform gladly. The concept 
of constraint covers both cooperation and co- 
ercion. 

Durkheim, it is true, is a sociologist, yet the 
line of reasoning by which he achieves his 
“sociological sociology" is of special interest to 
the psychologist. His argument is more pains- 
taking, more sophisticated, than that of most 
writers who have postulated a group mind. 
There are many secondary sources that give 
fuller treatment of Durkheim's contributions, 
among them Gehlke (1915), Goldenweiser 
(1933), Parsons (1937). Durkheim's influence 
upon anthropological theory may be seen strik- 
ingly in Lévy-Bruhl's work, La mentalité primi- 
tive (1922). 

While the concept collective representations 
is peculiarly Durkheim's, other authors — psy- 
chologists among them — have shown its in- 
fluence. Take, for example, Fouillée's key 
concept of the idées-forces (1908). While this 
writer believed that Durkheim had pushed the 
criteria of exteriority and constraint too far, yet 
he accepted the basic fact that much of each 
person's mental life is of social origin. He pro- 
posed that the so-called constraint of social 
ideas lies in the psychologically anchored mech- 
anism of ideo-motor action. Fouillée saw no 
need to break with psychology as drastically as 
Durkheim had done. 

The latter's influence may be traced also in 
Piaget (1932), who regards the moral realism 
of the child as the result of the child's exposure 
to, and unquestioning acceptance of, prevailing 
conceptions of right and wrong, good and bad. 
Similar is Bartlett. (1932) , who regards memory 
as heavily conditioned by the thought forms 
current in a social group. While Piaget and 
Bartlett would not subscribe to Durkheim's view 
as a whole, the problems they treat and their 
manner of approach show a Durkheimian re- 
spect for the reality of the social. 

Cultural determinism. Strictly speaking, cul- 
tural determinism is not a theory of the group 
mind. It is, however, a doctrine that makes 
much of the exteriority and constraint of cul- 
tural factors. Extreme advocates seem at times 


to be saying that psychology can explain noth- 
ing of importance concerning social behavior. 
They quote with favor Durkheim's adage, 
"Every time a social phenomenon is directly 
explained by a psychological phenomenon, we 
may be sure that the explanation is false." The 
following passage, written by White (1947), 
presents the argument: 


To the culturologist the reasoning that says that 
one people drinks milk because they like it, an- 
other do not because they loathe it, is imbecilic. 
-.. It explains nothing at all. Why does one 
people like, another loathe, milk? This is what 
we want to know. And the psychologist cannot 
give us the answer. Nor can he tell us why a 
people does or does not avoid mothers-in-law, 
practice monogamy, inhumation, the couvade, or 
circumcision; use chopsticks, forks, the pentatonic 
scale, hats, or microscopes; form plurals by affixa- 
tion — or any of the other thousands of customs 
known to ethnography. 


White favors a cultural explanation even for the 
existence of nonconformists: 


Even a revolutionist is not an individualist: he 
is a human organism that is held and wielded by 
certain cultural elements that are moving in the 
direction of profound change. 


This manner of thinking about social be- 
havior is at a high level of abstraction. Only 
gross changes, broad trends, and long-range 
effects are considered. Culturologists happen to 
be interested in wide horizons; to them the in- 
dividual is unimportant. They do not worry 
about the fact that all cultures have plenty of 
men who deviate from the picture they create of 
a "modal personality." Even in a milk-drinking 
culture, plenty of individuals loathe milk. Nor 
do culturologists worry about the extreme selec- 
tivity individuals show in responding to this or 
that feature of their own culture. Nor are they 
concerned with the process of communication 
and learning by which children within a culture 
acquire its ways. Cultural determinism says in 
effect that children in English-speaking coun- 
tries will learn to speak English, but it does not 
explain why they learn to speak at all, or how 
they learn, or why among them there are almost 
infinite variations in habits of speech. 

The quarrel, of course, is unprofitable. Let 
us admit that cultural models prescribe what 
types of conformity will (probably) be learned. 
Also that over long periods of time language 
and other cultural forms seem to go through 
cycles of change independently of what any par- 
ticular individual may do or think. Yet within 
this cultural matrix certain psychological factors 
are presupposed: motivation, learning, the 
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processes of perception, concept formation, and 
the organization of attitudes and sentiments. 
Only when the cultural and the psychological 
factors are treated together is a full explanation 
of social behavior likely to be achieved. 

Common and reciprocal segments of behavior. 
We have pointed out that the term “group 
mind” disappeared from the scene after the 
1920’s, even though the problem itself has per- 
sisted. Boredom had much to do with dropping 
the label. Everyone seemed wearied by the cli- 
max of the controversy in the decade of the 
twenties. 

Certain nominalists attacked the “group fal- 
lacy” with vigor. Thus F. H. Allport (1924) 
wrote: 

Nationality, Free-masonry, Catholicism, and the 
like are not group minds . . . they are sets of 
ideals, thoughts, and habits repeated in each in- 
dividual mind and existing only in those minds. 
. .. All theories which partake of the group 
fallacy have the unfortunate consequence of di- 
verting attention from the true locus of cause and 
effect, namely, the behavior mechanism of the 
individual. (p. 9) 

This same author expanded and fortified his 
argument in Institutional Behavior (1933) . Op- 
posing views, based largely on the argument of 
cultural determinism, were stated by Wallis 
(1925) and Judd (1926). A judicious weighing 
of evidence, pro and con, came from the pens 
of the philosophers Perry (1922) and Hocking 
(1926) . 

Best known among the disputants is McDou- 
gall (1920). The first edition of his Group 
Mind seemed to affirm, with qualifications, his 


belief in its existence; the second edition (1928) 
bewailed the reader's misunderstanding of his 
position. Earlier in this section we stated Mc- 
Dougall's criteria for the existence of a group 
mind. He found these fulfilled most clearly in 
the case of nations, for nations have a perma- 
nent and highly organized character. McDougall 
does not regard his criteria as leading to the 
postulation of a social consciousness, for “there 
is no consciousness excepting in individuals"; 
yet, he insists, “we may still speak of collective 
minds, for we have defined mind as an organized 
system of interacting mental or psychical forces" 
(p. 66) . 

In spite of their disagreements, there is a com- 
mon thread in the thinking of McDougall and 
of F. H. Allport. What is called the group mind, 
they say, is essentially the abstraction of certain 
attitudes and beliefs from the personal mental 
life of individuals. We may, if we wish, regard 
all the common (comparable) attitudes of Eng- 
lishmen as comprising the group mind of the 
English nation. McDougall, in the Preface to 
the 1928 edition, explicitly states that he views 
the group mind f 

as consisting in the similarities of structure of the 

individual minds which render them capable of 

responding in similar fashion to the common fea- 
tures of the environment, social and physical. 

(p. ix) 


Social institutions, thus regarded, become ab- 
stractions from the behavior and consciousness 
of individuals. People are not parts of institu- 
tions, but institutions are parts of people. Figure 
3 represents the situation diagrammatically. 


Group mind 


or 
Institution 
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Individual Individual 
A B 


Individual 
c 


Individual Individual 
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Fic. 3. Common and reciprocal segment theory. 


It should be noted in Fig. 3 that common 
segments (i.e., similar habits and attitudes) are 
clearly represented, but that the reciprocal re- 


lation of these segments — also essential to a 

group mind or institution — is not well repre- 

sented. As it stands, the diagram overstresses 
H 
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conformity, which is indeed an important aspect 
of institutionalism, but understresses the reci- 
procity of roles, which is an equally important 
aspect. Both McDougall and F. H. Allport 
(1940) admit that both aspects are vital. The 
latter author considers conformity and reci- 
procity to be the essential features of operation- 
ally denotable "event structures" that have in 
the past been called group minds or institutions. 

Such a view preserves the strictly individual 
locus of mind, yet enables us for certain pur- 
poses to examine segments of the mental life of 
individuals in relation to comparable, or recip- 
rocal, segments of the mental life of others. This 
method permits culturologists to affirm the im- 
portance of custom, viewed as common con- 
formity. It allows the sociologist to affirm the 
independence (in a sense) of institutions, the 
activity of social systems, and the viability of 
role relations. 

Final word. After McDougall had wrestled 
with the problem for a time, he bewailed his 
"tactical error" in using the phrase "group 
mind." Probably it is regrettable that the con- 
cept was ever used by anyone. We see now that 
it has unnecessarily imposed metaphysical 
blocks in the path of constructive conceptuali- 
zation. It is important, however, for the student 
of social science to understand this historical 
background. Corporate groups have been mul- 
tiplying at a spectacular rate throughout mod- 
ern society. An individual is a member of many 
publics, of many institutions, of many social 
systems. Each of these memberships poses afresh 
the ancient question. The problem of the group 
mind will not be downed. 

But at the present time theories are being re- 
centered. Writers realize that the integrity of 
the personality system can be preserved even 
while we affirm the existence and action of social 
structures that are transindividual. It was Sapir 
who advised all social and psychological scien- 


tists to form the habit of looking at their data 
both from the concrete individual point of view 
and from the abstract social point of view. It 
enriches research and theory to do so. As we 
have said, the approach now growing in favor 
combines the idea of “common segment" with 
that of "role." Parsons and Shils (1951) write: 


Personality as a system has a fundamental and 
stable point of reference, the acting organism. 
It is organized around the one organism and its 
life processes. But ego and alter in interaction 
with each other also constitute a system. This is 
a system of a new order, which, however inti- 
mately dependent on them, does not simply con- 
sist of the personalities of the two members. (p. 


23) 


Having thus restated the problem in terms of 
“systems,” the authors proceed to say that “the 
most significant unit of social structures is not 
the person but the role.” Both the actor and 
those with whom he interacts possess certain 
expectations due to their places in the social 
structure. The final essential step in the argu- 
ment, obviously akin to the common segment 
theories, is stated as follows: 


The abstraction of an actor's role from the total 
system of his personality makes it possible to 
analyze the articulation of personality with the 
organization of social systems. The structure of a 
social system and the functional imperatives for 
its operation and survival or orderly change as a 
system are moreover different from those of per- 
sonality. (loc. cit.) 


Thus what was formerly called the “group 
mind” becomes a matter of abstracted role seg- 
ments, both common and reciprocal. These 
segments or aspects of personality provide the 
raw material for the social system. While this 
mode of conceptualization is still incomplete, 
it provides a promising new attack upon the 
venerable riddle of the group mind. 


Units OF ANALYSIS 


A firm science employs definitely agreed upon 
units of analysis: e.g., the quantum of physics, 
the element of chemistry, or the nerve impulse 
in physiology. By contrast, psychology has suc- 
cessively tried out (and in part discarded) a 
wide variety of elusive units: €g., sensation, 
image, idea, reflex, S-R bond, Social psychology 
has had a similar experience. It would like to 
have some relatively stable unit (or units) of 
behavior out of which the theorist can com- 
pound a systematic view of social interaction. 
But, as in the case of general psychology, the 
search has met with frustrations, Some social 


psychologists, especially Gestaltists and field 
theorists, say that the search for elements is 
doomed to failure because the individual does 
not react to stimuli in fixed ways; he alters his 
adjustment to accord with the total pressure of 
internal and external conditions (e.g, Asch, 
1952) . 

Still, historically, the postulation of units has 
played an important part in the development of 
social psychology, and it is by no means clear 
that they can be, or should be, dispensed with. 
Most important among the assumed units are 
instinct, habit, attitude, sentiment. From this 
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list we shall leave out idea, which, as we have 
already made clear, declined in popularity at 
the end of the nineteenth century. 

Instinct. While faculty psychologists since 
the time of Descartes affirmed and classified the 
"passions" of men, their efforts led to no sys- 
tematic discussion of the nature of these facul- 
ties or of their social consequences. Darwin's 
doctrine of natural selection changed the situa- 
tion. It gave grounds for believing that all be- 
havior — animal and human, individual and 
social — lay in the evolutionary array of in- 
stincts that make for survival of the species. 
Following Darwin's lead, naturalists immedi- 
ately began to prepare lists of animal instincts. 
It was not long before writers of psychological 
textbooks (e.g, James, 1890) set forth with 
more abandon than reflection long schedules of 
the presumably basic motivational units under- 
lying human behavior (cf. Bernard, 1926). But 
it remained for McDougall to clarify the impli- 
cations of Darwinism and build a completely 
coherent system of social psychology upon the 
instinct hypothesis. His definition of instinct, 
if fully understood word by word, adumbrates 
his entire system of social psychology. 


We may, then, define an instinct as an in- 
herited or innate psycho-physical disposition 
which determines its possessor to perceive, and to 
pay attention to, objects of a certain class, to 
experience an emotional excitement of a particu- 
lar quality upon perceiving such an object, and 
to act in regard to it in a particular manner, or, 
at least, to experience an impulse to such action. 
(1908, p. 30) 


Under the broad tent of this definition Mc- 
Dougall is able to order many, if not most, social 
phenomena, The process of suggestion is a con- 
sequence of the instinct of self-abasement. 
Primitive passive sympathy, as we have seen, is 
a matter of induced instinctive seizure when one 
person sees another behaving in an instinctive 
manner. Laughter is an antidote to such sympa- 
thetic induction. Emotion is lockstitched to 
instinct. Instincts may form compounds: thus 
a person's religious orientation may be a blend 
of curiosity, self-abasement, flight (fear), and 
the tender emotion which accompanies the 
parental instinct. McDougall believed that the 
whole story of social behavior could be based 
on his hormic (urge-like) view of instincts. He 
readily admitted that the force of learning and 
integration creates complex units (sentiments) 
that are often of more immediate concern than 
instincts to social psychologists. But sentiments, 
he insisted, are not the ultimate units of be- 
havior. Only instincts are ultimate. 

The decade following the appearance of Mc- 


Dougall's Social Psychology was almost wholly 
dominated by instinct theory. We have already 
called attention to G. Wallas’ Human Nature 
and Politics (1908) , Trotter's Instincts of the 
Herd in Peace and War (1916). We may add 
Thorndike’s The Original Nature of Man 
(1913), Woodworth's Dynamic Psychology 
(1918) , and Dewey's presidential address to the 
American Psychological Association, in which 
he declared that the science of social psychology 
must be founded on a doctrine of instincts 
(1917). 

The same decade was marked by the world- 
wide spread of Freud’s theories. While Freud’s 
instinctivism differs in many ways from Mc- 
Dougall’s, it too helped fix the attention of 
social psychologists upon the primacy of im- 
pulses in social conduct. During the decade 
1910-1920, however, Freud’s influence on social 
psychology was not as marked as McDougall’s, 
probably because the instincts Freud postulated 
were far less definite and manipulable than 
McDougall’s. By 1932 McDougall had given 
up the term “instinct” (in response to violent 
attack by behaviorists) , but maintained all es- 
sential features of his doctrine under the label 
“propensities.” Between 1908 and 1932 he had 
considerably expanded his list of native propen- 
sities. Its final form includes eighteen items: 

Food-seeking; disgust; sex; fear; curiosity; pro- 
tective or parental propensity; gregariousness; 
self-assertion; submission; anger; appeal; con- 
structive propensity; acquisitive propensity; 
laughter; comfort; rest or sleep; migratory pro- 
pensity; a cluster of specific bodily needs — 
coughing, sneezing, breathing, elimination. (Mc- 

Dougall, 1932, Chap. 7) 

In 1919 Knight Dunlap hurled the first anti- 
instinct bombshell. From that time onward the 
whole theory came under attack. Objections to 
it are many. For one thing, there is no factual 
proof for the existence of instincts in Mc- 
Dougall’s sense of the term. For another, the 
doctrine brings with it a curious sterility in 
theorizing. It implies that what is given by 
heredity is final. Having discovered “first 
causes” and “prime movers” there is no incen- 
tive to look further and discover additional basic 
principles of human behavior. Furthermore, 
instinct doctrines inevitably regard certain mo- 
tives as primary, and all others as secondary, 
derived, and less important. It is questionable 
whether such a distinction can be sustained. 
McDougall himself felt compelled to supple- 
ment instincts by sentiments, but do such com- 
plex forms of moral and socialized behavior 
merely borrow their energy from a handful of 
original and unchanging instincts? Finally, 
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American social scientists generally have a dis- 
taste for nativism, the view that human nature 
is fixed and at bottom unalterable. The Ameri- 
can intellectual climate favors environmental- 
ism. 

But it must not be thought that instincts 
were discarded altogether. How could they be? 
Elementary motives and reflex adjustments are 
manifestly present in infancy prior to training 
and independent of experience. Even parsi- 
monious behaviorists had to admit innate 
pushes of some sort, variously called “tissue 
changes,” “prepotent reflexes,” or “primary 
drives" But these initial motives were not 
viewed, as in McDougall's case, as enduring and 
changeless units of human nature; they became 
incorporated through learning into a chain of 
habits which constituted the "true" unit. 

Besides behaviorism, which represents a dis- 
tinct countertrend to instinctivism, there is a 
somewhat colorless middle position that many 
authors today prefer to hold. They, like Mc- 
Dougall, feel the necessity for dynamic units, 
and variously affirm the existence of desires, 
wishes, needs, vectors, or simply, motives. Such 
units are employed as building blocks in various 
contemporary systems, but they differ from 
earlier instinctive units in that they try to side- 
step the thorny question of innateness. Modern 
writers seem not to care whether a motivational 

unit is inborn or learned. Likewise, they tend 
to hold their postulated units in a tentative, 
heuristic way — as mere "constructs" — and not 
to regard them as eternally fixed in the person 
or in the species. 

Habit. Although James affirmed the existence 
and importance of instinct, he gave chief weight 
to the habit unit. Instincts as such never en- 
dure; strictly speaking, they manifest themselves 
only once in a lifetime, thereafter learning im- 
mediately sets in. This doctrine of the transi- 
toriness of instinct prepared the way for James's 
famous apotheosis of habit. It is more than sec- 
ond nature, he insisted, it is “ten times nature." 


Habit is thus the enormous flywheel of society, 
its most precious conservative agent. It alone is 
what keeps us all within the bounds of ordinance, 
and saves the children of fortune from the envious 
uprisings of the poor....]t keeps different 
social strata from mixing. Already at the age of 
twenty-five you see profesional mannerisms 
settling down on the young commercial traveller, 
on the young doctor, on the young minister, on 
the young counsellor-atlaw. . . . It is well for the 
world that in most of us, by the age of thirty, the 
character has set like plaster, and will never soften 
again. (1890, I, p. 121) 


'The social and ethical consequences of this 
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view of habit are, according to James, “nu- 
merous and momentous.” Since in his view we 
acquire habits primarily through repetition of 
actions, it behooves us to pay alert attention to 
the formation of right habits. In building good 
habits never let an exception occur. Whether 
they are good or bad, all habits are a ball-and- 
chain. 

It was partly due to the eloquence of James 
that the habit unit came to be taken for granted 
during the following three or four decades. To 
be sure, it became recast in terms of “condi- 
tioned response” in the hands of behaviorists, 
but this unit is only another version of the habit 
doctrine: Holt (1915), Watson (1919), F. H. 
Allport (1924) . Somewhat later, in conjunction 
with what is sometimes known as “Yale learning 
theory,” the concepts "response" and "habit | 
family hierarchy" became preferred (e.g., Miller 
and Dollard, 1941). But while the habit theory 
has grown more sophisticated than that depicted - 
by James, the unit is of the same genre. 

A somewhat different conception of habit was 
employed systematically by Dewey “as the key 
to social psychology" (1922), only a few years 
after he forsook his advocacy of the instinct unit. 
In two important respects Dewey's view differs — 
from most others. In the first place, the unit is - 
more flexible; it is not merely a frozen connec: — 
tion in motor pathways, it includes general 
attitudes, outlooks, purposes, interests. A man 
may commit murder only once in his lifetime. 
The act is, however, due to habit — to a habit | 
of hatred. Furthermore, habit has a motiva- | 
tional character: 


We may think of habits as means, waiting, like | 
tools in a box, to be used by conscious resolve. - 
But they are something more than that. They are 
active means, means that project themselves, ener- 
getic and dominating ways of acting. (p. 25) 


In the second place, Dewey tried to include 
situationism within his conception of habit. The 
neural mechanism alone does not anchor the 
habit. The environment helps to sustain it 
If the environment changes the habit will 
change. Every manifestation of a habit is al- 
tered by other people who are concerned in 
act. "Conduct is always shared; this is the 
difference between it and the physiological pro 
ess" (p. 17). 
Dewey was attempting to claim too many 2 
tributes for the habit unit (fixity and chang 
neural impulse and situation, motivation 
instrumentality). As a result, the unit seemed 
vague. Later writers did not follow his lead oF 
attempt to refine the concept for systematic User 
The reason, no doubt, is that the concept attt- 
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tude was already established. For all intents and 
purposes it filled the need that Dewey felt. 

Attitudes. The following brief historical sur- 
vey of the attitude concept is abbreviated from 
the author's earlier and more extended chapter 
on "Attitudes" published in The Handbook of 
Social Psychology, edited by C. C. Murchison 
(1935) . 

This concept is probably the most distinctive 
and indispensable concept in contemporary 
American social psychology. No other term ap- 
pears more frequently in experimental and 
theoretical literature. Its popularity is not diffi- 
cult to explain. It has come into favor, first of 
all, because it is not the property of any one 
psychological school of thought, and therefore 
serves admirably the purposes of eclectic writers. 
Furthermore, it is a concept which escapes the 
controversy concerning the relative influence of 
heredity and environment. Since an attitude 
may combine both instinct and habit in any 
proportion, it avoids the extreme commitments 
of both the instinct theory and environmental- 
ism. The term likewise is elastic enough to 
apply either to the dispositions of single, iso- 
lated individuals or to broad patterns of culture 
(common attitudes). Psychologists and sociolo- 
gists therefore find in it a meeting point for 
discussion and research. This useful, one might 
almost say peaceful, concept has been so widely 
adopted that it has virtually established itself 
as the keystone in the edifice of American social 
psychology. In fact, several writers, starting 
with Thomas and Znaniecki (1918), have de- 
fined social psychology as "the scientific study 
of attitudes." 

Like most abstract terms in the English lan- 
guage, attitude has more than one meaning. 
Derived from the Latin aptus, it has on the one 
hand the significance of "fitness" or "adapted- 
ness," connoting, as does its by-form aptitude, 
a subjective or mental state of preparation for 
action. Through its use in the field of art, how- 
ever, the term came to have a quite independent 
meaning; it referred to the outward or visible 
posture (the bodily position) of a figure in 
Statuary or painting. The first meaning is 
clearly preserved in the phrase "mental atti- 
tudes," and the second meaning in "motor atti- 
tudes.” Since mentalistic psychology historically 
precedes response psychology, it is only natural 
to find that mental attitudes are given recogni- 
tion earlier than motor attitudes, One of the 
earliest psychologists to employ the term was 
Herbert Spencer. In his First Principles (1862), 
he wrote: 

Arriving at correct judgements on disputed 
questions, much depends on the attitude of mind 


we preserve while listening to, or taking part, in 
the controversy; and for the preservation of a 
right attitude it is needful that we should learn 
how true, and yet how untrue, are average human 
beliefs. (Vol I, 1, i) 


Somewhat later, when psychologists were for- 
saking their exclusively mentalistic point of 
view, the concept of motor attitudes became 
popular. In 1888, for example, N. Lange devel- 
oped a motor theory wherein the process of a 
perception was considered to be in large part a 
consequence of muscular preparation or "set." 
At about the same time Münsterberg (1889) de- 
veloped his action theory of attention, and 
Féré (1890) maintained that a balanced con- 
dition of tension in the muscles was a determin- 
ing condition of selective consciousness. 

In recent years it is uncommon to find ex- 
plicit labeling of an attitude as either "mental" 
or "motor." Such a practice smacks of body- 
mind dualism, and is therefore distasteful to 
contemporary psychologists. In nearly all cases 
today the term appears without a qualifying 
adjective, and implicitly retains both its original 
meanings: a mental aptness and a motor set. 
Attitude connotes a neuropsychic state of readi- 
ness for mental and physical activity. 

Perhaps the first explicit recognition of atti- 
tudes within the domain of laboratory psychol- 
ogy was in connection with a study of reaction 
time. In 1888 L. Lange discovered that a sub- 
ject who was consciously prepared to press a 
telegraph key immediately upon receiving a 
signal reacted more quickly than did one whose 
attention was directed mainly to the incoming 
stimulus, and whose consciousness was not there- 
fore directed primarily upon the expected re- 
action. After Lange’s work, the task attitude, or 
Aufgabe, as it came to be called, was discovered 
to play a decisive part in nearly all psychological 
experiments. Not only in the reaction experi- 
ment, but in investigations of perception, recall, 
judgment, thought, and volition, the central 
importance of the subjects’ preparedness became 
universally recognized. In Germany, where 
most of the early experimental work was done, 
there arose a swarm of technical expressions to 
designate the varieties of mental and motor 
"sets" that influence the subjects’ trains of 
thought or behavior during the experiment. 
In addition to the Aufgabe, there was the Ab- 
sicht (conscious purpose), the Zielvorstellung 
(or idea of the goal) , the Bezugsvorstellung (idea 
of the relation between the self and the object 
to which the self is responding) , the Richtungs- 
vorstellung (or idea of direction) , the determin- 
dierende Tendenz (any disposition that brings 
in its train the spontaneous appearance of a 
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determined idea), the Einstellung, a more gen- 
eral term (roughly equivalent to "set"), the 
Haltung (with a more behavioral connotation) , 
and the Bewusstseinslage (the "posture or lay 
of consciousness"). It was perhaps the lack of 
a general term equivalent to "attitude" that led 
the German experimentalists to discover so 
many types and forms. The lack may also ex- 
plain why no systematic social psychology was 
written in Germany based upon a unified con- 
cept of "attitude." 

'Then came the lively controversy over the 
place of attitudes in consciousness. The Würz- 
burg school was agreed that attitudes were nei- 
ther sensation, nor imagery, nor affection, nor 
any combination of these states. Time and 
again attitudes were studied by the method of 
introspection, always with meager results. Often 
an attitude seemed to have no representation in 
consciousness other than a vague sense of need, 
or some indefinite and unanalyzable feeling of 
doubt, assent, conviction, effort, or familiarity 
(Titchener, 1909) . 

As a result of the Würzburg work all psy- 
chologists came to accept attitudes, but not all 
believed them to be impalpable and irreducible 
mental elements. In general, the followers of 
Wundt believed that attitudes could be ac- 
counted for adequately as feelings, particularly 
as some blend of striving and excitement. Clarke 
(1911), a pupil of Titchener, found that atti- 
tudes in large part are represented in conscious- 
ness through imagery, sensation, and affection, 
and that where no such states are reported there 
is presumably merely a decay or abbreviation of 
these same constituents. 

However they might disagree upon the nature 
of attitudes as they appear in consciousness, all 
investigators, even the most orthodox, came to 
admit attitudes as an indispensable part of their 
psychological armamentarium. Titchener is a 
case in point. His Outline of Psychology in 
1899 contained no reference to attitude; ten 
years later, in his Textbook of Psychology, sev- 
eral pages are given to the subject, and its sys- 
tematic importance is fully recognized. 

Many authors have reduced the phenomena 
of perception, judgment, memory, learning, and 
thought largely to the operation of attitudes 
(cf., €g» Ach, 1905; Bartlett, 1932). Without 
guiding attitudes the individual is confused and 
baffled. Some kind of preparation is essential 
before he can make a satisfactory observation, 
pass suitable judgment, or make any but tlie 
most primitive reflex type of response. Attitudes 
determine for each individual what he will see 
and hear, what he will think and what he will 
do. To borrow a phrase from William James, 


they "engender meaning upon the world"; they 
draw lines about, and segregate, an otherwise 
chaotic environment; they are our methods for 
finding our way about in an ambiguous uni- 
verse. It is especially when the stimulus is not 
of great intensity nor closely bound with some 
reflex or automatic response that attitudes play 
a decisive role in the determination of meaning 
and of behavior. 

The meagerness with which attitudes are rep- 
resented in consciousness resulted in a tendency 
to regard them as manifestations of brain activ- 
ity or of the unconscious mind. The persistence 
of attitudes that are totally unconscious was 
demonstrated by Müller and Pilzecker (1900), 
who called the phenomenon "perseveration." 
The tendency of the subject to slip into some 
frame of mind peculiar to himself led Koffka 
(1912) to postulate “latent attitudes." Wash- 
burn (1916) characterized attitudes as "static 
movement systems" within the organs of the 
body and the brain. Other writers, still more 
physiologically inclined, subsumed attitudes 
under neurological rubrics: traces, neurograms, 
incitograms, brain patterns, and the like. The 
contribution of the Würzburger and of all other 
experimental psychologists was in effect the 
demonstration that the concept of attitude is in- 
dispensable. 

But it was the influence of Freud that en- 
dowed attitudes with vitality, equating them 
with longing, hatred and love, with passion and 
prejudice, in short, with the onrushing stream 
of unconscious life. Without the painstaking la- 
bors of the experimentalists, attitudes would 
not today be an established concept in the field 
of psychology; but also without the influence of 
psychoanalytic theory they would certainly have 
remained relatively lifeless, and would not have 
been of much assistance to social psychology. 
For the explanation of prejudice, loyalty, patri- 
otism, crowd behavior, control by propaganda, 
no anemic conception of attitudes will suffice. 

As we have said, the instinct hypothesis did 
not satisfy social scientists for long, for the very 
nature of their work forced them to recognize 
the importance of custom and environment in 
shaping social behavior. The instinct hypothe- 
sis has precisely the contrary emphasis. What 
social scientists required was a new psychologi- 
cal concept that would escape on the one hand 
from the hollow impersonality of "custom" and 
“social force," and on the other from nativism. 
They gradually adopted the concept of attitude. 

The credit for instituting the concept as a 
permanent and central feature in sociological 
writing must be assigned to Thomas and Znani- 
ecki (1918), who gave it systematic priority in 
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their monumental study of Polish peasants. Be- 
fore this time the term had made only sporadic 
appearances in sociological literature, but im- 
mediately afterward it was adopted with en- 
thusiasm by scores of writers. 

According to Thomas and Znaniecki, the 
study of attitudes is par excellence the field of 
social psychology. Attitudes are individual men- 
tal processes which determine both the actual 
and potential responses of each person in the 
social world. Since an attitude is always directed 
toward some object, it may be defined as a “state 
of mind of the individual toward a value.” 
Values are usually social in nature, that is to say, 
they are objects of common regard on the part 
of socialized men. Love of money, desire for 
fame, hatred of foreigners, respect for a scien- 
tific doctrine are typical attitudes. It follows 
that money, fame, foreigners, and a scientific 
theory are all values. A social value is defined 
as "any datum having an empirical content ac- 
cessible to the members of some social group 
and a meaning with regard to which it is or may 
be an object of activity." There are, to be sure, 
numerous attitudes corresponding to every so- 
cial value; there are, for example, many atti- 
tudes regarding the church or the state. There 
are also numerous possible values for any single 
attitude. The iconoclast may direct his attacks 
quite at random upon all established social 
values, or the Philistine may accept them all 
uncriticalll. Hence in the social world, as 
studied by the sociologist, both values and atti- 
tudes must have a place. 

Following closely in the same vein of thought, 
Faris (1925) proposed additional refinements. 


He would distinguish between conscious and ` 


unconscious attitudes, between mental and mo- 
tor attitudes, between individual and group 
attitudes, and between latent and kinetic atti- 
tudes. Park (see Young, 1931) , who is likewise 
in essential agreement with this school of 
thought, suggests four criteria for an attitude: 
(1) it must have definite orientation in the 
world of objects (or values) , and in this respect 
differ from simple and conditioned reflexes; 
(2) it must not be an altogether automatic and 
routine type of conduct, but must display some 
tension even when latent; (3) it varies in inten- 
sity, sometimes being regnant, sometimes rela- 
tively ineffective; (4) it is rooted in experience, 
and therefore is not simply a social instinct. 
The following are typical definitions of atti- 
tude that emerged: 
Attitude = the specific mental disposition to- 


ward an incoming (or arising) experience, where- 
by that experience is modified; or, a condition of 


readiness for a certain type of activity. (Warren. 
Dictionary of Psychology, 1934) 

An attitude is a mental disposition of the hu- 
man individual to act for or against a definite 
object. (Droba, 1933) 

An attitude is a mental and neural state of 
readiness, organized through experience, exerting 
a directive or dynamic influence upon the indi- 


viduals response to all objects and situations 
with which it is related. (G. W. Allport, 1935) 


As we have said, the attitude unit has been 
the primary building stone in the edifice of 
social psychology. It has, of course, had many 
critics. Especially in recent years, learning 
theorists, field theorists, phenomenologists have 
attempted to dislodge it. But it is questionable 
whether their combined efforts can do more 
than refine the concept for future use. After 
all, social behavior reflects so much organiza- 
tion, recurrent and habitual expectancy, that 
the doctrine of attitude (or some close equiva- 
lent) is necessary. Without some such concept, 
social psychologists could not work in the fields 
of public opinion, national character, or institu- 
tional behavior — to mention only a few areas; 
nor could they characterize the mental organiza- 
tion of social man. The term itself may not be 
indispensable, but what it stands for is. Mc- 
Dougall has argued that “tendency” is the basic 
postulate of all psychology (1937). While he 
did not like the term “attitude,” he was affirm- 
ing the need for some corresponding unit of 
analysis. His preference was for "sentiment." 

Sentiment. According to McDougall: 


The theory of sentiments is the theory of the 
progressive organization of the propensities (in- 
stincts) in systems which become the main sources 
of all our activities; systems which give consis- 
tency, continuity and order to our life of striving 
and emotion; systems which in turn become or- 
ganized in larger systems, and which, when har- 
moniously organized in one comprehensive system, 
constitute what we properly call character. (1932, 
p. 211) 


Thus conceived sentiment differs from atti- 
tude in four respects. (1) A sentiment presup- 
poses underlying propensities, whereas attitude 
is a disposition within the organism considered 
without regard to its origins or source of en- 
ergy. (2) An attitude may be either specific or 
diffuse in its reference, whereas a sentiment is 
centered around some definite object. One 
could speak, for example, of an “antisocial atti- 
tufle," but scarcely of an antisocial sentiment. 
(3) Sentiments are conceived as more lasting 
and hierarchical than attitudes need be. The 
transitory Aufgabe is an attitude as truly as is 
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an enduring philosophy of life, but it is not a 
sentiment. (4) Sentiment is conscious and benign 
as contrasted to complex, which is a morbid and 
repressed sentiment. Attitude could cover both 
wholesomely conscious and morbidly repressed 
dispositions. 

The concept of sentiment received its first 
systematic use in the hands of Shand (1896) 
whose theory that human character is composed 
of sentiments was adopted and elaborated by 
McDougall. Not only does sentiment seem to 
these authors to provide a proper unit for 
characterizing man’s social attachments, but also 
to provide a proper unit for the theory of per- 
sonality. To McDougall the unity of personality 
is guaranteed by the possession of the crowning 
self-regarding sentiment at the top of the hier- 
archical structure. 

Certain recent authors likewise have expressed 
strong preference for the concept of sentiment 
and strong dislike for the attitude unit. To 
Murray and Morgan (1945) , for example, “atti- 


Tue BEGINNINGS OF 

Experimentation. The year 1879 saw the for- 
mal establishment of the first laboratory of psy- 
chology. Credit goes to Wundt at Leipzig, al- 
though both James at Harvard and Wundt had 
demonstrational (not research) laboratories as 
early as 1875. But it took several years longer 
for social variables to enter the laboratory. The 
first experimental problem — indeed the only 
problem for the first three decades of experi- 
mental research — was formulated as follows: 
What change in an individual's normal solitary 
performance occurs when other people are pres- 
ent? 

The first laboratory answer to'this question 
came from Triplett (1897). While examining 
the official records of bicycle races Triplett 
noted that a rider’s maximum speed was ap- 
proximately 20 percent greater when he was 
paced by a visible multicycle. Desiring to learn 
more about the matter, he set up an experiment 
with children in the age range of ten to twelve, 
giving them the task of winding fish reels. Alter- 
nating situations alone and together, he found 
that when working together 20 of his 40 subjects 
excelled their own solitary record, while 10 did 
less work, apparently because they were over- 
stimulated by the desire to win, and 10 were 
essentially unaffected. All in all, he concluded 
that the group situation must normally be 
thought of as producing greater output of ea- 
ergy and achievement. His explanation is of 
interest: 

The bodily presence of another contestant par- 
ticipating simultaneously in the race serves to 


tude” seems too superficial, too motor, to repre- 
sent the basic organization of needs and cathexes 
that is, to their minds, the core of "sentiment." 

Final word. The four types of units we have 
discussed — instinct, habit, attitude, sentiment 
— all claim that dynamic dispositions of some 
type underlie social behavior. They all have the 
weakness of assuming a fixity of disposition in 
the person, and overlook the flexibility of be- 
havior when environmental situations alter. 
This weakness accounts for the strong attack 
leveled upon all conative units by field theorists 
and phenomenologists. In recent years the pen- 
dulum has swung away from emphasis on the 
static unit to a field or situational determinism. 
Yet individuals in varying situations are not al- 
together different; the same dispositions tend to 
be manifest in varying fields. In correcting for 
the exaggerated fixity implied in these units, it 
is not necessary to abandon them altogether in 
favor of momentary situationism, or in favor of 
an unqualified field theory. 


OBJECTIVE METHOD 
liberate latent energy not ordinarily available. 
... The sight of the movements of the pace- 
makers or leading competitors, and the idea of 
higher speed furnished by this or other means, 
are probably in themselves dynamogenic factors 
of some consequence. (p. 533) 


In his experimental design, as well as in his ex- 
planation, Triplett fails to distinguish two 
causal factors: emotional competitiveness on 
the one hand, and simple dynamogenic effect 
resulting from the sights and sounds of co- 
workers, on the other. We have already marked 
how dominant the ideo-motor theory was in late 
nineteenth-century psychology. Automatic and 
unconscious movements resulting from im- 
planted ideas or from mere sensory or affective 
stimulation, were objects of great interest. Féré 
(1900), a physician in Charcot's clinic, Binet 
(1900) , and others, had great faith in the ex- 
planatory value of the concept. Automatisms, 
mind-reading, suggestion, imitation, hypnotic 
phenomena were all traced to dynamogenic ef- 
fects. It was natural to explain group influence 
in this way. 

The same basic problem was an early concern 
of pedagogics. A. Mayer (1903) studied both 
the quantity and quality of school children's 
homework and classwork with respect to memo- 
rization, composition, arithmetic, and other 
tasks. On the whole, his results clearly favored 
group over solitary work, although he found, as 
Triplett did, that conditions of overkeen compe- 
tition in congregate groups affected adversely 
the quality of the production. 
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It was a full generation later that the im- 
portant distinction was drawn between emula- 
tion or competition on the one hand, and simple 
“social facilitation” (whether or not explained 
in terms of dynamogenesis) on the other. F. H. 
Allport ( 1924) , who drew this distinction, also 
pointed to the existence of two kinds of small 
groups: the co-acting and the face-to-face. In 
the former one finds the simplest psychology op- 
erating: people working side by side receive 
their social stimulation almost wholly from 
“contributory” social stimuli and thus most 
clearly display the effects of simon-pure “social 
facilitation.” In face-to-face groups, on the other 
hand, entirely new problems arise: conversa- 
tion, interaction, circular social behavior, 
“group thinking,” and all manner of direct (not 
merely contributory) social effects. Accepting 
this distinction, we may say that early work 
dealt entirely with co-acting groups. It took a 
longer time for face-to-face groups to become a 
central focus of interest. Bechterew and de 
Lange (1924), G. Watson (1928), and, espe- 
cially, Kurt Lewin (see collections of his studies, 
1948, 1951) gave incentive to research in face-to- 
face groups. 

In 1913 Moede began a monumental program 
of research on co-acting groups. He decided 
that a systematic attempt should be made to in- 
troduce the social variable into all, or nearly all, 
standard psychological experiments. If thresh- 
olds for the intensity of audible sound could be 
determined for the individual working alone, 
why not for two, four, six, or any number of 
subjects working in the presence of one another? 
Also, why should not the available instrumental 
devices for recording involuntary muscular 
movements be used in the study of imitation? 
The subject's involuntary movements could, for 
example, be measured while he was watching 
similar movements of the experimenter’s arm. 
Also associative processes, shifts in the fixation 
of attention, learning and forgetting — all well- 
known phenomena in experimental psychology 
— could be (and were) studied, using groups of 
Leipzig school and college students as subjects. 
Moede published his results in Experimentelle 
Massenpsychologie (1920) , but his pioneer con- 
ception had less influence than it should, partly 
because he failed to generalize his findings into 
systematic theory, and partly because his stimu- 
lating book was not translated into English. 

Moede's experimental program was, however, 
known to Münsterberg at Harvard. It was he 
who, in 1915, encouraged F. H. Allport to un- 
dertake work in this area. The latter confirmed 
the principal findings of Moede and his prede- 
cessors and formulated a series of generaliza- 


tions (1920). Even when the effects of 
competitiveness are minimized the mere pres- 
ence of co-workers seems ordinarily to increase 
the quantity of output (social increment) ; it 
tends, however, to make judgments conservative, 
to make associations less personal, and to dimin- 
ish the quality of reasoning and other complex 
mental operations (social subvaluent). This 
work was systematically extended by Whitte- 
more (1924) who added the variable of com- 
petitiveness, and found in general that it 
enhanced all these social effects. Unlike Trip- 
lett, these authors explained their results in 
terms of “contributory social stimulation" — a 
summation of cues and innervations resulting 
from previous conditioning and leading to the 
excess output over solitary production. When 
the variable of competition is added, emotional 
reinforcement from within summates with the 
contributory cues from without. 

"This type of work, with many elaborations, 
was carried on by subsequent investigators. 
Travis, for example, found that some of the ef- 
fects were reversed in the case of stutterers, 
whose social shyness seemed to be an inhibiting 
variable (1925). Dashiell summarized all this 
work and added his own research on the imag- 
ined as compared with the actual presence of 
others in his review of 1985. Among early ex- 
periments on group influence was that of H. T. 
Moore, who gauged the relative effect of expert 
opinion and majority opinion upon moral and 
aesthetic judgments (1921). Moore's research 
initiated a new method. Instead of having “ex- 
perts" or a "majority' physically present, he 
merely reported their judgments to his subjects. 
In this way it was discovered that the effects 
of social prestige can be measured simply 
through instructions that induce an attitude. 
Countless subsequent investigations have taken 
advantage of this fact, and have measured social 
effects through the artificial arousal of pre- 
existing attitude. 

Russian psychology, also employing the ex- 
perimental method, soon after the Bolshevik 
revolution turned its attention to problems of 
collective vs. individual behavior. Bechterew 
and de Lange (1924), using face-to-face (not 
merely co-acting) groups, report on the effects 
of discussion in modifying previous individual 
judgments. Collective thinking, they concluded, 
is not less efficient than private thinking; often 
it is superior in terms of the accuracy of judg- 
ments, Russian investigations centered atten- 
tion likewise on the relative merits of competi- 
tive and co-operative social situations. As 
summarized by Boldyreff (1937), their studies 
established in general the superiority of co- 
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operative arrangements. Recently, however, 
Bauer (1952) has shown that Soviet psychology 
has shifted from its earlier emphasis upon the 
importance of environmental, situational, and 
co-operative factors in human behavior, to an 
emphasis upon the duties and obligations of the 
individual. The superiority of free discussion, 
co-operation, and teamwork, established by 
early experiments, seems today less congenial to 
Communist Party ideology. 

In America the narrower formulation of the 
experimental question gradually gave way. 
Group influence upon the mental processes of 
the individual was no longer the sole focus of 
experimental interest. Lewin, Lippitt, and 
White (1939) introduced the conception of 
social climate or group atmosphere. They 
formed experimental groups under various 
types of leadership to approximate authorita- 
rian, democratic, and laissez-faire social at- 
mospheres. Following this historic experiment, 
interest in group dynamics, group structure, 
group decision, group cohesion, spread rapidly 
into laboratory, community, and industrial re- 
search. A broader horizon for experimental so- 
cial psychology opened. 

Controlled observation. While the experi- 
mental method approaches the ideal design for 
social research, it is not adapted to all of the 
problems with which social psychology must 
wrestle. No doubt most investigators, from the 
earliest days, would have insisted that their 
theories and dicta were based on the solid foun- 
dation of "facts" Comte, Spencer, Tarde, Durk- 
heim — all theorists — were interpreting social 
data as they observed them. What they lacked 
was a method for checking and extending their 
unaided observation. 

Perhaps the earliest attempt to improve pri- 
vate observation was through the adoption of 
questionnaire procedures. Boring (1950) re- 
ports uses of this method as early as 1869 in the 
Berlin schools. Galton (1888) leaned upon 
questionnaires in his famous study of imagery. 
Its use on a vast scale commenced with G. Stan- 
ley Hall (1891), chiefly in his studies of child 


psychology. Soon Starbuck (1899) adapted the 
method for investigations of the religious beliefs 
and practices of adults. James's work of genius, 
Varieties of Religious Experience (1902), was 
based in part upon Starbuck's questionnaire 
data. In the course of time the need for refine- 
ments and safeguards brought special methodo- 
logical researches in matters of wording, inter- 
viewing, coding of responses, and scaling. 
Originally a loose, naive instrument, the ques- 
tionnaire has gradually been forged into a tool 
of considerable precision. 

Gradually, too, statistical aids, invented by 
Quetelet, Galton, Pearson, Fisher, and others 
came to serve as safeguards for research in social 
psychology. Devoting himself specifically to the 
problem of attitude measurement, Thurstone 
(1927) first introduced statistical sophistication 
into this important area of investigation. So- 
ciometry, a quasi-quantitative technique in- 
vented by Moreno (1934), enabled social 
psychologists to assess the personal attractions 
and rejections among members of a group. Fol- 
lowing Gallup's first signal success with public 
opinion polling in the presidental elections of 
1936, an elaborate special branch of sampling 
and opinion research rapidly developed. 

Since most of these signal strides in method 
are of recent date, they do not form a part of 
our historical story, The fact is that empiricism ` 
and positivism did not enter social psychology 
to any appreciable extent until the decade of 
the 1920's. The ideals of objectivity and pre- 
cision then rapidly assumed a dominant posi- 
tion. By 1931 Murphy and Murphy, in the first 
edition of their Experimental Social Psychology, 
were able to list over eight hundred relevant 
studies, and in a revised edition (1937) ex- 
panded the entries by several hundred addi- 
tional titles. Hornell Hart (1949) has 
demonstrated that in the 1930's the upswing 
of published research began to conform to an 
exponential curve of increasing output. No 
tapering off of this accelerating rate of expan- 
sion is yet in sight. 


TEXTBOOKS IN SOCIAL PsvcHOLOGY 


One may learn something concerning the his- 
torical background of contemporary social psy- 
chology by surveying its textbooks. The 
accompanying list includes most, if not all, of 
the textbooks published in English through the 
year 1952. Many borderline titles are omitted, 
such as Dewey's Human Nature and Conduct: 


1908 E. A. Ross, 
1908 W. McDoucatt, 


an Introduction to Social Psychology (1922), 
and Znaniecki’s The Laws of Social Psychology 
(1925) , on the grounds that such books are less 
comprehensive than the ordinary textbook. 
Such exclusions, to be sure, are debatable and 
to some degree arbitrary. 


Social Psychology: An Outline and Source Book. New York: Macmillan. 
Introduction to Social Psychology. London: Methuen & Co. 


1917 
1918 


1922 
1923 
1924 
1924 
1925 
1925 
1925 
1925 
1927 
1928 
1929 
1929 
1929 


1930 
1931 
1931 
1931 


1934 
1934 
1935 
1935 
1936 
1936 
1936 
1936 
1938 
1938 
1938 
1940 
1940 
1941 
1942 
1947 


1948 


1948 
1948 
1949 
1949 
1949 
1950 
1950 
1950 
1951 
1952 
1952 
1952 
1952 
1952 


THE HISTORICAL BACKGROUND OF MODERN SOCIAL PSYCHOLOGY 


C. A. Ettwoon, 
E. S. Bocarous, 


J. M. Wurrams, 
R. H. GAULT, 
F. H. ALLPORT, 
E. S. BocARDUS, 


K. DunLar, 


C. A. ELLWOOD, 
R. H. THOULESS, 
L. L. BERNARD, 


K. Youwc, 
R. Muxeryjee and N. N. SENGUPTA, 
B. C. Ewer, 


J. R. Kantor, 
C. Murcuison, 


K. Younc, 


J. K. Fotsom, 


E. T. Kruecer and W. C. RECKLESS, 
G. and Lois B. Muprnx, 


L. G. Brown, 


A. MEYERSON, 


C. Murcuison, (Ed.) 
J. J. Smits, 


J. F. Brown, 
E. FREEMAN, 


H. GURNEE, 
R. T. La Pirre and P. R. FARNSWORTH, 
D. Kartz and R. L. SCHANCK, 
J. M. Remnarpt, 


R. T. La PIERE, 
C. Bro, 
O. KLINEBERG, 


S. H. Britt, 


M. H. Knovr, 


T. M. Newcoms and E. L. HARTLEY, 


D. Krecu and R. S. CRUTCHFIELD, 


M. SHERIF, 


M. F. VAUGHAN, 
N. S. PHADKE, 
A. L. BEELEY, 


A. R. LinpesMITH and A. L. STRAUSS, 


T. M. NEWCOMB, 
S. S. SARGENT, 
O. FRIEDMAN, 


E. L. QUEENER, 
S. E. Ascu, 


. W. Doon, 


L. 
H 
H 


Faris, 


ARTLEY and R. E. HARTLEY, 


. SrROTT, 


4n Introduction to Social Psychology. New York: D. Appleton & Co. 
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Essentials of Social Psychology. Los Angeles: University of Southern California 


Press. 


Principles of Social Psychology. New York: Knopf. 
Social Psychology: the Basis of Behavior Called Social. New York: Holt. 
Social Psychology. Boston: Houghton Mifflin. 
Fundamentals of Social Psychology. New York: Century Co. 
Social Psychology. Baltimore: Williams & Wilkins. 
The Psychology of Human Society. New York: Appleton. 
Social Psychology. London: Clive. 
Introduction to Social Psychology. New York: Holt. 
Source Book for Social Psychology. New York: Knopf. 
Introduction to Social Psychology. London: Heath. 
Social Psychology. New York: Macmillan. 
An Outline of Social Psychology. Chicago: Follett Publishing Co. 


Social Psychology: the Psychology of Political Domination. Worcester, Mass.: Clark 


University Press. 


Social Psychology. New York: Crofts. 
Social Psychology. New York: Harper & Bros. 
Social Psychology. New York: Longmans. 
Experimental Social Psychology. New York: Harper & Bros. (Revised Edi- 
tion, G. & Lois B. Murphy & T. M. Newcomb. 1937) 

Social Psychology: the Natural History of Human Nature. New York: McGraw-Hill. 
Social Psychology. New York: Prentice-Hall. 
Handbook of Social Psychology. Worcester, Mass.: Clark University Press. 
Social Psychology. Boston: Bruce Humphries. 

Psychology and the Social Order. New York: McGraw-Hill. 


Social Psychology. New York: Holt. 


Elements of Social Psychology. New York: Farrar & Rinehart. 
Social Psychology. New York: McGraw-Hill. 
Social Psychology. New York: Wiley. 
Social Psychology. Philadelphia: Lippincott. 
Gollective Behavior. New York: McGraw-Hill. 
Social Psychology. New York: Appleton Century. 
Social Psychology. New York: Holt. 
Social Psychology of Modern Life. New York: Farrar & Rinehart. 
Iniroduction to Social Psychology. New York: Harper & Bros. 
Readings in Social Psychology. New York: Holt. 


(Revised. 


Edition: G. E. Swanson, T. M. Newcomb, and E. L. Hartley. 


1952) 


Hill. 


Social Psychology. New York: Holt. 


Social Psychology. New York: Ronald Press. 
Fundamentals of Social Psychology. New York: Knopf. 


Social Psychology. London: Methuen & Co. 


H 


An Outline of Social Psychology. New York: Harper & Bros. 
Social Psychology. New York: Odyssey Press. 
An Outline of Social Psychology. Kolhapur: College Press. 
Outlines of Social Psychology. Utah: University of Utah Press. 
Social Psychology. New York: Dryden Press. 
Social Psychology. New York: Dryden Press. 
Social Psychology. New York: Ronald Press. 
Introduction to Social Psychology. London: Sylvan Press. 
Introduction to Social Psychology. New York: William Sloane. 
Social Psychology. New York: Prentice-Hall. 
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Theory and Problems of Social Psychology. New York: McGraw- 
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The first fact we note in the listing is a grad- 
ual acceleration of output excepting during 
World War II. The distribution in five-year 
intervals is as follows: 


1908-1912 
1913-1917 
1918-1922 
1923-1927 
1928-1932 
1933-1937 
1938-1942 
1943-1947 
1948-1952 


| &— 30 o c ro — N 


Total 52 


Textbooks may vary in content between two 
extremes, At one pole are the handbooks, de- 
voted largely to experimental and empirical 
reporting. These works do not presume to in- 
tegrate all the research surveyed into coherent 
theory. Examples are the handbooks edited by 
Murchison (1935) and Lindzey (1954), and the 
Experimental Social Psychology of Murphy, 
Murphy, and Newcomb (1937). At the other 
extreme we have textbooks almost totally de- 
void of reference to specific investigations. 
Their authors are interested in interpretation 
and in broad generalizations. When data are 
not available, the authors of such texts speculate 
freely about what may be the laws of social 
psychology. And, of course, there are all sorts 
of combinations wherein factual reporting is 
interspersed with interpretation, generalization, 
and speculation. In an article entitled “On the 
making of textbooks in social psychology,” L. L. 
Bernard (1931) defended the latter method. 
This author argued that single ad hoc studies 
are often as misleading as speculation. He 
points out further that many problems in social 
psychology are not accessible to experimental 
study, and that others, while in principle acces- 
sible, have not yet been studied experimentally. 
He likewise raises the question whether experi- 
mental validation is necessarily the one heaven- 
sent proof of assertions in social psychology: 


If one accepts the narrower alternative of put- 
ting into a text only those things which conform 
to one special criterion of validation — say the 
experimental method — he lays himself open to 
serious difficulties in addition to the limited 
character of his product. Who shall decide what 
is the proper criterion of validation? The experi- 
mentalists may naively answer that the experi- 
mental method is its own justification, because its 
results can be duplicated by any other worker 
under the same conditions. I am inclined to think 
that those who would urge a Methodist, a Catho- 
lic, or a Freudian criterion of validation could 


make the same argument. If one will put himself 
in the same attitude of mind as one who urges one 
or the other of these criteria, he will unquestion- 
ably be convinced by the same evidence. (p. 72) 


Bernard here warns us that ultimately deep 
epistemological decisions underlie the building 
of social psychology as a science. To assume that 
ad hoc experiments are themselves the final 
arbiter of truth is, as he maintains, naive. No 
experiment interprets itself; no fact speaks for 
itself. Theories that transcend the specific in- 
stance are necessary, and theory building in 
social psychology is still in its infancy. 

Textbooks vary not only in their addiction 
to findings or to broad interpretation, but also 
in respect to their systematic defense of a single 
point of view. Some authors are happy with a 
loose eclecticism; they worry not at all about 
the coherence between findings, or even between 
theories. Others, by contrast, are intent upon 
constructing a framework that will bind facts 
and theories into a superordinate and logical 
whole. Among the textbooks that can clearly 
be called systematic are those of McDougall 
(1908), F. H. Allport (1924), Brown (1936), 
Sherif (1948), Krech and Crutchfield (1948), 
Asch (1952). Most authors, no doubt, would 
claim that they are presenting a systematic point 
of view. But often their product is merely a 
matter of weighted preference for certain con- 
cepts. Genuinely systematic texts are those that 
tie the data of social psychology to a coherent 
psychological or sociological frame of thought 
broader than the data of social psychology itself. 
Thus McDougall's social psychology is wholly 
consistent with his over-all hormic approach to 
all psychological problems; F. H. Allport's text 
is consistent with the behaviorism prevailing in 
1924. J. F. Brown favors field theory; Sherif, 
cultural determinism; Krech and Crutchfield a 
phenomenological approach; Asch, Gestalt psy- 
chology. 

A final and chief cause of the great diversity 
in textbooks lies in the sociological-psychologi- 
cal bifurcation that marks the entire history of 
social psychology. In the accompanying list, 
by rough estimate, two-thirds of the texts are 
written by psychologists and one-third by so- 
ciologists. Many, therefore, reflect a strong 
psychological bias, and many a sociological. 
What is the difference? 

Earlier in the present essay we ventured to 
define social psychology as the attempt to under- 
stand and explain how the thought, feeling, and 
behavior of individuals are influenced by the 
actual, imagined, or implied presence of other 
human beings. This definition has an obviously 
psychological cast. By contrast, we may refer to 
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the following sociological type of definition. 
Ellwood (1925) writes: 


Social psychology is the study of social interac- 
tion. It is based upon the psychology of group 
life. It begins with an interpretation of group- 
made types of human reactions, of communica- 
tion, and instinctive and habitual actions. (p. 16) 


It would be invidious to designate each and 
all of the texts in the accompanying list as either 
psychological or sociological in approach. Some 


authors would be sure to protest against the 
typing. But there would be little disagreement 
in designating as psychologistic the texts by Mc- 
Dougall (1908) and by F. H. Allport (1924) . 
Nor would there be much opposition to labeling 
as sociologistic the texts by Ellwood (1917) or 
Lindesmith and Strauss (1949). Combination 
points of view are, of course, possible, and are 
increasing in frequency. Newcomb (1950) 
might be said to hold with equal tenacity to a 
social and to a psychological frame of reference. 


FiNAL Worp 


The diversity of textbooks is, of course, a 
natural reflection of the diversity that has 
marked the whole history of social psychology. 
Writers have differed in their definitions of the 
field, in their selection of problems for study, 
and above all in their conceptual tools and pre- 
suppositions. The student of social psychology 
has no choice but to learn many maps, realizing 
that the master chart is not yet available. Many 
maps, however, are preferable to no maps. In- 
vestigators who prefer to start from scratch do so 


at their own peril. When they attempt to put 
their myriad empirical findings together they 
may find that they are entering blind alleys that 
have trapped many in the past; or they may find 
that their “discoveries” are, historically consid- 
ered, mere truisms. But the investigator who is 
aware of the history of his subject is able to 
strike out with assurance. He will be able to 
distinguish new garbs from old, progress from 
platitude, and will thus help build a truly cumu- 
lative science, 
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Two aspects of the varied roles of Stimulus- 
Response theory in contemporary social psy- 
chology will be emphasized here. One general 
area of discussion will be a summary of S-R 
contiguity theory and S-R reinforcement theory; 
the other will be a consideration of the problem 
of generalizing S-R theory for social psychology. 
Before these topics are broached specifically, 
however, some precursors of these theories will 
be touched on. 

For some, the current notions of S-R theorists 
are direct descendants of various doctrines of the 
association of ideas; for others the theories are 
operationally bound games of runways and 
mazes; and for still others the theories are based 
on artificial data and are fitting only to the 
experimental laboratory whence they came. 
Decision as to whether any of these characteri- 
zations is correct will, of course, be an individual 
matter, but a view of the present range and 
extensions of S-R theory in social psychology is 
of quite general interest. Insofar as SR theory 
is associationistic, it has a history as long as 
that of western culture. Insofar as it is couched 
within a general empiricist philosophy, its his- 
tory is largely that of the British empiricist 
school. Insofar as it is experimental, its history 
is as short as the history of experimental psy- 
chology. 

Two major figures of modern psychology and 
physiology have furnished original insights of 
crucial importance in the early development of 
stimulus-response positions. These two men 
were Pavlov (1927) and Thorndike (1933). 
Pavlov's use of experimental methods in identi- 
fying the important variables involved in a 
conditioning situation served as a major im- 
petus to the developers of S-R theory. His work 
is well enough known not to require descrip- 
tion here. Although none of the recognized con- 
temporary theorists could fairly be described as 
a Pavlovian, these early experiments have per- 
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sisted as major points of reference for theoretical 
clarification and induction. The work of Thorn- 
dike has also been of great importance, and the 
attempt to integrate his thought within the 
Pavlovian tradition has been a fruitful area of 
controversy and advance. The major current 
theoretical systems have in one way or another 
attempted thus to integrate the work of Pavlov 
and Thorndike. An excellent picture of the 
present thinking on this topic can be found in 
Postman's (1947) recent analysis and evaluation 
and in Meehl's (1947) comment. 

The attempts at integration of Pavlovian and 
Thorndikian thinking have progressed as an 
area within the changing outlines of the be- 
haviorist movement, which has itself expanded 
to include much of present American psychol- 
ogy. Spence (1948) has stated that behaviorism 
as a school no longer exists but that some of its 
early emphases are now almost ubiquitous. The 
first of these emphases is on what Watson (1913) 
called the “naive use of consciousness.” Watson 
stated that psychology did not need to deal with 
consciousness in the sense of observing special 
“states of consciousness.” It should return, 
rather, to a nonreflective and naive use of con- 
sciousness, which is the “instrument or tool with 
which all scientists work." A second important 
influence is the emphasis upon behavioral refer- 
ents for concepts. A sample of this state of 
affairs is Bronfenbrenner's (1953) statement, in 
his current review of the field of personality, 
that operational definition is the eventual rest- 
ing place of all good scientific concepts. Bron- 
fenbrenner goes on to emphasize, however, the 
real point of present dissension: that the im- 
mediacy of the need for operational definition 
is variously interpreted. 

Another source of energy for modern S-R 

e theory, at least recently, has come from the field 
of philosophy of science and modern relational 
logic. The dual emphasis of Vienna logical 
positivism and Jamesian pragmatism on an em- 
pirical theory of meaning has helped furnish a 
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broader base for thinking in S-R psychology. 
"The influence of modern symbolic logic may be 
seen by glancing through recent volumes of the 
Psychological Review (e.g. Feigl, 1945; Koch, 
1941; McReynolds, 1950; Moore, 1952). S-R 
theory has provided some outstanding examples 
of explicit and testable operational theory con- 
struction (Hull et al.,1940). Whether or not 
the current “models” are long retained, the fact 
of their existence and use has pointed the way 
toward a general approach of importance to 
psychology. 

Among recent influences in the development 
of S-R theory has been the insistence of Gestalt 
and Gestalttype theorists on a larger unit of 
analysis. This has clearly contributed to a more 
intent appraisal of the definition of such root 
terms as "stimulus" and "response." Freudian 
psychology, too, has been of increasing interest 
to S-R theorists. Freudian speculation on the 
nature of motivation has led to the inclusion of 
much of the nonteleological area of McDougall's 
thinking, and interest has been growing also 
with regard to the Freudian mechanisms. The 
common aspects of the Darwinian tradition, 
accepted by both groups of theorists, has per- 
haps helped here. As Bergmann (1943) has 
pointed out, however, while the Freudians have 


accepted almost all of functionalism, S-R theory - 


has been more cautious, because of its preoccu- 
pation with the behaviors of men and animals 
rather than only with their accomplishments. 
The Freudian emphasis on structure has also 
been approached warily. Structure analysis is 
essentially an emphasis on full description of a 
momentary state wherein a large number of de- 
pendent variables are described in interaction. 
SR theorists have maintained their analysis of 
laws of development of partial structures as a 
prior commitment. More recently, and with 
increasing confidence in their experimental and 
theoretical tools, some S-R thinkers have em- 
phasized structure analysis. The work of Miller 
(1951a) and of Whiting (1948) on conflict are 
some foremost examples of this trend. In gen- 
eral, the phenomena and insights of Freud, as 
well as the structural notions of Lewin, are be- 
ing given more interested attention. 

Zeal for knowledge which has practical conse- 
quences has been a continued motif in the tradi- 
tion of behaviorism. Watson's programmatic 
boast about rearing children closer to the heart's 
desire of the parents is symbolic of an interest 
in knowledge for use, which has taken on new 
and more sophisticated forms in recent years 
but which has maintained its strength. And, of 
course, the eventual criterion for deciding theo- 
retical issues may be seen to be the reinforce- 


ments provided to the society which underwrites 
the scientific endeavor. However, although any 
S-R scientist may be pleased by being able to 
make practical recommendations for training 
programs, like Jenkins and Stanley (1950), and 
although everyone enjoys reading Guthrie's quo- 
tations from a bird-training manual in a dis- 
cussion of his theory, theirs is a different attitude 
from that of continued emphasis on particular 
practical problems. Whereas application of 
theory is desirable, the focus of the search is 
knowledge and understanding rather than im- 
mediate application. Selection of problems for 
study has been dictated chiefly by theory, rather 
than by immediate pressures of applied prob- 
lems. This attitude is represented by F. H. 
Allport's (1942) recommendation to S.P.S.S.I. 
during World War II, that the need was not for 
values, which we had, but for a careful scientific 
analysis of the facts. 

"These statements apply to those S-R. theorists 
who have contributed to social psychology, as 
well as to others. The work of the former has 
been aimed at the immense problem of finding 
the classes of data, and the conditions thereof, 
in general human (and animal) interaction 
which test the empirical laws and principles of 
S-R psychology, as previously generated experi- 
mentally. This work involves, among other 
things, utilization of data obtained by other 
social scientists, together with occasional or fre- 
quent laboratory clarification. 

Recently there has been the very encouraging 
development of mutual attacks on the problems 
of social psychology by combined efforts of 
psychologists and other social scientists, such as 
anthropologists, economists, sociologists, and 
educators. In the forefront of this movement 
have been several S-R theorists (e.g, Miller and 
Dollard, 1941; Whiting and Child, 1953). Some 
theoretical intertranslation has been accom- 
plished, and the notion of a general theory of 
social behavior remains an impelling idea, al- 
though the development of a multitude of 
"small systems" will no doubt continue to repre- 
sent the more typical approach to this problem. 

At present social psychology may be charac- 
terized as showing two opposite, although not 
dissonant, tendencies. One is the urgency of 
expansion to new problems and areas of data, 
and the other is a tendency toward systematiza- 
tion, consolidation, and refinement. MacLeod's 
(1947) emphasis on the phenomenological ap- 
proach is, in part, an instance of the first 
tendency, an insistence on freedom of inquiry 
in the search of new problems. This mood is 
reflected by the interdisciplinary and inter- 
theoretical advances mentioned above, afd it 
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also shows a belief in the desirability of a more 
comprehensive social theory and an expectation 
that personal experience still holds the secret of 
many new variables and theoretical notions. 
The second tendency, toward refinement and 
systematization, also seems widespread. S-R 
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theorists appear to have joined in each of these 
tendencies, wishing to maintain the formulation 
of SR theory, and also showing increased in- 
terest in the phenomena and hypotheses of other 
researchers. 


THe THEORETICAL SYSTEMS 


Stimulus-response theory has developed along 
several lines, and the present discussion will 
furnish an introduction to the theories of Hull 
(1943, 1950), of Mowrer (1950), and of Guth- 
rie (1935, 1952), which are outstanding ex- 
amples of three major kinds of S-R theory. We 
shall call these systems “theoretical systems" to 
distinguish them from the “theoretical-empiri- 
cal systems" which will be discussed in the next 
section. "Theoretical system" is a term which 
refers to the conceptual scheme as worked out 
by the theorist, including specifically the ex- 
perimental reference data which serve to define 
the empirical realm which the conceptual 
scheme subsumes or explains. “Theoretical- 
empirical system” refers to a generalization of 
the conceptual scheme (that is, its concepts and 
the relations among the concepts) to some ad- 
ditional realm of data other than that which 
has helped define the theoretical system. We 
shall use the term here to refer to attempts to 
apply such theories as those of Hull, Mowrer, 
and Guthrie to areas of data which usually are 
considered as being in the field of personality 
and social psychology. It should be stressed 
that this is a different endeavor from that of 
testing implications of the theories within the 
area of the defining data itself. There is a broad 
history of studies of this latter kind, and we 
shall merely refer the reader to other discussions 
of this important problem (see Hovland, 1951; 
Brogden, 1951; Spence, 1951; Miller, 1951b; and 
Hilgard, 1951). 

The distinction between these two kinds of 
system, the theoretical and the theoretical- 
empirical, is used here as a heuristic device. 
It is very difficult, in fact, to state at just what 
point a theoretical-empirical system has oc- 
curred. Further, the distinction is often not 
necessary, since some scientific questions, if 
correctly asked, can be answered by looking to 
the reference or defining data itself. One can 
ask, "Do ‘values’ affect perception?,” and per- 
haps get an answer, "Yes," from one of Bruner 
and Postman's (1947) experiments. These ex- 
periments can also be used, in conjunction with 
other data and a conceptual scheme, to attempt 
an answer to further and more difficult ques- 
tions (Bruner and Postman, 1949). It is this 
second use of data which is our present concern. 


Perhaps we can best orient the discussion by 
taking an example of the behavior of an Ameri- 
can child and directing our questions at this 
behavior. Our focus will not be, however, on 
the structure of the child's personality as an 
individual nor specifically on the cultural or 
social factors within which this behavior can 
also be viewed. Our focus is rather on the gene- 
sis of the behavior involved, and those aspects 
which we shall abstract for discussion will be 
pertinent to specific problems of learning the- 
ory. The experiments to which we shall refer 
deal with problems which are theoretically rele- 
vant and which appear requisite for scientific 
description of the process of learning. 


A LEARNED BEHAVIOR SEQUENCE 


Children are trained to do certain tasks — to 
keep themselves clean, to do chores, to read 
books, to write their names, and to like the 
"right" objects, people, and ideas. Any of these 
could furnish an example for us. Let us choose, 
however, the following situation in the life of an 
American child. We shall describe this situa- 
tion, inserting reference letters, as follows: 


Johnny is three years old. His daily behavior 
during the summer months has come to include 
a certain sequence of acts which we shall describe 
here. Let us look at Johnny on a day when his 
learning of this sequence of acts is fairly well 
advanced. We note that Johnny is standing on a 
chair in the kitchen and has just reached into 
the cupboard for a box of cookies when the bell 
of the ice-cream truck rings (A). Johnny hesi- 
tates for a moment and then says, "Oh, darn!" 
He gets down quickly and runs immediately into 
the living room where his mother is sewing (B). 
He asks her, "Mommy, can I have a nickel to 
get some ice cream?" (C). His mother looks up 
from her work, smiles and says, "Go get a nickel 
from my pocketbook." (D). Johnny runs to the 
pocketbook and extracts a nickel (E), then runs 
out to the ice-cream truck, hands over his nickel 
(F), and with mouth watering (G), gets his ice 
cream (H), and eats it (I). 


tA CATEGORIZATION OF THE SEQUENCE 


This description represents a common-sense 
description of Johnny's behavior. The letters 
have been placed in the description for the pur- 
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pose of later analysis into categories according 
to S-R learning theory. We shall make our cate- 
gorization of Johnny's behavior and then out- 
line those aspects which different theorists have 
emphasized. The categories are the following: 

(A) , the sound of the ice-cream bell, is a con- 
ditioned stimulus (CS) which the remainder 
of the behavior sequence follows. (B), running 
into the sitting room, (C), asking for a nickel; 
(E), running and getting the nickel; and (F), 
running to the truck and handing over the 
nickel — all these can be termed responses 
which are instrumental and which are in a par- 
ticular sequence to a goal. In (I), eating ice 
cream, a response is made which appears to 
terminate the sequence and provides reinforce- 
ment. It is to be noted that (D), mother's state- 
ment, is another stimulus to Johnny which we 
shall call a subgoal. An additional subgoal 
which should explicitly be mentioned is the 
nickel, which is a stimulus or stimulus object. 
Further, let us call (G), the watering of John- 
ny's mouth at the sight of the ice cream, a con- 
ditioned response (CR), in the sense that it 
once occurred only when ice cream was on 
Johnnys tongue or was being eaten, but now 
occurs sometime in the temporal interval be- 
tween (A), the sounding of the truck's bell, and 
(H), the taste of the ice cream, which later we 
shall term the unconditioned stimulus (US). 
One other category is needed at this point — the 
drive, which motivates, at least in large part, this 
entire sequence of behavior. Since it is 4:30, we 
shall simplify our problem by assuming that it 
is hunger which is motivating Johnny, although 
we know that the "appetite," or "secondary 
drive," for ice cream is probably also present 
and active. 

We have now presented a sequence of be- 
havior in common-sense terms and have given 
names to some of the occurrences in this se- 
quence, It must be kept in mind that we are 
not naming everything which goes on in this 
complex behavior, nor have we given names 
which would satisfy all the theorists whose ideas 
we wish to introduce. 

There are differences among theorists in mod- 
ern learning theory in regard to the following 
factors: (1) some would feel that not all of our 
categories are necessary or particularly fertile; 
(2) some would choose different terms for the 
categories on grounds of the “meaning” with 
which they wished to endow them, or on the 
grounds of their feeling that the rules of inter- 
relations among the categories are different from 
what is here assumed. This last kind of dif- 
ference among theorists may be due to (a) lack 
of clarity or explicitness of some categories and 


rules; (b) lack of evidence, or of ways of ob- 
taining evidence at any particular time on spe- 
cific issues; (c) differences in what would be 
accepted as evidence. 

Before going on to our theorists, however, let 
us point out some things about what Johnny has 
learned. At some time in his life Johnny did 
not know that the sounds of the truck’s bell, or 
nickels, could be connected with eating ice 
cream. At the time when we described his be- 
havior, however, these stimuli (conditioned and 
subgoal stimuli) have been tied up with “un- 
conditioned stimuli,” or “reinforcing states of 
affairs,” and have taken on new properties rele- 
vant to Johnny and his behavior. Further, we 
may note, if we look very carefully at Johnny 
(and if current theories are fitting), that his 
mouth will water when the stimulus of the bell 
is heard by him, ie. the “unconditioned re- 
sponse” moves back in the chain so that Johnny 
can be said to behave as if he anticipated or "ex- 
pected” ice cream when the bell sounded. 

As this procedure goes on day by day, we may 
see that Johnny might tend to “short circuit” 
the sequence. Parts may drop out, or the in- 
strumental acts near the goal may occur earlier 
in the sequence, as when Johnny goes to the 
pocketbook and takes the nickel without asking 
his mother. Further, if we measure the various 
instrumental acts which Johnny goes through, 
we may discover a “gradient of locomotion” in 
the sense that as Johnny approaches the goal, 
the speed with which he performs each act be- 
comes relatively faster. Again, if we should use 
Bruner's "egometer" (Bruner and Goodman, 
1947) to measure the apparent size of the nickel 
to Johnny before and after it has become tied 
up with Johnny's needs or his goals or both, 
we may find the experience has led to an ac- 
centuation for him of the size of the nickel, or 
that a perception has been learned or altered 
by the experience of adding this sequence of 
behavior to his repertoire (Lambert, Solomon, 
and Watson, 1949) . 

We say that Johnny "has learned something," 
since the ice-cream truck sounds have become 
important to him. What has he learned? How 
did he learn it? What are the laws necessary to 
predict the resultant or consequent behavior 
which we have described and named? What 
has changed in Johnny? These are some ques- 
tions to which learning theorists attempt an- 
swers, 

For our present purposes it will be necessary 
first to select from the richness of the theoretical 
systems of three kinds of learning theorists the 
most characteristic aspect of their positions and, 
second, to analyze in more detail what these 
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theorists say about how behavior sequences like 
Johnny's are acquired and lost, and what be- 
havior should characteristically accompany such 
acquisition and loss. 


ADDITIONAL PROCESSES OF LEARNING 


The experimental study of both Pavlovian 
conditioning and Thorndike’s problem-solving 
situations has provided several repeatable phe- 
nomena which have been considered by most 
learning theorists and investigators as important 
kinds of data for any learning theory to deal 
with. Let us discuss these kinds of data in the 
context of our example in order that we may 
better see the problem that a systematic theorist 
faces in integrating this area. The uninitiated 
reader is referred, for a more detailed discussion 
than follows, to the several definitive chapters in 
the Handbook of Experimental Psychology (Ste- 
vens, 1951) on the broad empirical base of 
learning experimentation, particularly concern- 
ing the phenomena which have been of special 
interest to SR theorists. 

Although until recently most learning theory 
controversies (see Spence, 1951) have been fo- 
cused around how to state the necessary and/or 
sufficient conditions for learning, there are some 
signs that other (subsidiary) empirical and 
theoretical principles are of equally great im- 
portance for work in social psychology. The 
notion of “stimulus generalization,” for ex- 
ample, may provide hunches for défining “mod- 
al personalities," and for assisting in the tre- 
mendous problem of categorizing social 
situations (see Tolman, 1951, and Parsons and 
Shils, 1951). We shall conclude our discussion 
of the theoretical systems by summarizing the 
current systematic positions of these theories, 
and by adding certain further concepts which 
have not yet been fully integrated into the sys- 
tems. This will provide us with a clear picture 
of just what is at our disposal for use in more 
purely social psychological areas. 

Temporal relations. Conditioned responses 
are most strongly established when the uncon- 
ditioned (or reinforcing) stimulus follows the 
conditioned stimulus by a short time interval. 
With human subjects, the strongest condition- 
ing was found by Kimble (1947) to occur when 
the conditioned stimulus preceded the uncondi- 
tioned by 0.5 second. The empirical facts seem 
to confirm the general notion that in the classi- 
cal conditioning situation there is a relationship 
of decreasing strength of conditioning as this 
temporal interval is increased. Backward con- 
ditioning, where the conditioned stimulus fol- 
lows the unconditioned stimulus, has been 
reported (see Hovland, 1951) but is generally 


less effective than when the conditioned stimu- 
lus precedes the unconditioned stimulus. These 
temporal relations, as well as others, have been 
important in theorizing about learning since the 
time of Pavlov. 

Spontaneous recovery. It appears to be the 
case with most conditioned responses that when 
time is allowed to elapse after the response is 
extinguished, the response may reappear spon- 
taneously and increase in strength without fur- 
ther reinforcement. In other words, if Johnny's 
mother desired to end the behavior we have 
described above and had apparently succeeded 
in doing so, the chances are that the responses 
would tend to recur at some later time. Such 
spontaneous recovery provides an interpretation 
for much of the seeming naughtiness of chil- 
dren, who, having learned the right thing to do 
after much trouble, occasionally revert to a 
bad habit. 

Stimulus generalization. One of the focal 
points, and a remaining problem, in condition- 
ing and learning theory is the fact of stimulus 
generalization. This concept refers to the fact 
that when a response has come to occur to a 
certain stimulus or stimulus pattern, the same 
response will also occur to similar stimuli with- 
out any reinforcement for such occurrences. 
Hovland has recently summarized the salient 
fact: 

Immediately following the conditioning trials 
generalization may be almost complete, that is, 
as large a response being made to stimuli not 
previously paired with the unconditioned stimu- 
lus, as to the stimuli given paired presentation. 
With subsequent testing, the generalized responses 
appear to extinguish more rapidly than those es- 
tablished to the original conditioned stimulus. 
(Hovland, 1951, p. 616) 


One of the important areas for quantification 
in learning theory is the identification of factors 
that affect the shape of “gradients” of generali- 
zation, but for general purposes one can imagine 
that the form is roughly that of a falling off of 
the amplitude of the response as one makes the 
conditioned stimulus less and less similar to 
the conditioned stimulus that had been paired 
with the unconditioned stimulus. Thus if we 
varied the sound of the ice-cream bell up and 
down the intensity scale away from the tone to 
which Johnny has been conditioned, we should 
find Johnny's salivating response decreasing in 
amplitude. The shape of any generalization 
gradient is, however, a difficult matter to predict 
at the present time except under limited con- 
ditions where psycho-physical controls can be 
maintained. Lawrence (1952) has recently 
raised some serious questions on this score. As 
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we shall see below, gradients of stimulus gen- 
eralization have been studied in many kinds of 
situations, and the general concept has been 
found useful by such varied social scientists as 
Stroud (1946) in educational psychology, Miller 
and Dollard (1941) in social psychology, Dol- 
lard and Miller (1950) in personality theory, 
and Murdock (1949) in cultural anthropology. 
The knottiest problem in this area involves first 
how to measure learned stimulus generalization 
gradients and, second, a statement of the com- 
plex laws of how such continua or scales become 
learned by the individual. 

Discrimination. The important fact of dis- 
crimination is stated by Hovland as follows: 


If reinforcement is always present when the 
original conditioned stimulus occurs but is always 
absent when the generalized stimulus occurs, the 
combination of increased strength of the condi- 
tioned response and extinction of the generalized 
response reduces the degree of generalization un- 
til finally the response occurs to only a very 
narrow range of stimuli. The accuracy of dis- 
crimination is reduced when the stimuli are pre- 
sented rapidly and in irregular order. (Hovland, 
1951, p. 615) 


Therefore, although in the early stages of learn- 
ing the sequence of behavior Johnny might have 
responded to many kinds of bells and cymbals, 
he would soon come to respond by asking for a 
nickel only when the bell was the bell of the ice- 
cream truck, or one very similar. This would 
be the case unless, of course, his mother always 
gave him a nickel when any bell rang (an im- 
probable situation), and he had previously 
learned to use nickels to go to the store. 
Higher order conditioning. Probably the no- 
tion of higher order conditioning is the most 
important concept which conditioning experi- 
menters have provided for understanding com- 
plex human social behavior. 'The term refers 
to the experimental fact that a conditioned 
stimulus which has previously been associated 
with an unconditioned stimulus, will, inde- 
pendently of any further reinforcements, in 
some circumstances function as an "uncondi- 
tioned stimulus" in setting up a second condi- 
tioned response to a third stimulus. Such 
conditioning can sometimes be continued for 
several stages. 'This capacity for previously 
neutral stimuli to take on the capacity of being 
reinforcers is a core notion in learning theorists' 
approach to the development of valuing be- 


havior towards environmental objects, In our" 


example, ice-cream bells, five-cent pieces, and 
probably the sight of ice-cream vendors have all 
come to "have value" to Johnny. Furthermore, 
Johnny would probably learn new sequences 


just to get nickels and go out of his way just to 
get close to sounds of ice-cream trucks. 

Anticipatory phenomena. The flow of saliva 
which follows the sounding of the conditioned 
stimulus in Johnny's behavior sequence origi- 
nally occurred only when the ice cream was 
in his mouth. This “fractional anticipatory goal 
response” has, in learning, commenced to occur 
out of place in the sequence, so that Johnny 
may be said “to anticipate” getting ice cream. 
Such anticipatory responses can be seen in some 
kinds of experimental situations; in others, they 
cannot be seen directly but have been inferred 
or “hypothecated” by some experimenters (see 
Spence, 1951) . 

If nursery school children are taught to enter 
a room and turn a crank and get candy, the 
saliva flow which occurs while they are working 
for candy may tequire the use of a mop for the 
floor, and such children are quite free in calling 
loudly, “I want some candy” or “I came here 
to get candy,” providing clear evidence of their 
“anticipation” (Lambert, 1950). 

Such observations of child behavior are very 
silly unless the major point is grasped; such 
“purposive” behavior is so much with us that 
some "cognitive" theorists have tended to as- 
sume it in their theorizing about human beings 
or about rats. SR theories have viewed the 
problem of purposes as one which is not to be 
assumed, but as one which must be explained in 
the sense of outlining the laws whereby organ- 
isms develop those “purposes” they are not born 
with. Further experimentation with anticipa- 
tory phenomena may help in this matter. 

These, then, are some of the basic facts of 
conditioning. We have here limited our dis- 
cussion to repeatable phenomena which have 
been obtained with human subjects. Many of 
the definitive experiments on these phenomena 
have, however, been conducted with infra- 
human organisms as subjects. This has been 
the case for at least two reasons: (1) some psy- 
chologists are interested mainly in comparative 
psychology and the study of infrahuman be- 
havior; (2) and most important, infrahuman 
experimental situations permit much more 
careful control of the broad scope of variables 
which must be taken into consideration in these 
kinds of studies. "The aim of research in the 
second case is often the attempt to uncover more 
clearly some of the basic factors in the environ- 
mental control of behavior changes irrespective 
of species. In the present context this is our 
major interest, although the study of compara- 
tive social psychology (Smith and Ross, 1952) is 
in itself an area which probably will increase 
in importance as social psychologists begin to 
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take into consideration more seriously the need 
to control genetic factors in their experiments 
(see Tinbergen, 1951) . 

It is also important to note that a great many 
of the above relationships have been found in 
other kinds of learning situations than those 
of classical conditioning, and with human be- 
ings as well as lower organisms. Skinner (1938), 
for example, has been able to obtain highly 
repeatable and clear-cut data in all of these 
areas by using an instrumental conditioning 
situation similar to Thorndike’s problem box. 


THREE THEORIES OF ACQUISITION AND EXTINCTION 


Let us quickly outline the major characteris- 
tics of the three types of theory concerned with 
what is learned and how. We shall also try to 
show how these theories deal with the categories 
as named above in our analysis of Johnny’s 
sequence of behavior. 

“S-R contiguity” theory. The most outspoken 
exponent of this type of theory is Guthrie 
(1952). Guthrie talks in the language of re- 
sponses and of the contiguity between them and 
stimuli (internal and external). However, he 
would suggest that we watch the development 
of Johnny’s sequence of acts much more pre- 
cisely than we have as yet done. In fact, hardly 
anyone has ever watched an organism suffi- 
ciently carefully to give referents to Guthrie’s 
more “molecular” terms. 

Guthrie would suggest that the "acts" of 
Johnny developed out of tiny, more or less 
discrete movements, which become attached to 
stimuli (say the internal physiological stimuli 
of Johnny's drive state). These stimuli evoke 
and maintain the sequence of movements by 
laws of strict contiguity. The notion of coh- 
tiguity has been restricted by Guthrie to the 
meaning, "temporal simultaneity." The simple 
and sovereign principle of learning here, then, 
is that if a stimulus has occurred in contiguity 
with a response, the response will always so 
occur, unless some other response has become 
conditioned to the same stimulus. Getting the 
ice cream and eating it is merely a "terminating 
response" which affects some stimuli (say, the 
drive state) so that new and interfering move- 
ments cannot become conditioned to the stimuli 
prior in time to such ice cream eating. 

“Extinction,” or disappearance of the re- 
sponse, is a matter of interference, where the 
necessary abrupt change in stimuli or responses 
does not occur (for example, Johnny keeps on 
desiring ice cream after having been out at 
the truck) and new and interfering responses 
become attached to the stimuli. Johnny, for 


example, might leave the truck and sit down on 
the porch and mope. "Moping" responses might 
become conditioned to the internal stimuli rep- 
resenting the desire for ice cream. 

Guthrie really suggests four specific ways in 
which extinction can be accomplished; (1) the 
conditioned stimulus may be acting, but since 
it is below threshold the response will not oc- 
cur; (2) the response may be eliminated 
through exhaustion or fatigue; (3) the response 
may be inhibited by the action of incompatible 
responses. These three ways "involve the pres- 
ence of the cue and the prevention of the re- 
sponse" (1952, p. 59) , and the reader can readily 
apply them to the situation with Johnny. (4) 
The fourth method Guthrie describes is a case 
where "'the substitute — conditioned — stimu- 
Ius is applied so soon after its response that the 
response is still in what is called its refractory 
period." In principle, then, if we rang the ice- 
cream bell directly after Johnny salivated (as- 
suming “salivating” is the response conditioned 
to the bell), then some new response would be- 
come conditioned to the bell, and if salivation 
has been occurring early enough in the se- 
quence, some new sequence of behavior will 
develop. Of course, salivating may be only ap- 
parently conditioned to the bell when in 
actuality some other response (to which salivat- 
ing is attached in a chain) may be the key to 
the sequence. In this case the sequence might 
continue as before unless we diagnosed cor- 
rectly which response is the "key" one to the 
sequence which costs the mother a nickel, and 
then applied this rule to the refractory period 
of this key response. 

“S-R reinforcement” theory. The most elabo- 
rate and formally systematic theory of learning 
is that of Hull (1943), which emphasizes the 
occurrence of drive reduction as a prime mover 
of learning. Hull's formulations represent an 
attempt to integrate the Pavlovian classical con- 
ditioning tradition with the reinforcement or 
law of effect tradition stemming from certain 
British associationistic philosophers, and more 
recently from Thorndike (1933). Thorndike 
was interested in the temporal relationship of 
the instrumental response (like Johnny's act of 
the receipt of the reward (or punishment) . The 
handing the nickel to the ice-cream man) and 
strength of the learning bond between a stimu- 
lus and a response was found to be some func- 
tion, other things being experimentally equal, 
of the temporal (or spatial) discrepancy be- 
tween the response and the reduction of some 
aspect of the drive state. 

Hull also considered theoretically the im- 
portance of the temporal contiguity of the con- 
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ditioned and unconditioned stimuli, and 
anchored to the reinforcement context the ef- 
fects usually ascribed to such asymmetry of 
stimuli. He did this by broadening Pavlov's 
notion of the physiological trace left by the CS, 
in order to tie such a trace up with the occur- 
rence of the "reinforcing state of affairs." Such 
traces decay with time and are less strongly 
conditioned to the instrumental response the 
more it is separated in time from reinforcement. 

In short, Pavlovian conditioning phenomena, 
as well as simple problem-solving phenomena, 
become tied to the same basic event — the re- 
duction of a drive or strong impelling stimulus 
— which Hull calls a reinforcement. Extinction 
is explained by Hull in terms of a dual process. 
If Johnny indulges in his sequence and it is 
not reinforced, inhibition will develop. One 

‘kind of inhibition is close to the common-sense 
notion of fatigue which develops just because 
Johnny does the act, and the effects of which 
dissipate fairly quickly with time; the second 
kind of inhibition is permanent but develops 
more slowly. Any "resting response" which re- 
duces the fatigue-type inhibition (another form 
of reinforcement) leads to the increase in proba- 
bility of the resting response becoming attached 
to the cues of the ice-cream bell, so that Johnny 
dawdles or sulks, instead of running to mother 
for the nickel for the ice cream. 

A separate notion that Hull has taken from 
Pavlov, and has altered, has to do with "secon- 
dary reinforcement," where a previously neutral 
stimulus (like the nickel in our example) comes 
to take on the capacity to reinforce responses 
to other stimuli. The nickel thereby becomes a 
reward to some degree and comes to function 
as such — Johnny would learn new sequences 
of responses in order to get nickels (after nickels 
have taken on secondary reinforcing proper- 
ties). 

After analysis of the general experimental 
literature in learning, Hull attempted to set up 
a “law of reinforcement” which would state the 
necessary and sufficient conditions for learning 
to occur. To do this he found it necessary, of 
course, to specify, in the light of available evi- 
dence, his best guesses as to what the other 
major factors are that one must relate to the 
basic law in order to increase its utility to ex- 
perimentation and to generalizing. Since most 
of his guesses were based on certain defining 
evidence present when, in 1943, he first proposed 


this major theoretical system, he found it felicis 


tous to change the system in some respects in 
1950. In light of the fact that a good percentage 
of basic research in learning was focused on 
testing in various ways his early formulation, 


he managed, until his premature death in 1952, 
to achieve a great degree of cooperation from 
other people in his theoretical endeavor, and 
to receive free (and often rather strong) sug- 
gestions from others. We shall discuss Hull's 
system more thoroughly after a discussion of 
our third type of theorist. 

Dual theories of acquisition. It may be of 
value to discuss in the context of Johnny's be- 
havior another notion of what is involved in 
the process of acquisition of behavior sequences. 
Several modern S-R theorists have come to the 
conclusion that two principles are necessary to 
handle the experimental literature in the fields 
of conditioning and problem solving. These 
dual theorists include B. F. Skinner (1938), 
Harold Schlosberg (1937), O. H. Mowrer 
(1950), and Eleanor J. Gibson (1952). These 
theorists accept both the principle of contiguity 
stimuli and some principle of reinforcement 
(that is, success, but not always drive reduc- 
tion) as necessary in formulating an adequate 
theory of learning. 

Skinner's (1938) famous distinction is be- 
tween “type R” conditioned responses and “type 
S" conditioned responses. In type R condition- 
ing, which depends upon the principle of rein- 
forcement, the early occurrences of the 
conditioned (instrumental) response are not 
under the control of the experimenter. It is 
as if the experimenter were the ice-cream man, 
who cannot force Johnny to bring him a nickel 
for ice cream but must wait until a stimulus 
brings the first act out. Of course, once the act 
is out, the ice-cream man has a modicum of 
control by virtue of the degree to which he has 
reinforced Johnny's behavior. In type S (classi- 
cal) conditioning, the experimenter can control 
the occurrence of the to-be-conditioned re- 
sponse, just as Pavlov could bring about the 
dog's salivation by introduction of food powder 
in the dog's mouth. According to social conven- 
tion, Johnnys mother, if not the ice-cream 
man, might be permitted to bring about salivat- 
ing responses in Johnny by presenting him with 
his first taste of ice cream along with his other 
food at meal time, whether Johnny wished it 
or not. 

Skinner suggests that conditioning of type R 
depends on the occurrence of a reinforcing 
stimulus (not necessarily drive reducing) fol- 
lowing the response. Type S conditioning de- 
pends on the temporal relationships the 
experimenter (or mother) sets up between the 
CS and the US. In most complex behavior se- 
quences, like Johnny's, both kinds of condition- 
ing occur. The probability of the salivary 
response (type S conditioning) might be deter- 
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mined by the temporal contiguity of the bell 
and the receipt of ice cream. The other (type 
R) response chain (running and asking mother 
for a nickel, etc.) might depend essentially on 
receipt of ice cream after the responses have 
occurred. Skinner emphasizes the fact that in 
type R the experimenter often has no notion 
as to what stimuli evoked the first response. 
That is, the experimenter is in the position of 
the ice-cream man who sees Johnny come run- 
ning out for the first time and doesn’t know 
who was the source of the nickel, or whether 
Johnny had received his mother’s permission or 
not. This is, in a sense, the problem of the 
Rorschach tester who gets a great many re- 
sponses from the subject but has little notion 
as to what evoked them except that the ink blot 
and the results of the individual’s past experi- 
ences have something to do with it. Skinner 
has also suggested that type R responses are 
largely skeletal, whereas type S responses involve 
those types of effectors that are controlled by the 
autonomic nervous system. 

Mowrer’s (1947) conception is close to that 
of Skinner, as is that of Schlosberg. Mowrer has 
attempted to show that anxiety can be profitably 
viewed as a secondary drive and anxiety reduc- 
tion as a reinforcement. Learning to be "an: 
xious” occurs according to laws of stimulus 
contiguity, whereas there must be reinforcement 
(drive reduction) for a skeletal type of response 
to be learned to the anxiety stimulus. 

There is no data yet which can crucially de- 
cide the issue as to whether an explicit monistic 
theory of either the Hull or the Guthrie type is 
more or less adequate than these dual theories. 
White and Schlosberg (1952) have reported 
evidence which they feel is opposed to either the 
Mowrerian or the Hullian interpretation, but 
more evidence of this kind will be necessary for 
these issues to be clearly decided. 

We now turn to a more systematic presenta- 
tion of the theories of Hull, Mowrer, and Guth- 
rie, 


HULL'S THEORETICAL SYSTEM 

Leeper, who is critical of Hull's theory, has 
summarized Hull's importance as due to the fact 
that “more than any other psychologist, he has 
emphasized that the goal of psychology is the 
discovery of quantitative relations among basic 
variables" (Leeper, 1952, p. 490). Leeper also 
agrees with Hilgard's (1948) statement to the 
effect that work in the field of learning will 
need ultimately to follow the form which has 
been suggested by Hull's work — we must look 
to the development of abstract, quantitative 
propositions stated to a considerable degree in 
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terms of inferred factors (Leeper, 1952). Al- 
though, as we shall see, new methods of theoriz- 
ing are already developing in learning theory 
(Mosteller and Bush, 1951) , undoubtedly Hull’s 
preliminary formulations have greatly speeded 
up theoretical development in the field — some 
would say (Skinner, 1950) that they have 
speeded it up too much. 

There are three recurrent themes in Hull's 
thinking: (1) Vulnerability of one's theory is 
vital to scientific progress (that is, the propo- 
sitions of the theory can be tested and proved 
wrong), and explicit mathematical formula- 
tion, even when inductive procedures are not 
elegant, is one means of assuring vulnerability. 
(2) The hypothetico-deductive method, where 
the hypothetical relationships are based on the 
"best" induction possible, and where rigorous 
deductions from the set of variables can be 
tested, is a defensible procedure and may help 
to advance the integration of the behavior sci- 
ences, (3) Any theory should be expected to 
change as crucial experiments are carried out, 
and as the theory is generalized to new areas 
of data. Hull hoped, however, that his theory, 
which attempts to integrate the experimental 
findings in the areas of conditioning and prob- 
lem solving, would provide at least a first ap- 
proximation to the variables needed in the 
analysis of more complex phenomena. 

An example of Hullian analysis. We shall 
use Hull’s (1943) original model, in the main, 
in this exposition, since it is this statement of 
Hull's position which has given rise to a whole 
movement in experiment and theorizing. Hull 
(1950) has recently outlined certain changes in 
this 1943 theory, but the present exposition will 
provide an adequate introduction to this out- 
line. Hull's death has left future development 
of his theory in the hands of others, and further 
alterations are to be expected from his original 
statement of theory. In this section we shall 
present the Hullian position as it emerges from 
a recent attempt by Brown and Farber (1951) to 
formally add "emotion" as a new intervening 
variable in the system. "Their presentation has 
two main virtues — it is compact, and it shows 
clearly the susceptibility of the theory to change, 
where desirable. 

First, let us look at Brown and Farber's rea- 
sons for adding another construct. After con- 
sidering several sources of evidence and thought 
about "emotion," and considering Mowrer's 
theorizing of a decade ago (see Mowrer, 1950) 
and Leepers (1948) more recent suggestion 
that emotion be analyzed as a motive rather 
than as a disruptive state, they attempt to inte- 
grate these notions around the consideration of 
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“emotion” as an intervening variable. The 
notions involved in their induction procedure 
stem from the work of Mowrer (1950) and Mil- 
ler (1951b) on fear, the thinking involved in 
the Yale group’s frustration-aggression hypoth- 
eses (Dollard et al., 1930), Maier’s (1949) analy- 
sis of frustration and "abnormal fixations” in 
rats, and from the work of French (1944) on 
frustration from the Lewinian point of view. 
‘These various sources led them to some ideas 
on how the emotion of "frustration" (or 
"anger") can be placed within Hull's system, 
assuming as they do that: 


A theory of emotion, stripped of its nonessen- 
tials, is simply a series of guesses as to the manner 
in which emotions are presumed to result from 
environmental events (past and present, internal 
and external), guesses as to how they may re- 
semble, differ from, and interact with other 
hypothetical states or processes within the organ- 
ism, and guesses as to how different emotions, or 
different degrees of the same emotion may affect 
behavior differentially. It is furthermore stipu- 
lated that these various guessed-at relationships 
be stated specifically enough to assure their sus- 
ceptibility to experimental test. (Brown and 
Farber, 1951, p. 468) 


Let us use the "whole-teaching" method of 
explication and ask the reader to look at Fig. 1, 
which will provide us with the results of Brown 
and Farber's thinking as well as much of Hull's 
theorizing. Let us first note that the analysis 
moves from left to right across the page. Hu- 
morously speaking, this may be the reason why 
Hebb (1951) and Bronfenbrenner (1953) refer 
to rigorous theorists as the "rightists" in con- 
temporary psychology. 

In the left column are the statements of the 
"stimulus variables" of the system, ie, the 
antecedent manipulable conditions, which, in 
conjunction with the intervening variables 
(middle column), determine the “observable 
response characteristics.” Such antecedent-to- 
response analyses are generally characteristic of 
S-R theory. Skinner (1945) has suggested that 
this be considered the hallmark of “behavioris- 
tic’ analysis. We shall discuss later some of 
the other characteristics of SR theory. 

Before focusing on the star of this theoretical 
performance (i.e., the “F” in the middle of the 
diagram) let us look at Hull’s intervening vari- 
ables that are present in the analysis. First, 
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Fic. l. Diagram summarizing the antecedent manipulable conditions and the hypothetical 
variables employed in the formulation of the present theory of frustration. The symbols for the 
hypothetical variables are defined as follows: sn, = habit strength of an ongoing response 
tendency; D = relevant drive; sin, = habit strength of competing (weaker) response tendency; 
Ig = competitive inhibitory tendency; gEg, = initial excitatory strength of ongoing response 
tendency; Ep, = initial excitatory strength of competing tendency; F = frustration state; D = 
irrelevant drive due to frustration; D = effective drive strength resulting from combination of D 


and 


; Sp = frustration generated stimuli; ¢,H_ = acquired habits associated with frustra- 


tion; aen = innate response tendencies associated with frustration; sEm, = final excitatory 
strength of initially dominant response tendency; sEn, = final excitatory strength of initially com- 
peting response tendency; s, Ey = strength of excitatory tendency (either innate or learned) asso- 
ciated with frustration. (From Brown and Farber, 1951) 


STIMULUS-RESPONSE CONTIGUITY AND REINFORCEMENT THEORY 67 


note that this diagram is, like a Lewinian dia- 
gram, a “picture” in symbols of a very short 
molar sequence of behavior. That is, it is an 
attempt to capture a quantifiable picture of an 
organism in an act of being “frustrated,” which 
involves being "in conflict." If one considers 
the diagram as including only the intervening 
variables and observable behavior characteris- 
tics, then the diagram is analogous to Lewin's 
"momentary field." Further speaking in Lewin's 
(1935) terms, these two columns set up the fuller 
specification of the relatively empty formula 
B = (p, e). 

One of the important differences between this 
kind of diagram and that of a field theorist is 
that the S-R theorist attempts to go further in 
setting up the antecedent manipulable condi- 
tions which elicit the particular quantified 
pattern of intervening variables and of observ- 
able response characteristics which define “the 
present field." This statement of "stimulus- 
intervening variable" laws is contingent, of 
course, upon the limitations of learning itself. 
That is, certain factors which are responsible 
for behavior change (e.g., fatigue processes and 
innate processes) are not fully “manipulable,” 
and therefore must be “built into” the set of in- 
tervening variables. Theoretical models differ 
in the degree to which they rest upon such 
built-in factors as ''prágnanz," "libido," etc., 
and in doing so assume that behavior is to 
differing degrees "docile" to environmental 
events. 

We have assumed that Johnny learned the 
particular response sequence that we analyzed 
above. We do not mean that the process of 
learning substantively created his walking, talk- 
ing, and salivating. We mean merely that as a 
result of the operation of certain antecedent 
conditions, Johnny came to behave in the way 
we described, and that as a result, in part, of 
certain currently operating conditions Johnny 
will either continue to go after ice cream every 
afternoon that the bell rings, or he will not. 
That is, he will “learn” (i.e., be "reorganized") 
as a result of what is happening now, so that 
the particular pattern of intervening variables 
will be a little bit altered as a result of his 
“experience” on this particular afternoon. 
Thus, dependent on the currently active en- 
vironmental conditions of tomorrow, his be- 
havior will be different tomorrow. "Learning" 
is happening to Johnny whenever the environ- 
mental conditions active on him while he is 
performing meet the requirements for learning 
to occur. The problem of learning theory is 
simply to specify the conditions under which 
changes in structures occur. 


The major structural or “associationistic” 
units in Fig. 1 are the intervening variables, 
sHg, and so, The factors responsible for the 


particular strength of these at the present mo- 
ment are given in the list of antecedent con- 
ditions: number of previous reinforcements, 
amount of reward present in those previous 
reinforcements, and the delay in the reinforce- 
ment (i.e., the delay between the time of arrival 
in the previous situation (S) and the reward, or 
the delay between the occurrence of the response 
sequence and the reward in past commerce with 
this particular situation). 

Let us return to our example at this point and 
view Fig. 1 as representing Johnny's state when 
the ice-cream bell rang. The reader should re- 
call that Johnny was busily engaged in reaching 
for the cookie box at the time that the sounds 
of the ice-cream truck intruded themselves into 
the situation. This tendency to go after cookie 
boxes has a certain “habit strength” (sHr) even 
when the ice-cream bell intrudes on the situa- 
tion. Let us further suppose that this is the first 
time that Johnny has ever gotten around to 
dealing in this way with the cookie box. The 
habit strength is a result of Johnny's having 
symbolically recalled the time when he had 
pleased himself and amused his parents by 
climbing up on a kitchen chair to lick the re- 
mainder from a bowl in which his mother had 
been mixing cake frosting. The strength of this 
"associative" factor in Johnny's present be- 
havior is referred to by Ha, which Brown and 


Farber describe as “habit strength of competing 
(weaker) response tendency" The habit 
strength of the major response tendency (get- 
ting ice cream) is referred to by the rubric sHg,. 


This general intervening variable, Hz, is one 
of the major concepts in Hull's theory and, it 
is important to note, the concept is defined not 
only on the response side (that is, objective 
response characteristics to be expected with 
changes in Hg) but also on the antecedent 
side (that is, in terms of the antecedent condi- 
tions, a knowledge of which permits us to pre- 
dict the strength of sHg). Our example should 
make it clear to the reader that the concept sn 
is a very general one which could subsume as 
special cases a large number of specific classes or 
instances of habits, such as getting cookie boxes 
or going after ice cream. 

Let us now look at the symbol D in the first 
column of intervening variables. Brown and 


"Farber describe this as the strength of the rele- 


vant drive. It should be noted that the brackets 
in the diagram suggest that there is a certain 
strength of this relevant drive for each of the 
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response tendencies. Such a concept forces us 
to be explicit with regard to the concept of 
motivation. In our present example, we can 
tentatively name the drive involved both with 
sHg, and with sHg, as hunger. The strengths 


of the drive components involved in both re- 
sponses are a function of the antecedent con- 
ditions of degree of deprivation — in one case 
“ice-cream deprivation,’ and in the other 
“cookie deprivation." 

At the top of the diagram the reader will note 
that there is an intervening variable entitled Ig 
which involves what is, in effect, an additional 
competing factor which decreases the strength 
of what we have called the response tendency 
“to get ice cream.” The strength of this “in- 
hibitory” factor is a function of the antecedent 
conditions of the degree of blocking which the 
response tendency has very recently encoun- 
tered, the degree to which Johnny has tried “to 
get ice cream” in this situation but failed (the 
degree to which nonreinforcement is operat- 
ing) and, finally, the degree to which Johnny 
has done work in the present situation in his 
search for ice cream. 

Having stated these major concepts in Hull's 
system, let us now look at how they are interre- 
lated. Hull (1943) has suggested that ;H, and 
D are related in a multiplicative way to produce 
sEg. For example, in Fig. 1, sn, is a resultant 


of sn, X D. Furthermore, this resultant 
gEn, can be viewed as being in competition with 
either the Ij operative or the sn, (or both) 


operative at a given time, Such competition 
between "inhibition" and "initial excitatory 
strength of a tendency" or between two "initial 
excitatory strengths of two competing tenden- 
cies" results in the arousal of the intervening 
variable frustration (F in the diagram). The 
two arrows which lead away from F refer to the 
two major effects which F has in the on-going 
situation. The upper one indicates that frustra- 
tion (F) produces a certain degree of irrelevant 


drive (D) which combines with the relevant 


drive (D) to produce the effective drive (D). 
The second major effect of F is shown by the 
lower arrow, This refers to the fact that frustra- 
tion tends to generate certain specific frustration 
stimuli (Sy). These stimuli, Brown and Farber 
assume, may have innately associated response 
tendencies (sUr), or such specific frustration 


stimuli may have acquired new associative con- 
nections through past processes of learning 
(s, Hg). The last column of intervening vari- 


ables suggests how certain "final excitatory 


strengths" of the various response tendencies 
may be summated (algebraically) in order to 
produce one or more observable responses, the 
nature of which will depend upon the relative 
strengths of these several possible excitatory 
tendencies. 

In our protocol of Johnny's behavior we re- 
ported that in his conflictful state, "Johnny 
hesitates for a moment and then says ‘Oh, darn!’ 
He gets down quickly and runs immediately 
into the living room . . ." The remark “Oh, 
darn!" (the first response tendency in Brown 
and Farber’s theoretical formulation) can be 
interpreted as a very strongly learned response 
to certain specific frustration stimuli, learned 
by way of some identification mechanism (see 
below). Once this strongly learned and specific 
response had been emitted, then the stronger 
of the remaining response tendencies — getting 
ice cream — took over, and Johnny began the 
sequence of behavior we have recorded above. 
It is important to note that Brown and Farber's 
theory includes the important prediction that 
whichever responses occur in the "conflict situa- 
tion" will occur with greater intensity because 
the frustration has increased the potential driv- 
ing factor in the situation. Johnny "gets down 
quickly and runs immediately . . .” Hull has 
previously stated that the effect of an increase 
in drive on later behavior can be found in any 
of the following ways: (1) an increase in ampli- 
tude of the response, particularly if the response 
is an "emotional" one; (2) a decrease in the 
latency of the response; (3) an increase in the 
probability of the occurrence of the response; 
or (4) temporary increase in resistance to ex- 
tinction, Such changes are also used to infer 
the effect of an increase in lx (other things 
constant) , or an effect through any other factor 
which increases the quantity of the intervening 
variable bn, since gE, is the factor which is 
closest to the observable behavior. 

Let us review at this point what we have done 
in explicating Hull's and Brown and Farber's 
theorizing. First, we have presented a cate- 
gorization of certain observable aspects of 
Johnny's behavior in common-sense terms. We 
then translated. these observable aspects into 
the categories utilized for descriptive purposes 
by various learning experimenters. Then we 
analyzed the theories of the processes of acqui- 
sition and extinction that have been developed 
by the S-R reinforcement theorists and by Guth- 
rie's SR contiguity theory. We then outlined 
the empirical facts involved in certain other 
areas of the process of behavior change, and fi- 
nally we have presented an analysis of Johnny's 
behavior in terms of Hullian theory. 
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In doing this latter task we have presented 
only the major concepts which Hull's system 
uses, and these have been largely in the form 
in which he defined them in his important 1943 
work, There are other intervening variables 
which Hull uses and which define his total 
system. Further, we have only cursorily dealt 
with the relationships Hull has postulated as 
obtaining among his intervening variables, and 
we have neglected the detailed mathematical 
statement of the relationships which hold, ac- 
cording to theory, between the “antecedent- 
condition variables’ and the “intervening 
variables,” and between the intervening var- 
iables and the consequent behavior variables. 
The reader is referred to other sources for most 
of these more precise statements (see Hull, 
1943, 1951). 

The nature of intervening variables. It is 
important at this point to raise the following 
question: What are these intervening variables? 
Do they exist anywhere, or are they just vagaries 
which Hull has foisted upon us? Two answers 
can be made to this question. Spence (1951), 
for example, and others have interpreted Hull’s 
theory as a molar one, and they consider these 
intervening variables as "existing" only in the 
sense that the unseen atom exists, i.e., they are 
defined (operationally) in terms of antecedent 
"inputs" (observable, and for experimenters, 
manipulable) and also in terms of observable, 
measurable consequent “outputs.” In fact, 
Hull's books are filled with references to such 
defining data. It is because of this fact that we 
have presumed to call Hull's theory a “theoreti- 
cal system," whereas for contrast consider the 
"General Theory of Action" of Parsons and 
Shils, which has been termed a "categorical" 
system, wherein some educated guesses as to 
the important categories necessary for the analy- 
sis of action, and only such guesses as to cate- 
gories, have been worked out. Although some 
readers might choose to place the intervening 
variables in some “phenomenal space,” and in 
so doing might generate hunches for further 
study, this is not the intent of Hull. His intent 
is to leave the intervening variables in a mathe- 
matical space (as a factor analyst might do), 
and obtain deductions for test by way of mathe- 
matical manipulation in a "mathematical 
phenomenal space." A good percentage of psy- 
chological experimentation in the past few 
years has been devoted to such an attempt (see 
Spence, 1952b) . 

The second answer to our question about the 
nature of intervening variables has been sug- 
gested at times by Hull (1943). He hopes that 
these intervening variables may eventually be 


related to physiological processes which will be 
independently defined by physiological meas- 
urements. In fact, Hull has added “meaning” 
to such concepts as gH, by trying to make a 
place for such definitions. In light of the papers 
by Klein and Krech (1951) and Hebb (1951), 
such matters may become increasingly impor- 
tant to social and personality psychologists, but 
for the present discussion we can consider them 
irrelevant. In this second answer to the ques- 
tion, we have dealt with some of the reasons why 
some people have called Hull's theory such 
names as "atomistic" because they prefer a dif- 
ferent kind of physiology from that which he 
appears to espouse, or "'reductionistic" because 
they are opposed to or uninterested in relating 
"psychological" intervening variables to the 
soma. According to Spence's interpretation, 
given above, such criticisms may be considered 
irrelevant. 

Let us return for a final look at Hull’s general 
theory and note an important distinction which 
Sears (1951) has emphasized. There are two 
foci which can be used when analyzing behavior 
in this SR manner. In short, one can be in- 
terested in the changes of structure which occur 
Oe, learning) , or one can be interested in the 
on-going action itself Ge, performance) and 
its consequences for others. We might be in- 
terested in what the experiences do to Johnny 
(learning), or we may be interested in the fact 
that he did not take the cookie box, or that 
children will do all kinds of things to get ice- 
cream cones. The only important moral here 
is, as we have stated elsewhere, that both these 
kinds of analysis are important, and SR theory 
is increasingly grappling with both. F. H. All- 
port's (1940) "event system" theory can perhaps 
be viewed as an extension of S-R analysis into 
those aspects of performance which have hither- 
to been left largely in the hands of other social 
scientists. 

The openness of Hull's system. One final 
point regarding S-R theory deserves statement 
here. Note in Fig. 1 the degree to which S-R. 
theory is open to integration with other social 
sciences. “Social situations" can be viewed as 
the language area in which the learning and 
performance aspects of Johnny's action are de- 
veloped and controlled, and therefore integra- 
tion with concepts of “market” and “social set- 
ting" is very possible. The problems involved 
in such integration are great, however, since our 
seciological friends who, as Doob (1952) sug- 
gests, study the "stimuli" involved in everyday 
behavior, have their own ways of defining “so- 
cial situations." 

Integration of sociological and psychological 
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thinking may come by way either of their defi- 
nition of “sociological stimulus factors” in terms 
of some psychological theory such as Hull’s, or 
through redefinition of psychological theory in 
terms of some sociological theory, such as that 
of Homans (1950). The probable long-range 
alternative is the developing of a kind of meta- 
language which will permit a rough translation 
from one kind of theory to another. It may be, 
however, that other constructs in social science 
will take on the meaning of the terms of the 
most powerful theory extant at any given 
time. If so, the fact that one can defend Hull’s 
theory as being an approximation to a theoreti- 
cal system, whereas that of Parsons is confessed 
to be only a categorical system, may be an im- 
portant consideration for social psychologists. 

Hull's postulate system. The above discussion 
is only a rough summary of Hull’s system. We 
have left implicit much of the content of Hull's 
postulates and corollaries. Yet it is to this 
exact statement of the system that the critic 
and/or generalizer must turn. Hull's (1950) 
new postulate set represents a consolidation and 
rearrangement of the earlier (1943) system. The 
most important change is a de-emphasis of the 
notion son (or habit strength) and greater 
emphasis on gE, (or excitatory potential) , 
which is now the most important notion in the 
system. 

Let us mention just two matters with regard 
to the postulate system. The first is that Hull's 
postulates include explicit recognition of almost 
all of the “additional processes of learning” 
which we have outlined above. Hull accepts all 
of these as, so to speak, important basic notions 
necessary to a predictive theory of learning. 
Guthrie, on the other hand, tends to consider 
these as phenomena which are to be explained 
by what we have termed his “principle of acqui- 
sition.” The reader desirous of getting a rough 
notion of Hull’s postulates can find it in our 
discussion of the additional empirical functions 
discussed above. The notion of Rg, or “frac- 
tional anticipatory goal response,” is not sys- 
tematically dealt with in either of Hull's 
statements of his system, but Spence (1951) has 
displayed the way in which it would be viewed 
as functioning in the system. 

The second matter regarding the postulates 
which is germane to the present discussion is 
the contents of Hull's “Individual Differences" 
Postulate — Postulate XII. This postulate states 
something about how Hull would generalize his 
own theory. He suggests that although the gen- 
eral form of the relationships that he has plotted 
should remain the same when generalized, the 
constants in the equations would be expected 


to vary according to the species of the organism 
studied, the individual involved, and also with 
certain states of the individual so involved. It 
might be worth while to add to this list some 
of the entities of social psychology, ie., the 
constants should vary with whatever “modal 
person" you are studying, or whatever “cul- 
ture" or subculture the modal person comes 
from. Atany rate, Hull's "vulnerability," which 
he greatly valued, exists with regard to the gen- 
eral form of the relationships, not the details. 
It is in this spirit that we have discussed the 
theory. 

The problem of learnable drives. In the con- 
text of Brown and Farber's thinking we find it 
necessary, according to the requirements of the 
theoretical systems, to name the two relevant 
drives active in Johnny at the beginning ‘of his 
action sequence. The problem of how to name 
such drives in the most useful way has been a 
recurrent one in social psychological thinking. 
Murray (1938) has provided us with some fruit- 
ful guesses on this score, and McClelland (1951) 
has extended some of Murray's thinking to an 
R-R operational level. The basic problem, how- 
ever, of a full S-R analysis of such concepts as 
needs or motives is still in the developmental 
stage. Sears (1951) has pointed out that one 
of the fallacies in past motivational analysis has 
been the assumption of static traits or drives, 
whereas the problem probably should be stated 
in the context of changing structures, therefore 
in terms of a theory of learning. 

Two present lines of thought and research in 
S-R theory may eventually help solve some of 
these problems. Within the framework of Hull's 
postulate III (1950), Miller (1951) has dis- 
cussed how “fear” can be viewed as a learnable 
drive. Miller views fear as a response which 
is probably innately conditioned to painful or 
noxious stimulation. Insofar as it functions as 
a response, then it should follow the laws of 
response, one of which is that responses are con- 
ditionable to previously neutral stimuli. Miller 
also views fear as a response which produces 
strong stimulation, and therefore it has im- 
pelling or drive properties. Further, Miller 
views fear as a “cue,” that is, as a provider of 
information which can also function as a gen- 
eralizing or mediating stimulus. He has pro- 
duced an impressive array of evidence to this 
effect, and this example of a learnable drive 
was found to be quite useful in Dollard and 
Miller's (1950) analysis of psychotherapeutic 
procedures from an SR point of view. "Fear" 
becomes established as a response to previously 
neutral stimulation only under conditions 
where the fear is reduced as a result of some 


STIMULUS-RESPONSE CONTIGUITY AND REINFORCEMENT THEORY 71 


trial and error or instrumental activity of the 
organism in which it has been aroused. Such 
an extension of the principle of drive reduction 
has been criticized by Mowrer (1947 and 1950) , 
and much research is currently being done in 
this area to clarify the factual and conceptual 
problems. 

"There are obviously other classes of learnable 
drives besides fear. Miller (1951) himself has 
provided a careful summary of all the empirical 
data with regard to motives which develop as 
the result of the operation of secondary rein- 
forcements of other kinds than merely fear 
reduction. Probably no aspect of Hull's be- 
havior theory needs greater clarification than 
the area of "secondary phenomena." In the 
study of motives which develop as a result of 
positive reinforcements from the social or intra- 
personal environment, the present tendency is 
to infer the strength of the motive from the 
frequency or intensity of instrumental activities, 
which are classified together by reason of the 
fact that they have a similar effect on the en- 
vironment. As an example of this, let us 
imagine our friend Johnny in a nursery play- 
room playing with dolls. We shall use the doll 
play situation as a “projective” test and will 
categorize as "aggressive" all the things which 
Johnny has the dolls do which have a destruc- 
tive effect on the other dolls or the nursery 
schoolroom. We can use some index of the 
frequency of such acts as a definition of the 
strength of Johnny's “aggressive motive" in this 
situation. One of the theoretical problems will 
be to relate Johnny's behavior in this situation 
with the relative frequency of his aggressive acts 
in other situations (R-R analysis), but the ma- 
jor purpose of S-R analysis will be to ferret out 
the antecedent conditions that led to the current 
strength of aggression that Johnny displays in 
the present field. This will necessitate con- 
sideration of other emotional and "cognitive" 
factors which are interacting with drive-aggres- 
sion in Johnny's personality structure. Several 
complex social motives are being approached in 
this fashion — dependency, guilt, affiliation, and 
achievement have been dealt with either on the 
response side or in terms of a full S-R analysis 
(Sears, 1951; Whiting and Child, 1953; McClel- 
land, 1951). 


MOWRER's DUAL THEORY 


A brief discussion of O. H. Mowrer's recent 
theory (1947) may provide us with closure 
for our section on Hull's theoretical system, 
since his dual theory (like that of Schlosberg, 
1937, Skinner, 1938, and others) is intimately 


related to S-R reinforcement theory. Until 
1947, Mowrer was one of the most active ex- 
ponents of a monistic theory of the drive reduc- 
tion variety. Since then, however, he has 
provided a clear and consistent exposition of a 
dual theory which can be succinctly described 
as an S-S and S-R theory. The theory has not 
been worked out in a formal fashion such as 
Hull's, but such formal expansion is implicit 
in his writings. 

Two approaches will assist our understanding 
of Mowrer's thinking. First, we may note that 
Mowrer places great weight on the fact that in 
the Pavlovian and Thorndikian experiments 
two different classes of response are measured. 
If Johnny were a subject in.the Pavlovian situa- 
tion, we would be interested in the measurement 
of his salivary response (controlled by unstriped 
musculature). In the Thorndikian situation we 
would have our instruments prepared to record 
Johnny’s walking, talking, and chewing be- 
havior. These differences, and certain relation- 
ships that obtain regarding these kinds of 
response, lead Mowrer to make a fundamental 
distinction between emotional learning (Pav- 
lovian conditioning of unstriped muscle re- 
sponses, mediated by the automatic nervous 
system) and problem solving (Thorndikian 
learning of striped muscle responses mediated 
by the central nervous system). 

Second, we must recall that Mowrer (1939) 
had earlier proposed that the emotion of "fear" 
be used as an intervening or mediating variable, 
the reduction of which would be a form of rein- 
forcement. This concept provides an interpre- 
tation of many experimental facts (see Miller, 
1950) , and it has led to many experiments (see 
Mowrer, 1950). 'The problem that worried 
Mowrer, however, was this: What are the laws 
which will best predict the strength of learning 
of a "fear"? In the avoidance conditioning 
situation a conditional stimulus (e.g. a bell) is 
followed in time by a shock (US). Eventually 
the animal makes the motor response necessary 
to avoid the US, and this response occurs when 
the CS is sounded. The motor response leads 
to reduction of shock stimulation (and also, 
presumably, of *'fear"). A consistent monistic 
drive-reduction theory would say that this drive 
reduction (i.e., reduction of "pain" and “fear’’) 
also led to the learning of the fear response to 
the CS. 

It is here that Mowrer departs from drive- 
xeduction theory. Mowrer (1950) argues that 
it is against common sense, as well as some ex- 
perimental facts, to think that the fear is con- 
ditioned to the CS as a result of eventual reward.’ 
Emotional learning (or the learning of a “prob- 
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lem") is achieved neither by Guthrie's principle 
of SR contiguity, nor by Hull's principle of 
drive reduction. Rather, it is learned solely by 
the contiguity in time of two stimuli, the CS and 
the US, and the contiguity laws worked out by 
Pavlov and others apply to this learning. Mow- 
rer's thinking here is quite close to that of 
Tolman (1932), and we can use Tolman's com- 
mon-sense terminology to make the notion more 
clear: the CS comes to function as a sign of 
the US through the mediation of the fear re- 
sponse, which provides the "meaning" to the 
sign. 

bui let us not leave the animal with only a 
problem of feeling afraid. He also sometimes 
learns to do something about it — he learns 
to jump when the CS is sounded and to escape 
the shock. This learning, Mowrer feels, cannot 
be predicted by an S-S theory, but requires 
principles of the Hullian type, leaning on 
reduction of the strong stimuli aroused by the 
fear response. The relevant laws are those 
which relate a response to a drive reduction — 
those laws which deal with response selection 
or problem solving. In short, S-S laws will pre- 
dict the learning of our emotional problems, 
and S-R drive-reduction theory will predict the 
way in which solution of such problems will 
occur (when it does occur) . 

Another way of describing Mowrer's two 
kinds of learning is to call conditioning “cue- 
drive" learning, and problem solving "drive: 
response" learning. Mowrer has talked mainly 
of fear as the drive in "cue-drive" learning, so 
we must expand his present model to discuss 
Johnny's ice-cream seeking behavior. The prob- 
lem is how to name the ice-cream appetite which 
motivates the sequence of responses. Presum- 
ably, Mowrer would consider the bond between 
the salivation response and the ice-cream bell as 
an example of “cue-drive” learning, the strength 
of which could be controlled (and predicted) 
by arranging the temporal relationship between 
the bell (CS) and arrival of ice cream on the 
tongue (US). The important point here is that 
eating the ice cream, or reducing the drive, is 
not the important factor, but is merely the onset 
of the US. This gives the ice-cream bell mean- 
ing or, in Mowrer's terms, conditions the drive 
to the cue. The distinction between the “onset 
of the US" (i.e., the arrival of ice cream on the 
tongue) and a "drive-reducing state of affairs" 
is more clear when we think of the avoidance 
situation discussed above, where the "onset of 
the US" is the application of shock, and the 
“drive-reducing state of affairs" is the arrival of 
the animal in a nonshocking, nonfearful place. 

The overt responses of running, asking 


mother, and getting the nickel, as well as the 
behavior involved in paying the vendor, would 
all be learned according to the basic principles 
discussed by Hull. Mowrer does not, however, 
attempt to expand these principles to include 
“cue-drive” learning, as Hull does. 

Mowrer also differs from Hull in his use of 
the important concept of secondary reinforce- 
ment. Hull uses a concept of secondary rein- 
forcement which permits a previously neutral 
stimulus (like Johnny's nickel) to take on the 
capacity to trip off the reinforcing mechanism. 
That is, obtaining a nickel affects Johnny, for 
at least a time, as if he were eating ice cream (or 
reducing some other drive). Mowrer (1952), 
however, does not think that this occurs, but 
rather that secondary reinforcement should be 
given the meaning of “a reduction of a secon- 
dary drive." This requires, therefore, that when- 
ever a secondary reinforcement occurs, some 
secondary drive must be present in order to be 
reduced. Mowrer (1952) suggests that when- 
ever any drive is strongly aroused, fear is always 
a concomitant, that fear is a drive which is 
conditionable to external stimuli, and that it is 
also a stimulus which can subsequently be re- 
duced. When it is reduced, the neutral stimulus 
objects (such as nickels) can become condi- 
tioned signs of a fear reduction. Their subse- 
quent presence will thus maintain behavior in 
the way usually found in secondary reinforce- 
ment experiments. 

A crucial test between these two approaches 
to the vital question of secondary reinforce- 
ment is very difficult to achieve, but could be 
of great importance to behavior theory, since it 
impinges on the area of greatest interest to social 
psychologists. 

Mowrer has continued to emphasize fear as 
his major example of a secondary drive, but he 
often speaks of "secondary drives." 'This sug- 
gests one of the central problems of behavior 
theory and of social psychology: How can we 
approximate a more adequate classification of 
such secondary wants and how do such new 
wants become learned? Perhaps fear is the only 
secondary drive necessary, but the implications 
of such extreme parsimony have not yet been 
worked out. 

Perhaps the clearest evidence regarding this 
theory of Mowrer's is the finding by Solomon 
and Wynne(1950) that when a measure is taken 
of the latency between the CS and the motor 
response as avoidance learning progresses, the 
curve is not a smooth single function, but shows 
two points of inflection. One of these may re- 
flect the learning of fear, with the other re- 
flecting the learning of the motor response to 
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this fear. Solomon and Wynne find, however, 
that even when the autonomic nervous system 
is inactivated by use of drugs, some avoidance 
conditioning occurs. This may mean that what 
Mowrer calls emotional learning involves some 
skeletal responses as well as emotional ones. 
Alternative explanations of much of the data 
is still possible, however, and Miller has been 
able to make a spirited and ingenious defense of 
the monistic drive-reduction position (Miller, 
1951b). 

Mowrer's theory has generated considerable 
research, and the social psychological implica- 
tions of the theory have been partly worked out 
by Dewey and Humber (1951), and most com- 
pletely by Mowrer himself (1950). One might 
include Skinner's (1953) recent discussion of 
personality and social psychological issues as a 
treatise argued from much the same position as 
that of Mowrer, although Skinner's empirical 
notion of reinforcement and his attempt to 
avoid intervening mechanisms, particularly 
physiological ones, makes the discussion appear 
to be more different than it is. 


GUTHRIE'S GENERAL THEORY 


Guthrie has recently remarked that the dif- 
ference between his own theory and that of the 
drive-reduction theorists "is of great ultimate 
importance but of slight practical and immedi- 
ate importance" (1952, p. 235) . From the point 
of view of systematic generalization to social 
phenomena, this does not follow, because Guth- 
rie is here discussing the essentials of the two 
doctrines of reinforcement, not the form of the 
theories he has compared. One of the main 
values of a theory like Hull's is its heuristic 
value for those who wish to create a social psy- 
chology on the basis of funded experimental 
facts, Heuristic value in this case is a matter of 
great practical importance. 

Guthrie, for example, agrees that his theory is 
not set up to handle felicitously some of the 
systematic changes in response phenomena 
which occur when the same stimulus situation 
is repeated again and again. Hull has focused 
upon such phenomena to build his system, 
whereas Guthrie agrees when Dollard and Mil- 
ler (1951) simplify the system by dropping out 
such factors, and focus, as Guthrie does, on 
the single principle of the necessary and suf- 
ficient conditions for acquisition (1952, p. 225) . 
Yet to the student of child development and 
socialization, the fact of repeated feedings of a 
baby by the same person at about the same time 
every day is a type of fact which permits a basis 
for ordering the empirical field. In this sense 


Hull's heuristic model is a very practical gadget. 
Guthrie can generate these phenomena, but 
only with confessed difficulty (1952, p. 229). In 
short, Guthrie has not developed a predicting 
theoretical system but prefers to use with great 
intuitive wisdom a basic principle, and to show 
with startling acuity how his principle can be 
viewed as operating in any learning situation. 

Guthrie's writings do not fit the title “theo- 
retical system" in the same sense as Hull's. His 
thinking might be characterized as a notion plus 
wisdom. We include a discussion of his theory 
here because there are various signs that his 
thinking is being built into one or more theo- 
retical systems (Estes, 1950; Mosteller and Bush, 
1951), and will probably become more sys- 
tematically important in the future. The theory 
has already been important for some time in 
its effect on other theories. It has also been 
experimentally fruitful. Guthrie is somewhat 
of a latter-day psychologist's psychologist. 

The “stimulus-response” formula. The prob- 
lem of "stimulus" and “response” in Guthrie's 
thinking has remained closer to that of J. B. 
Watson (1913) than has that of any other cur- 
rent S-R theorist. Guthrie puts it this way when 
he is comparing his own thinking with that of 
Tolman: 


This book [his own] has undertaken a very 
different form of restriction for the purpose of 
understanding, namely, the restriction of the basic 
facts to stimuli to sense organs and to animal 
movements, with the limited admission of certain 
movement groups or classes defined by their end 
results. This is the highly controversial stimulus- 
response formula. Unlike Tolman's view, move- 
ments are not regarded as alternatives to the 
formation of likes and dislikes, to the perception 
of time and space, to the possession of the equiva- 
lent of a map, or to the sets which make short 
cuts and localization possible. Every horse owner 
knows that his animal has a strong tendency to 
maintain an orientation toward the stable where 
it is fed even when this place has been out of 
sight for some time. But orientation toward the 
stable is not something the horse uses instead 
of movement patterns. Men can demonstrate 
virtual possession of a "map in the head" by 
speech. But in this book speech is regarded as a 
form of motion. It is much more. But it is also 
motion. (Guthrie, 1952, p. 196. See Tolman, 
1932, for comparison.) 


Although probably all S-R theorists do accept 
the general notion of the S-R formula expressed 
in Guthrie's statement, there are differences, of 
course. Hull accepts it insofar as his associative 
concept (ie. sHp) defines an association bond 
(H) between an afferent (S) event and an ef- 
ferent (R) event. Hull has gone beyond this in 
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other concepts, however. Drive (D) is defined 
(Hull, 1951) in the postulate in which it is in- 
troduced as a strong stimulus, but the stimulus 
is not further defined except by way of the en- 
vironmental condition or operation of depriva- 
tion. There are further nonassociative elements 
in Hull's system. For example, I, has no spe- 
cific S subscript. Although Solomon (1948) has 
provided a peripheral stimulus definition for Ip, 
this is not the only possible interpretation of 
Hull's concept. Such nonassociative factors and 
such "operations" may refer to the fact that 
Hull is not a strict S-R. theorist but rather an 
S-O-R. (stimulus-organism-response) theorist in 
the sense that Woodworth has used these terms. 
(See Spence, 1952b.) 

Further, the "S-R. formula" above does not 
note the newer uses of the term "stimulus," such 
as that of Skinner (1938), who uses the term 
to refer to any part of the environment which, 
when modified, induces the occurrence of a 
given part of behavior. 'There has, therefore, 
been a de-emphasis of any particular existential 
definition of the term stimulus and of the term 
response. Possibly this freedom to try out dif- 
ferent notions or functional definitions (see 
Miller, 1951b) of stimulus and response has 
freed S-R theorists for the broad lines of re- 
search they have explored in recent years. It is 
certainly a basis for the notion in this chapter 
of the possibility of fruitfully generalizing basic 
experimentally derived theories. 

Guthrie himself reflects this change in part. 
He accepts the notion of "actions" as function- 
ally equivalent and overlapping (ergo, “organ- 
ized") sequences of movements. These actions 
occur on an all-or-none basis, but stable and 
predictable repetition of such sequence depends 
on the conditioning of a “key” or lead response 
to a broad number of the stimulus aspects of 
the situation. In this sense Guthrie deduces 
from his basic theory the possibility of a unit 
of complex instrumental responses, but only 
under conditions where such instrumentality is 
actually achieved. The explanatory level re- 
mains that of "movements." 

Guthrie also reflects a change in the use of 
the term "stimulus." In his recent book he 
writes: 


"There are two requirements which stimuli must 
satisfy in order to serve as scientific data. The 
first is that they must be physical changes af- 
fecting and activating sense organs. The second 
requirement serves a different purpose. In order 
to be usable as psychological data stimuli must 
be describable in terms acceptable to all scientific 
observers, whatever their theoretical position . . . 
[such as] the sound of, the whistle . . . the sight 


of the pattern on the door .. . being recumbent. 
(1952, p. 276) 


Guthrie sees the second criterion as one which 
serves a practical purpose in scientific communi- 
cation, despite the fact that these examples of 
"stimulus" are already highly organized for the 
scientific observer as the result of past experi- 
ence. 

Guthrie, therefore, reflects to some extent 
the more “molar” uses of these terms S and R, 
but he still has as his major suggestion that in 
studying behavior change, we note very care- 
fully just what the animal does. That is, saying, 
"Johnny went and got some ice cream," is not 
an adequate description of behavior for Guth- 
rie. It refers merely to the fact that Johnny was 
successful, but it does not tell enough about 
what he did. When Skinner (1938) records 
how many times a rat presses a bar, he is count- 
ing "environmental events," or the effects of the 
rats movements on the environment. But 
Guthrie argues that the rat could have achieved 
the change in the environment by use of his 
paw, his snout, his ear, or one of many other 
“movements.” ‘That is, calling the response a 
"bar press" or an “ice-cream getting" covers up 
the important event, that is, whether the re- 
sponse was “pawing at the bar," “a snout-press," 
"an ice-cream snatch,” or “an ice-cream run.” 
Imagine how far we are from Guthrie's type of 
objective description if our protocol merely says 
“Johnny stole an ice-cream cone”! à 

Guthrie agrees, of course, that stealing and 
ice-cream-getting occur, but his point is that 
these should be given a special term — "ac 
tions.” He also argues that, in learning, some- 
times it is actions which are learned, that is, 
whole patterns of movements (which carry with 
them their own stimuli to which the next move- 
ment is conditioned) are sometimes conditioned 
to a new stimulus, and we might refer to an 
entire bundle of responses of this kind in terms 
of its results. 

The principle of S-R contiguity. But the most 
important thing to keep in mind, from Guth- 
rie's point of view, is the deceptively simple 
principle that learning occurs when (and only 
when) a stimulus occurs in close contiguity 
with a movement. We have already outlined 
this principle, which Guthrie uses to explain the 
empirical facts of the acquisition and extinction 
of a behavior sequence. Let us consider it more 
carefully in the present context. 

Guthrie states his principle of learning by 
contiguity of S and R in the following way: 


The present theory assumes that the repertoire 
of responses is set by the physical structure of the 
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responding organism and the indefinitely large 
number of combined patterns of which the effec- 
tors are capable. It assumes also that once any 
pattern of response has been evoked in a stress 
situation (all stimuli set up stress) which includes 
a given signal or cue, the given signal or cue 
becomes an elicitor of the response pattern and 
remains an elicitor until the signal has been pre- 
sented with a new response. Acts and movements 
must terminate and make their continuance im- 
possible. One can descend the stairs but cannot 
immediately repeat it because he is now at the 
bottom, not the top. The raised arm cannot be 
raised; it can only be lowered. 


In any general situation, therefore, there will 


be a series of responses, and each new response 
will alienate its predecessors from their association 
with any signal present when the new response is 
given. In stress situation S, (m indicates the 
stimuli here called "maintaining stimuli"), a se- 
ries of responses, Ry, Re, Ra, . . ., R,, R, is evoked. 
If R, removes S,, or removes the animal from S,» 
then S,, remains a signal for R,, its last associa- 
tion. (1952, p. 297) 


Guthrie’s analysis of other learning phenom- 
ena. After we presented our example of John- 
ny's sequence of behavior we outlined a series of 
empirical phenomena of learning which Hull, 
we saw, considered important enough to be 
included as separate postulates in his system. 
Guthrie recognizes these phenomena, but he 
does not consider them as sufficiently representa- 
tive of the basic behavior law he is working out 
to be considered as essential reference data. 
Stimulus generalization, so important in the 
Hullian system, is dealt with by Guthrie in a 
different manner. The basic empirical fact is 
that other stimuli which are similar to the 
conditioned stimulus will evoke the same re- 
sponse, to the degree to which such stimuli are 
similar. Johnny, for example, would probably 
salivate to a knife-grinder's bell unless he had 
Specifically learned to discriminate between the 
two sounds. If this occurred, Guthrie would 
describe the phenomenon as due to the fact that 
there is a common response which has occurred 
to both bells, or at least responses which have 
identical components. That is, Johnny tends to 
"listen" as a response to both sounds, and this 
orienting (or "key") response is chained to the 
conditioned response through the mediation of 
"response-produced cues." In short, the orient- 
ing response of listening leads to the evocation 
of the same series of responses, including the 
conditioned response, but the causal agent is the 
similarity in the orienting responses, not in the 
similarity of the external sounds themselves. If 
Johnny had made a discrimination between the 
two stimuli, then there would be a difference in 
the orienting responses such that the generaliza- 


tion phenomenon would not occur. Johnny 
might say “knife grinder” to S, and “ice-cream 
man" to So, while listening. 

We shall see below that Osgood (1952) has 
attempted to set up ways of measuring inde- 
pendently these mediating responses, which in 
an SR theory (Hull's included) are a prototype 
of "meaning.". It should be noted, however, that 
Guthrie's theory does not accept generalization 
as a special process. He tries to show how gen- 
eralization phenomena, when analyzed care- 
fully, are not exceptions to the rule that we 
always respond to the same stimulus with the 
same response, The problem is to keep our eye 
on the changes that occur in responses and in 
stimuli. Generalization appears to imply that 
organisms sometimes respond with a condi- 
tioned response to a stimulus to which the 
response had never been conditioned. To 
Guthrie this interpretation is only an appear- 
ance — there is always a specific mediating 
stimulus which sets off the observed response. 
In short, transfer of training is always accom- 
plished by identical elements. The illusion 
arises when we record the wrong stimulus or the 
wrong response or have not noted that the 
mediating response is conditioned to many “ex- 
ternal" stimuli, or that the stimulus produced 
by the mediating response is conditioned to 
many other measurable responses. 

The reader who is unacquainted with Guth- 
rie’s thinking might wonder how Guthrie would 
explain such phenomena as the apparent stabi- 
lization of learning which occurs as the number 
of trials increases. It should be remembered 
that Guthrie sees reinforcement as an all-or- 
none phenomenon. One might expect him to 
agree with discontinuity theorists of learning 
that learning has occurred when the cognitive 
(stimulus) field is suddenly restructured. To 
think this would be incorrect, since Guthrie 
talks about the fact that the learning curves we 
usually find are due to the fact that the response 
being learned is becoming conditioned to more 
and more of the stimulus components of the 
situation, and therefore the performance seems 
to be improving slowly. This means, then, that 
in most situations, learning, although it occurs 
in an all-or-none fashion, appears to be cumu- 
lative, unless we can watch more carefully for 
the movements and stimuli which are suddenly 
and completely conditioned or the useless move- 
ments which drop out for one of the reasons we 
lrave given above. This constant emphasis on 
the necessity for behavior analysts to look care- 
fully at what the animal last did in the situation 
is the major advice Guthrie has to offer. ) 

Guthrie's primary reference data (Guthrie 


D 


76 HANDBOOK OF SOCIAL PSYCHOLOGY 


and Horton, 1946) are contained in a series of 
experiments in which he and Horton photo- 
graphed the behavior of cats who, upon being 
placed in a box, learned to press a pole away 
from the vertical and thereby escape from the 
box. Guthrie found that his best chance of 
predicting the exact pattern of movements the 
cat would use on the next trial was to predict 
that the cat would do what he last did. This 
prediction would not always work, of course, 
but it worked very often. Further, Guthrie 
found that the cats tended to repeat over and 
over a limited set of movement sequences, so 
that knowledge of these possible sequences 
might assist the prediction even more. Note 
that Guthrie, in studying cats in the puzzle box, 
was interested not in the speed with which the 
cats would “succeed,” but mainly in studying 
the exact sequence of movements which the 
cat used in achieving success. Perhaps the in- 
creased use of moving pictures in social psychol- 
ogy and anthropology will render Guthrie's 
thinking more directly relevant to analysis of 
problems involved in teaching new skills to 
members of cultures with highly stylized move- 
ment patterns, to uncovering laws of the 
“language of movements,” which has hardly 
been touched upon except in the limited field 
of verbal movements, or to uncovering the ante- 
cedent laws related to Allport and Vernon's 
(1933) basic factors of expressive movement. 
Guthrie's studies of cats emphasize the rich- 
ness of the cat's movements and also the tre- 
mendous degree of stereotypy which charac- 
terizes behavior when viewed in this careful 
manner. Other research which his theory has 
generated includes attempts to test Hull's no- 
tion that reinforcement is a necessary condition 
of learning (Sheffield, 1950, 1951), studies of 
methods of response extinction which other 
theories would not generate (Seward, 1949), 
analyses of the role of response factors in the 
recall of uncompleted tasks (Boguslavsky, 
1951), and the explanation of Thorndike's 
"spread of effect" by invoking the notion of 
"guessing habits" (Jenkins and Sheffield, 1946) , 
to name only a few. Boguslavsky (1953) has 
recently developed a predicting probability 
model for use with Guthrie's notion of orienting 
response. Guthrie and his colleague Stevenson 
Smith have used the notions of the theory in 
discussing the concept of personality (Guthrie, 
1944), the problems of psychological conflict 
(Guthrie, 1937), and a very sane discussion of 
the problem of eneuresis (Smith, 1948). Some 
of this writing, as Hilgard (1948) has suggested, 
bears no direct or logically necessary relation to 
Guthrie's theory, but it shows that the principle 


of S-R contiguity can function very comfortably 
in the personality analyst’s kit of tools. 

This method of generalizing is usually an 
advance over common sense, or at least casts a 
fresh light on common-sense answers to ques- 
tions. This is possibly due to the fact that Sp 
theorists, either of the contiguity or the rein- 
forcement school, view such problems as "see- 
ing,” “wanting,” and “purposing” as activities 
of the organism which require explanation, 
whereas many writers in psychology assume 
these activities as givens or as properties which 
all human beings display (which, of course, they 
do, much of the time) and use such activities as 
explanations rather than as phenomena to be 
explained. Until social psychology has organi- 
ized its data in a systematic and reliable fashion, 
the systematic extension of general behavior 
theory must give way to the use of “notions plus 
wisdom.” Further, Guthrie’s long-continued 
emphasis on response-produced stimulation has 
been used to great experimental and theoretical 
advantage by other psychologists, particularly 
by Miller (1951b). Guthrie has himself out- 
lined the relation of this emphasis to the more 
recent notion of “feed-back” as developed by 
Wiener (Guthrie, 1952, p. 296). 

We began our discussion of Guthrie's think- 
ing by criticizing his over-all comparison of his 
theory with that of Hull. We shall reiterate this 
criticism and then go on to agree with what 
Guthrie probably meant, Guthrie deals mainly 
with a basic principle whereby behavior change 
occurs, whereas Hull attempts to deal not only 
with such a principle, but also with whatever 
other processes and variables must be consid- 
ered in order to arrive at a rough first approxi- 
mation of a theory which will predict, and 
possibly control, such changes in behavior. 
Guthrie remains parsimonious, and perhaps 
eventually correct, on the issue of whether or 
not drive reduction is "necessary" in a basic law 
of "acquisition." Hull drops the parsimony but 
provides us with a gadget with which social 
psychologists can play at the "game" of predic- 
tive science. 

But Guthrie's notions are, in another practi- 
cal sense, close to Hull's. Guthrie feels that 
Dollard and Miller (1951) are essentially talk- 
ing his language, for all practical purposes. The 
reduction of a drive is one way of changing the 
situation so as to preserve the S-R connection for 
the next trial, particularly if "fear" is the drive 
or maintaining stimulus which is reduced or 
altered. Further, the S-R theorist in the psycho- 
therapeutic situation such as discussed by Dol- 
lard and Miller (1951) has little use for the 
quantified laws relating antecedent conditions 
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to intervening variables, although the serious 
student of personality development or sociali- 
zation may find them of considerable use. 
More explicit attempts at pulling the S-R 
contiguity position closer to that of the S-R re- 
inforcement theory are under way. Mosteller 
and Bush (1951), in a display of theoretical 
fortitude, have attempted to partial out the 
strengths and weaknesses of the two theories 
and to put the virtues together in the format 
of a probability model close to that of Estes 
(1950). The future may see an amalgamation 
of the S-R contiguity notion added to a rein- 


forcement notion, so that one could say that 
the probability of the response occurring on 
the next trial is greater if it (1) occurs and (2) 
is also “reinforced,” than if it only occurs. 
When other processes are explicitly added to 
this kind of theory, the question of the correct 
theory of reinforcement remains a matter for 
quantitative experimental study at the level of 
the theoretical system, and a more useful set of 
equations is required for purposes of generaliz- 
ing (ie. of constructing what we have called 
theoretical-empirical systems) . 


Tue EMPIRICAL-THEORETICAL SYSTEMS 


THE PROBLEM OF GENERALIZING IN SOCIAL SCIENCE 


We have outlined in the context of a sample 
of everyday behavior the theoretical notions of 
three important S-R theories. We have noted 
that the data for much of these theories have 
been induced and that the data according to 
which they are formally defined are limited, 
One of the next steps to be taken by those in- 
terested in such theories would be to deduce 
and test further consequences of the theoretical 
position and thereby increase or decrease the 
confidence one has in the applicability of the 
model to the defining data. This endeavor is 
going forward in many laboratories of the U.S., 
as well as elsewhere in the world, and the theo- 
ries discussed have, over a period of years, been 
changed as a result. A second step which one 
can take is to work out systematically the rele- 
vance of the theory to new classes of data, that 
is, to classes of data which are currently viewed 
as being different in some respects from the 
data used in devising the theoretical system it- 
self. It is to this endeavor that many social 
psychologists have turned in their attempts to 
advance our systematic understanding of prob- 
lems and areas of data usually discussed under 
personality and social psychology. This is the 
problem of generalizing theory in its most 
generic and systematic, and therefore most 
simple, sense, 

This problem is not one which is limited to 
S-R theory, of course. It is a problem faced by 
anyone interested in the advancement of a sys- 
tematic theory in the field of social psychology. 
The problem of “relevance” of a theory to a 
particular set of observations, or of a particular 
set of observations to a particular theory, can be 
a most distressing one, Anyone working with a 
certain social or psychological problem will 
develop a language for discussion of the matter 
which appears to him to be the most relevant, 
and which becomes for him the language for 


describing the problem or area under investi- 
gation. Such descriptive language may be de- 
veloped for a variety of reasons, and these 
reasons need not include the attempt to explain 
systematically, but merely may reflect an in- 
terest in capturing what meaning one can from 
the observations he has made. 

Many formulations of social psychological 
problems have been made in this descriptively 
relevant fashion, and many of the problems of 
social psychology remain in this state of limited 
usefulness. Much of the theory which has 
grown from the study of social phenomena in 
the "natural" setting still suffers from the fact 
that it is a happenstance result of the descriptive 
method, rather than having been systematically 
constructed on a general basis. Such theories 
present, however, a sense of immediate rele- 
vance, and diverging from them may give one a 
sense of reducing the richness and complexity 
of the original problem to something less 
relevant. Many of the theories of this kind are 
based upon what S-R theorists, from their own 
position, describe broadly as R-R (response- 
response) relationships. One may, for example, 
have studied the problem of intelligent behavior 
and have developed measures of an aspect of 
this behavior, working out a series of relation- 
ships among such measures. This immediately 
practical endeavor provides the researcher with 
speedy and amply deserved reinforcement, but 
from an SR point of view such formulations 
often reflect a methodological limitation of the 
field under study. Very complex analyses 
emerge from the structural approaches which 
R-R theory develops by using these measure- 
ments derived from study of the relatively un- 
controlled field. Some theories, such as Lewin's 
and the structural aspects of Freud, as well as 
the empirical research areas of intelligence test- 
ing and projective testing, have in common the 
limitations imposed by the decision to remain 


78 HANDBOOK OF SOCIAL PSYCHOLOGY 


within what one might call “field” or “natural 
observation” methodology. 

Whereas attempts at descriptive pictures of 
already-produced structures have an apparent, 
and often genuine, relevance as their major 
virtue, they are too often at the present time 
limited in the extent and the precision with 
which they can predict more than momentary 
changes in the structures so set up, S-R theorists 
have tended to take another approach to the 
problem of theory construction and therefore 
the problem in generalizing is a somewhat dif- 
ferent one. Recognizing that laws of changes 
of structure may require a different unit of 
analysis and different structure of theory from 
that of the attempt at description of the struc- 
tures themselves, they have placed primary 
effort on the experimental study of situations 
where such change in response hierarchy or re- 
sponse probability formulations occur. To 
maximize the possibility of experimental ma- 
nipulation of variables and controlling of ir- 
relevant factors, they have been forced away 
from the maximization of immediate relevance 
of their units and their language. Spence (1952) 
has noted that this decision was a relatively 
self-conscious one, and in light of some of the 
current advantages that accrue from S-R. theo- 
rists' reanalysis of structures, this decision does 
not need defense. 

Having found the relationships among some, 
at least, of the important variables involved in 
those aspects of behavior change which are more 
or less environmentally manipulable, the tend- 
ency recently has been to try out, either by 
experimental means or by naturalistic observa- 
tion, new classes of referents in terms of their 
interrelations. This is the problem of explicit 
generalizing which marks, in the present writer's 
view, much of the endeavor of social psycholo- 
gists interested in S-R theory. Ceding value to 
this approach does not in the least negate the 
reasonableness of the sophisticated attempts at 
structural analysis of such people as Lewin, 
Parsons, Malinowski, Thurstone, or Tolman. 
To study structures first and their change second 
is quite as reasonable as to study changes of 
structure first and to refine the notions of struc- 
tures later, and there are many times when one 
kind of analysis can complement the other. 
However, Bergmann and Spence (1944) may be 
correct in their assertion that, in principle, if 
you have the relation between S, and R; and 
also the relation between S, and Ry, you can 
often solve for the relation between R, and Rg, 
whereas knowledge of the relation between R, 
and R, does not solve the S-R problem. And 
perhaps, for example, the failure of Pareto's 


attempt at a mathematical theory for sociology 
was due to the impossibility of dealing with the 
large number of variables which structural 
analysis often entails. Perhaps the painstaking 
emphasis on experimental criteria for the ac- 
ceptance of variables and the statement of their 
relative importance will save S-R theory from 
some of the confusing problems that other at- 
tempts have met. In this sense, one of the major 
values to social psychology of the experimental 
method is that it assists in reducing a set of 
variables to a usable number in attempting a 
first approximation to a general theory of be- 
havior. 

It has been fairly widely accepted by S-R 
theorists, or at least by the reinforcement variety 
of S-R. theorist, that a hypothetical construct 
or intervening variable must eventually be de- 
fined in terms of both antecedents and conse- 
quents. This, however, does not restrain pre- 
liminary analyses of a new social or personality 
problem area from being defined tentatively “on 
one side" or the other. Thus it is possible for 
an SR theorist to define on the response side, 
in whatever complexity he finds best, a new (for 
S-R theory) concept such as "drive identifica- 
tion" and to search, with the aid of his theoreti- 
cal system, for the particular pattern of 
antecedent variables necessary to generate the 
sample of complex patterns of responses he has 
tentatively defined as "drive identification." 
One may view Freudian theory as having em- 
phasized antecedent definition to the de-empha- 
sis of both antecedent and ` response side 
operational specification, and such current 
forms of response-response theory as Lewin's as 
having emphasized responseside definition to 
the exclusion of those laws which would gen- 
erate some of the possible patterns of ante- 
cedents for a given “momentary field." Tolman 
(1951) has seen this problem and has apparently 
attempted to construct the general outlines of 
a model to include a solution of all these prob- 
lems. However, his insightful cognizance of 
the full range of problems, and the aim for 
comprehensiveness which his thinking displays, 
probably cannot continue too long to substitute 
for a precise statement of relationships among 
constructs, 

The above propositions may be summed up 
by stating that in selecting his language of de- 
scription or of explanation, the social psycholo- 
gist can usefully recognize that structure analysis 
without process is only part of our task, and that 
process analysis without eventual refinement of 
structural description is not fully satisfactory 
either. Further, the decision as to choice of 
descriptive and explanatory language should 
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not be too colored by what might be called a 
form of “aesthetic fallacy.” That is, theoretical 
analysis does not require that the phenomena 
under study be “phenomenologically” reflected 
in the theory. Most useful explanations go be- 
yond full and aesthetically satisfying descrip- 
tion, and Marx (1951), among others, has 
described the scientific orientation as one at a 
polar extreme from the aesthetic or literary. 

Whether or not these preceding remarks are 
accepted, there will be agreement, the writer 
feels, on the proposition that a theory which is 
anchored to a set of defining data must face the 
problem of checking the legitimacy of any trans- 
lation of this formulation to a new area of data. 
Let us look then at this basic question for a 
moment: How can one generalize, or in other 
words, how can one construct what we have 
called an empirical-theoretical system? 


SOME WAYS OF GENERALIZING 


Operational extension. The first and most 
important way of generalizing is accomplished, 
as the expression used above suggests, by ex- 
tending the class of operations of the theoretical 
system. That is, methods which are similar to 
those of the defining operations of the theoreti- 
cal system are utilized in situations concerning 
which the stated principles or relationships of 
the theory seem to make predictions. A simple 
example of this is the following: the theoretical 
system may say, for instance, that organisms in 
a state of hunger drive move toward food more 
rapidly than those which are not in such state. 
The defining operations of the theory, to sim- 
plify, may be those concerning rats in a runway 
situation. Now we may extend this to predict 
that children coming home from school at noon, 
when they are hungry for their lunch, will make 
the trip more rapidly than if they come home 
from school at some other time when the hunger 
drive is lower. Or, to consider Johnny's situa- 
tion, if the ice-cream man comes around just 
before lunch or dinner, he may run faster to his 
mother for the nickel, and on to the ice-cream 
man, than if the vendor arrives just after he has 
finished a meal. The operations for measure- 
ment in these latter situations are very similar 
to those of the rat runway problem; they in- 
volve the measuring of comparative time inter- 
vals. The extension may be considerably less 
obvious than this one, of course. For example, 
having found such a temporal relationship in a 
rat runway problem with such an antecedent 
index as strength of drive, one could also, with 
human subjects, use latency of response in an 
association test which included food words and 


other words, having varied the degree of food 
deprivation of these subjects as the antecedent 
variable. One may even attempt to take some 
complete theoretical model which is defined in 
terms of specific reference data and apply it to 
a different species of organism, or to another 
aspect of the behavior of the same species. 

If the theoretical relationship is as predicted, 
and particularly if additional deduced conse- 
quences are also upheld, one can feel relatively 
confident that the original theoretical model 
applies to the new area of data. This procedure 
is an exact and a difficult one, however, particu- 
larly for social psychology. Among other prob- 
lems is the decision as to what operations are 
"similar" to the defining ones of the theoretical 
system. A program of research undertaken by 
Solomon and his colleagues can be placed in 
this category of operational extension and will 
be discussed further below. Another example is 
the study by Dinsmoor (1952), who extended 
the operational denotations of some of the “new 
look" perception findings to a Skinner-type con- 
ditioning situation, obtaining similar relation- 
ships to those found in the former situation. 

A major advantage of this method of general- 
izing is that one can feel relatively sure that the 
theoretical model does not apply to the new 
area, or is at fault in some way, if a clear mal- 
prediction occurs. It may be useful to spot check 
in this way in some new area before going on 
to less conclusive, although no doubt easier, 
methods of generalizing. 

Descriptive analogy. This type of generaliz- 
ing is at first glance appealing because of its 
simplicity, but too often the simplicity is de- 
ceptive. It has been widely used, however, and 
certainly merits discussion. Here one finds that 
a description of a theoretical concept, a bit of 
defining data, or a relationship sounds similar 
to the description of some other concept, bit 
of data, or presumed relationship outside the 
theory. One then, by such analogy of descrip- 
tion, adds to the latter some of what is known 
to be the case with the former, and assumes 
generalization to have taken place. The extra- 
theoretical data and the theoretical propositions 
often are found to fit quite well, but here con- 
siderable caution is needed, since the similarity 
may be only that which first suggested the 
analogy, with nothing having been added — 
except a heuristic device of doubtful propriety. 
Many examples of this method are found in 
discussions of the social structure in analogy 
to the human organism. F. H. Allport (1942) 
has pointed out problems raised in this sort 
of proceeding. A very early instance of this 
in modern social thinking is Hobbes’ compari- 


80 HANDBOOK OF SOCIAL PSYCHOLOGY 


son of the state with the human organism. Some 
other “simple and sovereign” concepts of social 
science are of this nature. One takes a notion, 
such as power, sympathy, or imitation, from a 
situation where it is applicable and applies it 
to other similar-appearing types of relationships 
as an explanatory device. 

To use this method of descriptive analogy at 
all usefully, some restrictions should be ob- 
served. To make predictions which are more 
precise than the early observed similarity which 
prompted the analogy, the theoretical context 
from which the analogy-term is drawn should 
contain a set of at least two variables, or one 
quantifiable variable. It may be that the origi- 
nal variable has implicit properties which will 
fit it for such use, if these properties are first 
explicated carefully. It probably goes without 
saying that where such quantification, or dif- 
ferential prediction, is possible it should be 
undertaken, if descriptive analogy is to be 
useful. 

Operational analogy. In this method the 
analogy is between the measure we would like 
to have and the measure which it is possible, or 
most feasible, to obtain. This may sometimes 
be a measure of approximation, where a part 
of a realm of data stands for the entire realm 
of data desired, but often it involves operations 
purely analogous to the desired operations. The 
method of operational analogy is a very reason- 
able compromise procedure for many areas of 
investigations. Thinking in terms of Brown and 
Farber’s outline of Hull’s theoretical system, we 
might, in doing a study of the socialization of 
aggression in the Hopi culture, for instance, 
desire a measurement of the frequency with 
which the child has been punished for naughti- 
ness. Let us assume that it is not feasible to 
obtain an actual frequency count of such pun- 
ishments in a random sample of Hopi children, 
the time sampling technique extending over a 
period of years. We may compromise with an 
analogous operation. We attempt with inter- 
view methods, or some other device, to ask cer- 
tain strategic questions of the parents of these 
children, and, with some tentativeness, accept 
the estimates of the number of punishments for 
naughtiness as sufficiently analogous to the kind 
of measure we really would like to obtain. 

An example of an excellent study using opera- 
tional analogy is Pauline Sears' (1940) investi- 
gation of level of aspiration. Her operational 
analogy was in utilizing the records of schoo! 
children's marks as standing for their full his- 
tory of failures and successes, and she related 
these to patterns of level of aspiration deter- 
mined experimentally. 


Operational analogy meets the criteria sug- 
gested above for the reasonable use of descrip- 
tive analogy, and goes somewhat further toward 
the kind of integration of data and theory that 
many social psychologists desire. One of the 
ways in which the researcher can check whether 
or not he has achieved a reasonable analogy to 
the measure he would like to have, would be to 
see whether such analogous measures are re- 
lated to one another, or to actual measures, in 
the manner suggested by the theoretical system. 
Thus one value of the theoretical system, as dis- 
cussed previously, is that it can provide us with 
specific notions which state the relationships 
among a set of potentially useful variables, for 
broader operational approximation. A second 
value here of the theoretical system is that it 
can provide us with a fairly large set of ex- 
plicitly interrelated variables so that any tem- 
porary success we have with our analogous 
measures in approximating the experimentally 
found relationships of the theory can be further 
tested by relating our measures to additional 
approximations, as suggested further by the 
theory. 

At this point a problem arises concerning 
which S-R theorists have disagreed among them- 
selves. Some argue that the S-R position rules 
out the use of human yardsticks in research. 
Thus Bergmann and Spence (1944) would argue 
that the use of human observers or interviewers 
in making judgments or ratings of behavior is 
a measurement which is one step away from the 
kind of direct measurement of behavior which 
they recommend. Other SR psychologists feel 
that rating techniques and content analysis 
methods, such as the ones being developed for 
use with the TAT, are acceptable either for 
basic experimental purposes or for operational 
analogy. It is interesting to note that some quite 
influential research (e.g., Eleanor Gibson, 1942) 
in the recent human learning laboratory has 
found it desirable to utilize human judges in 
order to calibrate the degree of similarity among 
stimulus materials used in the experiment. It 
is amusing to note that Schlosberg and Solomon 
(1943) utilized psychological methods with hu- 
man beings in order to calibrate degrees of gray 
on stimulus cards later used with rats, with pre- 
dictable results, Perhaps the resolution of this 
kind of methodological controversy among so- 
cial psychologists who have found Sp psychol- 
ogy stimulating would be to consider the 
psychophysical problems involved in the train- 
ing of judges and one-way screen observers as 
the basic procedural research area it undoubt- 
edly is. Most social psychologists consider such 
problems as merely additional operations, to be 
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tested for reliability and then added to the 
operational limitations for generalization of the 
findings obtained with such methods. 

Quandaries in generalizing. Vincent and 
Helen Nowlis (1952) have recently given a good 
summary of some of the problems which arise 
when one is faced with the failure of a particu- 
lar hypothesis to be upheld when generalizing 
from a basic experimental system. They see at 
least three possibilities: (1) There has been in- 
adequate translation of the concept or set of 
concepts from the simple laboratory situation 
to social situations, (2) the research may have 
failed to develop adequate instruments for in- 
dexing the very difficult phenomena being 
studied, or (3) the law which one is attempting 
to apply is not fitted for application in such 
complex situations. All these possibilities are 
to be considered when such failure to uphold 
the hypothesis occurs. 

Sears (1951) has given an example of the 
utilization that can sometimes be made of data 
which fails to verify a hypothesis. To be brief, 
he found that this failure in confirming the 
hypothesis, when considered in conjunction 
with a suggested change in the basic theoretical 
system, motivated not only the search for fur- 
ther evidence in the area which he was studying, 
but also suggested experiments to be done at 
the level of the theoretical system itself. 


SOME GENERALIZED S-R MODELS 


It remains for us to outline some of the uses 
which have been made of S-R theories in at- 
tempts to increase our understanding of social 
psychological phenomena. We shall briefly dis- 
cuss a series, by no means comprehensive, of 
such attempts, ranging from extensions of whole 
areas of theory to extensions which involve a 
very limited set of relationships. 

A basic methodological factor should be men- 
tioned at this point. Sophisticated use of S-R 
theory makes no assumption as to whether one 
must begin the analysis on the response side or 
on the stimulus side. Thus, most of the ex- 
amples which we shall use reflect the acceptance 
by the S-R theorist of response-side definitions 
of phenomena as they have been discussed by 
social psychologists who have often been of 
different theoretical persuasions. Learned mo- 
tives, attitudes, personalities, roles, neuroses, 
identifications, meanings, and clinical insights 
are entities which represent phenomena in vary- 
ing degrees of clarification and measurement. 
When a psychologist interested in the extension 
of any theory decides to grapple with a problem 
in one of these areas, he must come to terms 


with the situation that the phenomenon de- 
fined may be of a most complex nature.relative 
to any current statement of the theoretical sys- 
tem. We have noted in discussing Hull's theory 
that Brown and Farber found it necessary to 
suggest elaboration of the theory in order to 
make it relevant to certain emotional phenom- 
ena. Sometimes such extension is not necessary, 
but the investigator must often tread the careful 
path of deduction for a considerable distance in 
order to find a proposition which is consistent 
with theory and which bears upon the empirical 
phenomenon which he wishes to understand. 
Further, the extension sometimes deals only 
with certain aspects of the complicated phenom- 
enon under analysis. 

Basic definition for a social psychological re- 
search endeavor can therefore occur on the 
response side or on the stimulus side. One of 
the hallmarks of S-R theory, however, is that 
the analysis must eventually include a careful 
definition of the consequent phenomena under 
discussion as well as an explication of those 
antecedent factors which can relate to the con- 
sequent phenomena in a fashion relevant to 
the intervening variable set. In our sample of 
theoretical extensions we shall find that some 
of the researchers have barely started a con- 
sideration of the antecedent factors, as they 
have found the task of systematizing and vali- 
dating response-side measures to be a major 
research task in itself. 

Imitative behavior. Probably the most in- 
fluential recent extension of S-R analysis to 
social behavior has been the work of Miller 
and Dollard in their analysis of the ancient 
social psychological concept of imitation. Their 
book, Social Learning and Imitation, was an 
attempt to redefine the place of this concept in 
social psychology. Whereas Tarde had said, 
“All society is imitation,” Miller and Dollard 
overstate their purpose as trying to show that 
“All society is learning." An analysis of the 
kinds of behaviors which occurred when “imi- 
tation” was supposedly operating led them to a 
trichotomy of copying, same behavior, and 
matched-dependent behavior. They view these, 
in effect, as response-side phenomena derivable 
in terms of a drive-reduction theory of learning. 
Imitation, therefore, comes out of the analysis 
having shifted from its place (for Tarde) as an 
intervening variable to a position as a response 
pattern which occurs under given conditions 
as an accomplishment of the learning process. 
They do outline, however, the conditions under 
which a learned motive to imitate may develop 
in children and therefore be utilized for some 
analytic purposes as an intervening variable. 


82 HANDBOOK OF SOCIAL PSYCHOLOGY 


Matched-dependent behavior develops in a 
certain kind of interaction pattern, usually 
where a less skilled, or subordinate, person is in 
interaction with a person of higher status or 
greater skill. The subordinate person finds him- 
self gaining rewards by doing what the super- 
ordinate person does, but he learns to respond 
only to the former's acts, and not to the cues to 
which the former is responding. Sufficient repe- 
tition and reinforcement of this behavior in 
such social contexts can lead to the development 
of a learned drive to imitate. This formulation, 
with changes, has only recently begun to gen- 
erate attempts to experimentally induce learned 
motives of this class in the behavior of children, 
and to generate predictions in the behavior of 
children where experimental controis are not 
possible. (See Sears, 1950.) A recent study by 
Olds (1952) on the experimental control of 
learned drives in children is in part a resultant 
of this general conception of Miller and Dol- 
lard. 

Miller and Dollard's theory of the develop- 
ment of matched-dependent imitation will prob- 
ably be found relevant to the important task 
of developing an adequate theory for stating the 
social psychological conditions under which 
differing strengths of dependency motives are 
developed and in explaining individual dif- 
ferences which are found clinically in this area. 
The development of standardized instruments 
for the measurement of this motive are still 
under way. 

Psychotherapy. More recently Dollard and 
Miller (1951) have suggested certain syntheses 
of behavior theory and psychoanalytic theory, 
following in the tradition set by Miller (1944) , 
Sears (1943, 1944), Mowrer (1950), and Dol- 
lard and Doob, et al. (1939). They attempt to 
derive from S-R theory certain practices which 
have developed in therapy. Evaluation of such 
an attempt can be based on at least two criteria: 
(1) whether or not they have been able to place 
certain interaction procedures derived for lim- 
ited use in therapy within the context of a much 
more broad and explicit theory, and (2) whether 
or not the translation leads to further insight 
in clinical interaction. (A successful translation 
of Freud into some form of modern psychologi- 
cal language could not fail to be of advantage, 
at least in the sense that it would reduce the 
authoritarian atmosphere which so often sur- 
rounds interpretation of Freud's thought.) 
These evaluations cannot as yet be made with 
any finality, but there is little doubt that these 
formulations have led to fruitful experimenta- 
tion and to further theorizing. 

This last statement can be briefly documented 


by reference to an empirical study and a theo- 
retical exercise, Clark (1952) has recently aided 
in the task of untangling some of the factors 
involved in the complicated phenomena of guilt 
expression and the function of guilt. Using the 
theory of conflict of Dollard and Miller, Clark 
predicted that sexual drives would be more re- 
flected in TAT protocols when the subjects were 
under the influence of alcohol. He further pre- 
dicted that with those exposed to sexually stimu- 
lating pictures, but without alcohol, guilt would 
tend to reduce symbolic sexual expression below 
that found in a control group who were not 
exposed to sexually arousing stimuli. Both 
predictions were upheld, and the theory made 
possible a clarification of most of the data. 

S-R conflict theory has been long in develop- 
ment, the earlier work having been summarized 
by Miller in 1942. Miller's more recent work 
(1948) has been focused on generating, among 
other things, some of the phenomena subsumed 
under the psychoanalytic concept of displace- 
ment. Displacement phenomena are related to 
the process of generalization, and the theory 
involves making certain explicit assumptions 
about how approach and avoidance responses 
generalize from one stimulus situation to other 
similar situations when the approach response 
is inhibited by a conflicting response. As an 
example of this important kind of small model 
construction within a broader theory, let us 
look at Miller's assumptions: 


(1) that the direct response to the original 
stimulus generalizes to other similar stimuli, with 
the amount of generalization becoming smaller 
the less similar the stimuli; (2) that the response 
which conflicts with the occurrence of the direct 
response to the original stimulus also generalizes 
to other similar stimuli, becoming weaker the 
less similar the stimuli; (3) that the gradient of 
generalization of the conflicting response falls off 
more steeply with dissimilarity than does that of 
the original response which it inhibits; (4) that 
when two or more incompatible responses are si- 
multaneously excited, the one with the greatest 
net strength will be the one which will occur; 
and (5) that the net strength of a response will 
be its strength minus that of any response incom- 
patible with it which is excited at the same time. 
(Miller, 1948, pp. 167-168) 


With these assumptions Dollard and Miller 
are able to generate many practices and predic- 
tions, from the tendency among therapists to 
focus first on reducing a patient's anxiety, to 
predicting the major findings of Clark’s study. 
Others have made use of this “model” also. It 
has been used in analyzing data from children’s 
doll play (Hollenberg and Sperry, 1951), data 
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from a cross-cultural study of myths (Wright, 
1952) , data from a cross-cultural study of theo- 
ries of disease (Whiting and Child, 1953) and 
data gathered from the behavior of pigeons 
(Brush, Bush, Jenkins, John, and Whiting, 
1952). 

The above tests of the usefulness of the Mil- 
ler assumptions have led also to further theoriz- 
ing. One suggested direction of change is the 
work of Bush and Whiting (1953), which sub- 
sumes many of the suggestions of Brown and 
Farber which we have outlined above. Bush 
and Whiting have broadened Miller's formula- 
tion, taken out certain “flaws,” and have ele- 
gantly ensconced their notions in an extension 
of a “set-theoretic model" developed by Bush 
and Mosteller (1951). In terms of several cri- 
teria, this theory ranks as one of the most 
developed of its type in social science: it has 
generated and subsumed several classes of data 
within an explicit framework of assumptions, 
is open to alteration, and also makes use of a 
probability language which is possibly the kind 
of language best fitted to social and psychologi- 
cal theorizing, Future experimentation will 
reveal whether this trend continues to be valu- 
able, but in the meanwhile, Bush and Whiting 
state that: 


Any formal quantitative model of a complex 
psychological process necessarily involves numer- 
ous abstractions and simplifying assumptions and 
as a result is inadequate for describing many, 
indeed most, instances of that process. 

The price of precision is the loss of generality, 
and many psychologists may feel that the price 
is often too great. The argument hinges upon, 
it seems to us, the strategy one uses in his re- 
search, ie, whether precision or generality has 
priority. There is always hope of making a gen- 
eral theory more precise and of making precise 
theory more general, but which is easier is a mat- 
ter of faith not fact. Our faith, and apparently 
Miller's also, is in the procedure of slowly making 
more general a model with builtin precision. 
When simplifying assumptions are made, atten- 
tion is focused on the factors which make the 
phenomena complex. Generality is increased by 
examining these assumptions, one by one, and 
finding suitable substitutes. Such a mechanism for 
improvement is hard to find for increasing the 
precision of a general theory. (Bush and Whiting, 
1953, p. 261) 


Extensions to child development. One of the 
most extensive attempts at systematic extension 
of S-R theory to the development of personality 
in children has been undertaken by Sears and 
his co-workers at the University of Iowa and 
at Harvard University. Although only a small 
portion of this research has yet been published, 


it represents a broad attempt to develop instru- 
ments for the measurement of certain per- 
sonality properties of children and also the 
measurement of certain important patterns of 
antecedent variables which are suggested by the 
theories of Hull and of Miller and Dollard. 
Sears has reported the results of a portion of 
this study in his recent presidential address to 
the American Psychological Association (Sears, 
1951). He pointed out that a very low positive 
correlation was found between the antecedent 
degree of frustration of middle-class children 
during socialization and the consequent fre- 
quency of aggressive activities shown in their 
doll-play stories. When, however, the ante- 
cedent variable of punishment for naughtiness 
is taken into consideration, considerable in- 
crease in correlation with the aggression indices 
is found. Doll-play aggression frequency ap- 
pears to increase as a positive function of 
previous punishment for aggressive behavior, 
as well as the degree of frustration imposed by 
the structure of family interaction. 

One possibility for explaining these two fac- 
tors is by integrating them in a conflict theory 
liké that of Bush and Whiting (1953) or of 
that of Brown and Farber (1951). This has 
been suggested by Hollenberg and Sperry 
(1951). As a result of these researches, Sears 
has been led to emphasize the importance of 
the different social conditions under which one 
would expect a learned motive, such as aggres- 
sion, to be expressed in behavior. He found, 
for example, that both the high- and low- 
punished groups of children showed less inci- 
dence of aggressive behavior in the school situa- 
tion than did those who were punished for 
aggression to a medium degree in the home. 
The problem, therefore, for S-R theory becomes 
one of developing measures and laws for pre- 
dicting the transfer of a set of behaviors learned 
in one situation (e.g., the home) to other social 
situations such as school and doll-play situa- 
tions. After outlining an S-R theory of dyadic 
interaction, Sears suggests: 


With a conceptualization of the dyadic variables 
involved . . . it is possible to make a statement 
that goes beyond the narrow confines of these 
measuring situations. In this instance, the more 
general statements can be made that, first, the 
amount of aggression will be a relative function 
of severity of punishment; and second, with sever- 
ity of punishment held constant, the amount of 
aggression will vary positively with dissimilarity 
of the dyadic situation to the punishment situa- 
tion in the home. (1951, p. 476) 

This raises a problem which we shall discuss 
below and which Doob (1952) has recently dis- 
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cussed at some length, the problem of develop- 
ing a rationale and adequate measures for the 
“meanings” which different kinds of social situa- 
tions have for individuals. 

The research of Sears and co-workers has 
dealt with other phenomena of social psychol- 
ogy, as these phenomena appear in the sociali- 
zation of children. In company with Mowrer 
(1950) these researchers have found it necessary 
to focus attention on the mechanism of “identi- 
fication” as a means of explaining the learning 
of certain personality properties and role be- 
haviors by children where there has been no 
direct reinforcement for behaving in these ways. 
Mowrer has suggested that the sounds and 
sights of parents’ behavior take on secondary 
reinforcing value and therefore when a child 
indulges in these behaviors he is self-reinforced, 
and certain learning is thereby accomplished. 
In short, identification may well become an ad- 
ditional learned drive in S-R reinforcement 
theory and, if so, statement of the antecedent 
factors and of the consequent measures will be 
necessary. For example, does more identifica- 
tion with a parent occur when the parent’s 
occasional absence permits the child to play the 
parental role? Is such identification increased 
to the extent of taking on symbolic self-punish- 
ment mechanisms if the socialization of the child 
is delayed until parental behavior has taken on 
strong secondarily reinforcing properties? These 
are questions which are also being investigated 
by Whiting and others with cross-cultural meth- 
ods (Whiting and Child, 1953). In these ways 
S-R theory may assist in explicating this very 
important concept which the Freudians have 
done much to keep before us. 

Sears (1943) has also provided a broad sum- 
mary of the studies which can be viewed as 
bearing on certain psychoanalytic formulations. 
The objective data he presents provide a clear 
enumeration of the reasons why S-R theorists 
feel there is a need for radical reformulation of 
Freudian theory. Nevertheless, there is clear 
evidence that learning theorists have obtained 
much stimulation from Freudian thinking, 

Attitude change. S-R theorists have long been 
interested in the measurement and function of 
attitudes, but Hovland, et al (1949) have re- 
stimulated interest in the study of attitude 
change, the area where learning theory is prob- 
ably most directly relevant at present. Whereas 
most studies have shown that the effect of some 
educative effort directed toward verbal attitudes 
usually is greatest immediately after the attempt 
at change, Hovland uncovered some cases of 
"sleeper effect." Here, with some items, the 


effect became most pronounced some time after 
the educative effort. 

A stimulating set of papers has followed, at- 
tempting to unravel the conditions for a sleeper 
effect, and to place the phenomenon within a 
larger theoretical framework (Hovland and 
Weiss, 1952; Weiss, 1952; Weiss, 1953). One 
hypothesis advanced was that the subjects may 
have felt doubtful regarding the source of those 
items which later show a sleeper effect. This 
doubt would lead to temporary avoidance of 
use of the material (nonacceptance), but the 
doubt would be forgotten sooner than the ma- 
terial itself, if the material had been well 
learned. The result would be a delayed effec- 
tiveness of the communication. Hovland and 
Weiss (1952) have confirmed some of the 
implications of this hypothesis, and Weiss (1953) 
has recently set up an experiment which con- 
trolled the acquisition of the content as well 
as explicitly controlling the discounting factor, 
with similar results. In short, there has been an 
increasing control of the variables involved and 
an increase in the explicit use of S-R theory as 
the search has gone on. These researches may 
give social psychology the explicit and general 
theory of attitude development and extinction 
that has long been needed to keep up with the 
development of measurement technique, which 
has been used for more limited applied pur- 
poses. 

"Meaning." Osgood (1952) has recently sum- 
marized the general S-R attempts to study 
"meaning" with objective methods. He focuses 
his discussion on the problem of "mediating 
responses,” which have loomed so importantly 
in Guthrie's thinking and in the theorizing of 
Miller and Dollard and many others. He has 
also developed an attempt at an independent 
measure of the complex matrix of responses 
which "meaning" implies. Hunt (1952) has 
recently suggested that these developments may 
change drastically our methods of measuring 
public opinion and also our techniques of clini- 
cal diagnosis. Further, since the measures Os- 
good is developing are placed in the context of 
general SR theory, the implications of the meas- 
ures are broad and are quickly testable. This 
departure represents another attempt to obtain 
a valid measure of a complicated response-side 
Structure and then to relate it with behavior 
and, presumably, to antecedent conditions, It 
may well be one of the most important of the 
large number of recent endeavors in systematic 
generalization of S-R theory. 

Other SR theorists are interested in this same 
area but are more concerned with the develop- 
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ment of the methods of “information” analysis 
which have been summarized by George Miller 
(1951). In this article Miller also discusses 
aspects of the unpublished William James Lec- 
tures on language behavior given by B. F. 
Skinner at Harvard in 1947, and touched upon 
in his recent book (Skinner, 1953) . 

Other generalizations. This volume will attest 
in other contexts the degree to which SR con- 
cepts have been generalized in terms of limited 
hypotheses or as larger systems. Let us briefly 
list an additional sample of these attempts. 
Mitrano's (1939) extension of Hullian concepts 
to the behavior of children stands as an explora- 
tory attempt at systematic operational extension 
to which the work of Miller and Dollard (1942) 
and of Solomon and his co-workers should be 
added (Lambert, Solomon, and Watson, 1948; 
Walk, 1952; Minturn, 1952; and Olds, 1952) . 
Solomon's students have also worked on opera- 
tional extensions of the rich controversies of the 
animal laboratory to the behavior of human 
adults, and with Howes (1951), Solomon has 
developed a language response-probability the- 
ory for handling certain data in the area of the 
"new look" in perception, relating speed of 
recognition to certain "value" factors. Mowrer's 
version of a dual theory has motivated Solomon 
and Wynne (1950) to an attempt to vary the 
condition of the autonomic nervous system in 
dogs by use of drugs and surgery and to study 
the effect of these changes on avoidance learn- 
ing. Such experimentation has led to the im- 
portant suggestion that certain kinds of anxiety 
learning may be unidirectional, i.e., a fear of the 
dentist based on strong pain conditioning may 
never be completely extinguishable. Such a 
principle, if further upheld, would lead to im- 
portant practical implications in setting the 
limits on some kinds of therapy. Some practical 
suggestions for therapy have been interlarded 
in a recent report of these studies with dogs 
(Solomon, Kamin, and Wynne, 1953) . 

The outline by F. H. Allport (1942) of a 
descriptive language based upon S-R thinking 
for handling the problem of “collective be- 
havior” cannot be adequately discussed here. 
He has outlined the properties of an “event 
system" based on the individualist position 
which he has brilliantly defended for many 
years (F. H. Allport, 1924, 1942). The research 
possibilities of this model are now being worked 
out (Vallance, 1951). The conception is prob- 
ably the most inclusive theory for discussing 
broad problems of collective behavior yet ad- 
vanced by a psychologist and may provide an 
explicit denotative referent for the notion of 


collective behavior, in a sense which most so- 
ciologists and anthropologists have failed to 
do. As far as the notion of “total social system” 
is concerned, Allport suggests that such an as- 
sumption be dropped and the degree of sys- 
tematization be left for empirical discovery by 
use of his descriptive techniques. 

McClelland (1951) has presented the results 
of a long series of ingenious studies in an 
attempt to develop an objective measure of 
achievement needs in the members of several 
cultures. Work on the antecedent factors in- 
volved in the maximization of this learned drive 
has hardly begun, but work on the reliability 
and validity of the response-side measures is 
well advanced. Others are working on adequate 
measures for affiliative drives (Shipley and 
Veroff, 1952) , and work on both S-R and R-R 
analysis of aggressive drives has continued as a 
lasting interest of social psychologists since the 
publication of the frustration-aggression hy- 
pothesis by the Yale group (Dollard et al., 1939). 

J. J. Gibson (1952) has recently reanalyzed 
the autokinetic experiments in terms of a learn- 
ing theory developed for “perceptual learning” 
which has clear S-R. implications. Gibson sug- 
gests that a focus on the possibilities of speci- 
fying the conditions under which human beings 
come to respond differentially to more and more 
abstract and complex dimensions of stimuli may 
lead to less emphasis on postulating intervening 
variables. Such a suggestion may be readily 
accepted by some S-R psychologists who have 
long held that the current proliferating lists of 
intervening variables are rarely worth the 
trouble, and that some current uses of the no- 
tion of stimulus are unnecessarily vague. 


RELATION TO OTHER BEHAVIOR SCIENGES 


Learning theory has intrigued social sicentists 
who are dealing with problems of the fields of 
sociology, anthropology, education and psychi- 
atry. It may be recalled from the diagram of 
Hull's system that there is explicit reference to 
environmental factors on both the antecedent 
and the intervening area. Complication of the 
diagram can be used to represent the "situa- 
tion," social or otherwise, and the intervening 
matrix can be complicated to describe the rela- 
tively lasting effects of past experience. The 
basic machinery is thus provided for translation 
into other languages of the social sciences. Cul- 
ture, for example, has often been defined in 
terms of habituation, and “habit strength" can 
therefore be expanded to explore the notion of 
“culture strength,” and the notion of the “in- 
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dividual” in the diagram can be expanded to 
represent a “modal individual” at some stated 
level of abstraction. Any particular interpreta- 
tion into S-R theory is, of course, not necessarily 
acceptable to the original user of the concept. 

It should also be recalled that S-R theorists 
have long focused on the relatively limited 
problem of stating the conditions for change 
of structures, Since they are, therefore, not usu- 
ally ego-involved, at least by theoretical pre- 
conception, in any particular categorization of 
structures or “habit family hierarchies,” this 
opens up the system for trying out those notions 
of structure developed by other social sciences. 
The notion of “role” as a categorizing or unit 
concept has long been in use by S-R theorists, 
as have “culture,” "interaction," "meaning of 
the situation," and many other concepts of social 
science. 

Several anthropologists have found S-R theory 
useful, Murdock (1949) in his cross-cultural 
analysis of taboos has found the concept of 
stimulus generalization of value in his explana- 
tions. Kluckhohn in collaboration with Mowrer 
(1944) has provided a broad outline for the 
study of personality development in the context 
of culture. Gillen (1948) and Dollard (see Mil- 
ler and Dollard, 1942) have both found it pos- 
sible to integrate S-R concepts into the 
conception of anthropological problems. Whit- 
ing's Becoming a Kwoma (1941) makes con- 
siderable use of S-R. theory and also suggests 
some alternative lines of development of the 
theoretical system itself. Whiting, in collabora- 
tion with Child (1953), has, through the inte- 
gration of S-R theory with certain Freudian 
concepts, laid the foundation for a broad use 
of anthropological materials in the testing of 
hypotheses regarding the effects of socialization 
practices on personality development. 

Davis and Havighurst (1946) have worked 
with S-R reinforcement notions in an analysis 


of the differences between certain social classes 
in the United States. Their methods and ap- 
proach have added to the generality of some 
learning principles and possibly they have en- 
larged the concept of class by giving it a 
denotation in terms of specific kinds of interac- 
tion of child with parent which provide condi- 
tions for differential kinds of learning to occur. 
The Mead tradition in social psychology, as 
represented by such theorists as Cottrell (1942) 
and Foote (1951) , has developed independently 
of most of the S-R theories which we have em- 
phasized in this chapter. Possibilities for a 
reintegration of these disparate streams of be- 
havior theory are suggested in the role theorists’ 
reawakened interest in problems of motivation 
(Foote, 1951) and in certain clear parallels 
which are discernible in the conceptualizing of 
Cottrell (1942) on the one hand and of Sears 
(1951) and Miller and Dollard (1941) on the 
other. 

Sears (1951) has suggested, as a result of his 
active attempts to extend SH theory, that the 
dyadic framework be taken for the analysis of 
all social interaction and personality develop- 
ment problems. He refers to the fact that most 
learning theory has been stated in terms of the 
behavior of a single individual or of a “modal” 
person and that the dyadic or two-person inter- 
action concept has been relatively de-empha- 
sized. It is possible, however, as he points out, 
to derive from SR theory (as well as other 
S-R theories) such dyadic interaction since, as 
we have pointed out, S-R theory leaves consider- 
able place for the effects of the social as well 
as the physical environment. Sears feels that the 
notion of the dyad can be made more clear by 
the use of a new intervening variable which he 
symbolizes as r,, s... 

The diagram in Fig. 2 should make this con- 
cept more clear. The symbol "rs," refers to 
the fact that not only does the goal response 
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Fic. 2. The dyadic sequence with anticipatory re- 
sponses to the environmental event (r,s). (From 


Sears, 1951, p. 471) 
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fractionate and come to anticipate the occur- 
rence of the goal, but that those instrumental 
acts which have preceded the environmental 
event leading to the possibility of the goal re- 
sponse also fractionate and come to anticipate 
the overt instrumental acts themselves. There- 
fore the reference is to the fact that in social 
interaction there is usually involved the expec- 
tation on the part of alpha of certain behaviors 
on the part of beta which provide alpha with the 
possibilities of an “Rg,” or drive reduction. 
This concept appears to be quite close to Mead’s 
notion of the “expectation of the other” (Mead, 
1934) , and there is no reason why modern SR 
reinforcement theory cannot consider such sym- 
bolic expectations as one of the important 
determinants of social behavior. Further ex- 
plication of Sears’ new concept may lean upon 
hunches from the sociological side of social 
psychology, which have been developed, for ex- 
ample, by Lindesmith and Straus (1949), as 
well as by Cottrell. Although conceptual inter- 
translation is not a necessary prerequisite for 
scientific advance, it can lead to expansion of 
the energy and thought thrown upon common 
problems. It is heartening that such inde- 


pendent conceptual concurrence can occur in 
the fledgling field of social psychology. 


CONCLUSION 


We have sketched in broad, but probably se- 
lective, fashion some of the current endeavors 
to sound out the implications of the general 
stimulus-response position and have outlined 
some of the attempts to construct increasingly 
explicit hypotheses regarding human interac- 
tion. It is peculiarly difficult to encapsulate the 
S-R tradition in easy phrases, as of 1953. Change, 
broadening, greater precision, and extension are 
all the order of the day, and there is perhaps as 
much stimulating disagreement among SR 
theorists as there is between S-R theorists and 
other orientations. i 

Whatever the future of this highly cognitive, 
deeply conative but nonmentalistic psychology 
may be, there is little doubt that there is still 
much to be heard from its adherents. A tradi- 
tion is in good hands as long as there remains 
an active searching for defining data, experi- 
mental control, precise thought, and, perhaps 
above all, vulnerability of formulation. 


REFERENCES 


ALLPORT, F. H. Social psychology. Cambridge: 
Houghton Mifflin, 1924. 

ALLPORT, F. H. An event-system theory of collective 
action: with illustrations from economic and po- 
litical phenomena and the production of war. 
J. soc. Psychol., 1940, 11, 417-445. 

ArrronT, F. H. Methods in the study of collective 
action phenomena. J. soc. Psychol., 1942, 15, 165- 
185. 

ALLPORT, G. W., & VERNON, P. E. Studies in expres- 
sive movement. New York: Macmillan, 1933. 


BrnGMANN, G. Psychoanalysis and experimental psy- 
chology: a review from the standpoint of scientific 
empiricism. Mind, 1943, 52, 122-140. 

BERGMANN, G., & SPENCE, K. W. The logic of psycho- 
physical measurement. Psychol. Rev., 1944, 51, 
1-24, 

Bocustaysky, G. W. Interruption and learning. 
Psychol. Rev., 1951, 58, 248-255. 

Bocustavsky, G. W. A mathematical model for con- 
ditioning. Paper read at E.P.A., Boston, 1953. 
Brocpen, W. J. Animal studies of learning. In S. S. 
Stevens (Ed.), Handbook of experimental psy- 

chology. New York: Wiley, 1951. Pp. 568-612. 

BRONFENBRENNER, U. Personality. In C. P. Stone 
(Ed.) , Annual review of psychology. Vol. 4. Stan- 
ford: Annual Reviews, Inc., 1953. Pp. 157-182. 

Brown, J. S., & Farser, I. E. Emotions conceptual- 
ized as intervening variables — with suggestions 


toward a theory of frustration. Psychol. Bull., 
1951, 48, 465-495. 

Bruner, J. S., & Goopman, C. C. Value and need as 
organizing factors in perception. J. abnorm. soc. 
Psychol., 1947, 42, 37-39. 

Bruner, J. S., & Postman, L. Emotional selectivity 
in perception and reaction. J. Pers., 1947, 56, 
69-77. 

Bruner, J. S., & Postman, L. Perception, cognition 
and behavior. J. Pers., 1949, 18, 14-31. 

Brusy, F. R., Busu, R. R, Jenkins, W. O., JOHN, 
W. F., & Wurrisc, J. W. M. Stimulus generaliza- 
tions after extinction and punishment: an experi- 
mental study of displacement. J. abnorm. soc. 
Psychol., 1952, 47, 633-640. 

Busu, R. R., & MosteLLER, F. A model for stimulus 
generalization and discrimination. Psychol. Rev., 
1951, 58, 413-423. 

Busu, R. R., & Wurrinc, J. W. M. On the theory of 
psychoanalytic displacement. J. abnorm. soc, Psy- 
chol., 1953, 48, 261-272. 


Cuirp, I. I, & WATERHOUSE, I. K. Frustration and 
the quality of performance: I. A critique of the 

* Barker, Dembo, and Lewin experiment. Psychol. 
Rev., 1952, 59, 351-362. 

Cap, R. A. The projective measurement of ex- 
perimentally induced levels of sexual motivation. 
J. exper. Psychol., 1952, 44, 391-399. 


88 HANDBOOK OF SOCIAL PSYCHOLOGY 


Corrrett, L. S. The analysis of situational fields in 
social psychology. Amer. sociol. Rev., 1942, 7, 
370-382. 


Davis, A., E Havicuurst, R. J. Social class and color 
differences in child-rearing. Amer. sociol. Rev., 
1946, 11, 698-710. 

Dewey, R., k Humper, W. J. The development of 
human behavior. New York: Macmillan, 1951. 
Dinsmoor, J. A. The effect of hunger on discrimi- 
nated responding. J. abnorm. soc. Psychol., 1952, 

47, 67-72. 

DoLLARD, J., Doos, L. W., Mmes, N. E., Mowrer, 
O. H., & Sears, R. R. Frustration and aggression. 
New Haven: Yale Univ. Press, 1939. 

Dorrarn, J., & Mirer, N. E. Personality and psycho- 
therapy. New York: McGraw-Hill, 1950. 

Doos, L. W. Public opinion and propaganda. New 
York: Holt, 1948. 

Doos, L. W. Social psychology. New York: Holt, 
1952. 


Fer, W. K. Toward a statistical theory of learning. 
Psychol. Rev., 1950, 57, 94-107. 


Fret, Heynen. Operationism and scientific meth- 
od. Psychol. Rev., 1945, 52, 250-259. 

Foote, N. N. Identification as the basis for a theory 
of motivation. Amer. sociol. Rev., 1951, 16, 14-91. 

French, J. R. P., Jr. Organized and unorganized 
groups under fear and frustration. Authority and 
frustration: studies in topological and vector psy- 
chology III. Univ. Ia. Stud. Child Welfare, 1944, 
20, 231-307. 


Ginson, ELEANOR J. Intra-list generalization as a 
factor in verbal learning. J. exper. Psychol., 1942, 
32, 185-200. 

Ginson, Exranor J. The role of shock in reinforce- 
ment. J. comp. physiol. Psychol., 1952, 45, 18-30. 
Ginsow, J. J. Social perception and the psychology 

of perceptual learning. In press. 

Guten, J. The ways of men. New York: Appleton- 
Century-Crofts, 1948. 

Guru, E. R. The psychology of learning. New 
York: Harper, 1935. 

Gurari, E. R. Psychology of human conflict. New 
York: Harper, 1937. 

Gurnee, E. R. Personality in terms of associative 
learning. In J. McV. Hunt (Ed.), Personality and 
the behavior disorders. New York: Ronald Press, 
1944. Pp. 49-68. 

Gurumm, E. R. The psychology of learning. Rev. 
Ed., New York: Harper, 1952. 

Gurnee, E. R., & Horton, C. P. Cats in a puzzle 
box. New York: Rinehart, 1946. 


Hess, D. O. The role of neurological ideas in psy- 
chology. J. Pers., 1951, 20, 39-55. 

Hucarp, E. R. Theories of learning. New York: 
Appleton-Century-Crofts, 1948. 

Hirearp, E. R. Methods and procedures in the study 
of learning. In S. S. Stevens. (Ed), Handbook of 


experimental psychology. New York: Wiley, 1951. 
Pp. 517-567. 

Hottenserc, ELEANOR, E SerRRv, MARGARET. Some 
antecedents of aggression and effects of frustration 
in doll play. J. Pers., 1951, 1, 32-43. 

Homans, G. C. The human group. New York: Har- 
court, Brace, 1950. 

Hovrawp, C. I. Human learning and retention. In 
S. S. Stevens (Ed.), Handbook of experimental 
psychology. New York: Wiley, 1951. Pp. 613-689. 

Hovrawp, C. I, LUMSDAINE, A. A., & SHEFFIELD, 
F. D. Experiments on mass communication. 
Princeton: Princeton Univ. Press, 1949. 

HovrANp, C. I., & Weiss, W. The influence of source 
credibility on communication effectiveness. Publ, 
Opin. Quart., 1952, 15, 635-650. 

Hutt, C. L. Principles of behavior. New York: 
D. Appleton Century, 1943. 

Hutt, C. L. Behavior postulates and corollaries — 
1949. Psychol. Rev., 1950, 57, 173-180. 

Hutt, C. L., Hovrawr, C. I., Ross, R. T., Harr, M., 
Perkins, D. T., & Frrcn, F. B. Mathematico-deduc- 
tive theory of rote learning. New Haven: Yale 
Univ. Press, 1940. 

Huwr, J. McV. Psychological services in the tactics 
of psychological science. Amer. Psychologist, 1952, 
7, 608-622. 


Jenkins, W. O., & SnerriELD, F. D. Rehearsal and 
guessing habits as sources of the spread of effect. 
J. exper. Psychol., 1946, 36, 316-330. 

Jenkins, W. O., & Stantey, J. C. Jr. Partial rein- 
forcement: a review and critique. Psychol. Rev., 
1950, 47, 193-234. 


Kimete, G. A. Conditioning as a function of the 
time between conditioned and unconditioned 
stimuli. J. exper. Psychol., 1947, 37, 1-15. 

KLEIN, G. $, E Krecu, D. The problem of per- 
sonality and its theory. J. Pers., 1951, 20, 2-23. 
Kocu, Sicmunp. The logical character of the motiva- 
tion concept. L, Psychol. Rev., 1941, 18, 15-38. 


LAMBERT, W. W. Acquisition and extinction of in- 
strumental response sequences in children and in 
rats. Unpublished doctoral dissertation, Harvard 
University, 1950. 

Lampert, W. W., SOLOMON, R. L., & Watson, P. O. 
Reinforcement and extinction as factors in size 
estimation, J. exper. Psychol., 1949, 39, 637-641. 

Lawrence, D. H. The transfer of a discrimination 


along a continuum, J. comp. physiol. P. chol., 
1952, 45, 511-516. gu e 


Leeper, R. W. A motivational theory of emotion to 


replace "emotion as disorganized response.” Psy- 
chol. Rev., 1948, 55, 5-21. 


Leeper, R. W. Book review — Essentials of be- 
havior by Clark L. Hull. Amer, J. Psychol., 1952, 
65, 478491. 

Lewin, K. A dynamic theory of personality. New 
York: McGraw-Hill, 1935. 

Linpesmrru, A. R., & Strauss, A. L. Social psychol- 
ogy. New York: Dryden, 1949, 


STIMULUS-RESPONSE CONTIGUITY AND REINFORCEMENT THEORY 89 


McCLetianp, D. C. Personality. New York: Wil- 
liam Sloane, 1951. 

Macron, R. B. The phenomenological approach 
to social psychology. Psychol. Rev., 1947, 54, 193- 
210. 

McReynotps, Paur. Logical relationships between 
memorial and transient functions. Psychol. Rev., 
1950, 57, 140-144. 

Man, N. R. F. Frustration: the study of behavior 
without a goal. New York: McGraw-Hill, 1949. 
Marx, M. H. Psychological theory. Contemporary 

readings, New York: Macmillan, 1951. 

Mean, G. H. Mind, self and society. Chicago: Univ. 
Chicago Press, 1934. 

MzznuL, P. E. On the circularity of the law of effect. 
Psychol. Bull., 1950, 47, 52-75. 

MILLER, G. A. Language and communication. New 
York: McGraw-Hill, 1951. 

Miter, N. E. Experimental studies of conflict. In 
J. McV. Hunt (Ed.), Personality and the behavior 
disorders, New York: Ronald, 1944. Pp. 431-465. 

Miter, N. E. Theory and experiment relating psy- 
choanalytic displacement to stimulus-response gen- 
eralization. J. abnorm. soc. Psychol, 1948, 43, 
155-178. 

Miter, N. E. Comments on theoretical models il- 
lustrated by the development of a theory of con- 
flict. J. Pers., 1951a, 20, 82-100. 

Miter, N. E. Learnable drives and rewards. In 
S. S. Stevens (Ed.), Handbook of experimental 
psychology. New York: Wiley, 1951b. Pp. 435- 
472. 

MILLER, N. E, & DOLLARD, J. Social learning and 
imilation. New Haven: Yale Univ. Press, 1941. 
Minturn, L. Latent learning and discrimination re- 
versal in human discrimination learning. Paper 
read at Eastern Psychol. Assn., Boston, April, 1953. 

Mirrano, A. J. Principles of conditioning in human 
goal behavior. Psychol. Monogr., 1939, 51, No. 4. 

Moore, O. L., & Lewis, D. J. Learning theory and 
culture. Psychol. Rev., 1952, 59, 380-388. 

MosrxLLEn, F., & Busu, R. R. Paper read at Brown 
University Colloquium, January, 1950. 

Mosretter, F., & Busu, R. R. A mathematical model 
for simple learning. Psychol. Rev., 1951, 58, 313- 
323. 


Mowrer, O. H. A stimulusresponse analysis of 
anxiety and its role as a reinforcing agent. Psychol. 
Rev., 1939, 46, 553-565. 

Mownzn, O. H. On the dual nature of learning — 
a reinterpretation of "conditioning" and “prob- 
lem-solving.” Harvard educ. Rev., 1947, 17, 102- 
148, 

Mowrer, O. H. Learning theory and personality 
dynamics. New York: Ronald Press, 1950. 

Mowrer, O. H. Two-factor learning theory: sum- 
mary and comment. Psychol. Rev., 1951, 58, 350- 
354. 

Mownzn, O. H. Secondary reinforcement in two-fac- 
tor learning theory. Unpublished manuscript, 
1952. 

Mowrer, O. H., & Ktucknouw, C. Dynamic theory 
of personality. In J. McV. Hunt (Ed.) , Personality 


and the behavior disorders. Vol. 1. New York: 
Ronald Press, 1944. Pp. 69-135. 

Munpock, G. P. Social structure. New York: Mac- 
millan, 1949. 

Murray, H. A. Explorations in personality. New 
York: Oxford Univ. Press, 1938. 


Nowuts, V., & NowLis, HELEN H. Child Psychology. 
In C. P. Stone (Ed.) , Annual review of psychology. 
Vol. 3. Stanford: Annual Reviews, Inc., 1952. 
Pp. 1-28. 


Ops, J. The acquisition of motives. Unpublished 
doctor's dissertation, Harvard Univ., 1952. 

Oscoop, C. E. The nature and measurement of 
meaning. Psychol. Bull., 1952, 49, 197-237. 


Parsons, T., & Suits, E. A. Values, motives and sys- 
tems of action. In T. Parsons & E. A. Shils (Eds.), 
Toward a general theory of action. Cambridge: 
Harvard University Press, 1951. Pp. 47-275. 

PavLov, I. P. Conditioned reflexes. (Transl. by 
G. V. Anrep). London: Oxford Univ. Press, 1927. 

Postman, L. The history and present status of the 
law of effect. Psychol. Bull., 1947, 44, 489-563. 


Quxzwzm, E. L. Introduction to social psychology. 
New York: Sloane, 1951. 


Scnrossknc, H. The relationship betwen success and 
the laws of conditioning. Psychol. Rev., 1937, 44, 
379-394. 

SCHLOSBERG, H., & SoLoMoN, R. L. Latency of re- 
sponse in a choice discrimination. J. exper. Psy- 
chol., 1943, 33, 22-39. 

Sears, Pautzne S. Level of aspiration, academically 
successful and unsuccessful children. J. abnorm. 
soc. Psychol., 1940, 35, 498-536. 

Sears, R. R. Survey of objective studies of psycho- 
analytic concepts. New York: Social Science Re- 
search Council, 1943. 

Sears, R. R. Experimental analysis of psychoanalytic 
phenomena. In J. McV. Hunt (Ed.) , Personality 
and the behavior disorders, Vol. 1. New York: 
Ronald, 1944. Pp. 306-332. 

Sears, R. R. Ordinal position in the family as a 
psychological variable. Amer. sociol. Rev., 1950, 
15, 397-401. 

Sears, R. R. Social behavior and personality de- 
velopment. In T. Parsons & E. A. Shils (Eds.), 
Toward a general theory of action. Cambridge: 
Harvard Univ. Press, 1951. Pp. 465-478. 

Sewarp, J. P., & Levy, N. Sign learning as a factor 
in extinction. J. exper. Psychol., 1949, 39, 660-669. 

Suerrievn, F. D. & Rosy, T. B. Reward value of a 
non-nutritive sweet taste. J. comp. physiol. Psy- 
chol., 1950, 40, 287-298. 

Suerriet, F. D, Wurrr, J. J, & Backer, R. Re- 

* ward value of copulation without sex drive reduc- 
tion. J. comp. physiol. Psychol., 1951, 44, 3-8. 

Suiptey, T. E, & Verorr, J. A projective measure 
of need for affiliation. J. exper. Psychol., 1952, 
13, 349-356. 


D 


90 


Skinner, B. F. The behavior of organisms. New 
York: Appleton-Century-Crofts, 1938. 

Skinner, B. F. The operational analysis of psycho- 
logical terms. Psychol. Rev., 1945, 52, 270-277. 

SxiNNER, B. F. Are theories of learning necessary? 
Psychol. Rev., 1950, 57, 193-216. 

Skinner, B. F. Science and human behavior. New 
York: Macmillan, 1953. 

Smiru, S. The psychological origin and treatment 
of enuresis. Seattle: Univ. Washington Press, 1948. 

Situ, W. I, & Ross, S. The social behavior of 
vertebrates: a review of the literature (1939-1950) . 
Psychol, Bull., 1952, 49, 598-627. 

Sotomon, R. L. The influence of work on behavior. 
Psychol. Bull., 1948, 45, 1-40. 

Sotomon, R. L., & Howes, D. H. Word frequency, 
personal values, and visual duration thresholds. 
Psychol. Rev., 1951, 58, 256-270. 

SorowoN, R. L., Kamin, L. J., & Wynne, L. C. 
Traumatic avoidance learning: the outcomes of 
several extinction procedures with dogs. J. ab- 
norm. soc. Psychol., 1953, 48, 291-302. 

Sotomon, R. L., & Wynne, L. C. Avoidance condi- 
tioning in normal dogs and in dogs deprived of 
normal autonomic functioning. Amer. Psycholo- 
gist, 1950, 5, 264. 

Srence, K. W. The methods and postulates of “be- 
haviorism." Psychol. Rev., 1948, 55, 67-78. 

Spence, K. W. Theoretical interpretations of learn- 
ing. In S. S. Stevens (Ed.), Handbook of experi- 
mental psychology. New York: Wiley, 1951. Pp. 
690-729. 

Spence, K. W. Mathematical formulations of learn- 
ing phenomena. Psychol. Rev., 1952a, 59, 152-160. 

Spence, K. W. Clark Leonard Hull: 1884-1952. 
Amer. J. Psychol. 1952b, 65, 689-646. 

Stevens, S. S. (Ed) Handbook of experimental psy- 
chology. New York: Wiley, 1951. 

Stroup, J. B. Psychology in education. New York: 
Longmans, Green, 1946. 


HANDBOOK OF SOCIAL PSYCHOLOGY 


TTHonNpikE, E. L., et al. An experimental study of 
rewards. Teach. Coll. Contr. Educ., 1933, No. 580. 

Tinsercen, N. The study of instinct. Oxford: 
Clarendon Press, 1951. 

TorMaw, E. C. Purposive behavior in animals and 
men. New York: Appleton-Century-Crofts, 1932. 
Torman, E. C. A psychological model. In T. Par- 
sons & E. A. Shils (Eds.) , Toward a general theory 
of action. Cambridge: Harvard Univ. Press, 1951. 

Pp. 279-364. 


VarrANcE, T. R. Methodology in propaganda re- 
search. Psychol. Bull., 1951, 38, 82-61. 


Wats, R. D. Effect of discrimination reversal on 
human discrimination learning. J. exper. Psychol., 
1952, 44, 410-419. 

Watson, J. B. Psychology as the behaviorist views it. 
Psychol. Rev., 1913, 20, 158-177. 

Weiss, W. A study of a “sleeper” effect in opinion 
change. Unpublished doctor's dissertation, Yale 
University, 1952. 

Weiss, W. A “sleeper” effect in opinion change. J. 
abnorm. soc. Psychol., 1953, 48, 173-180. 

Was, C. T., & ScHLosBERG, H. Degree of condi- 
tioning of the GSR as a function of the period of 
delay. J. exper. Psychol., 1952, 43, 357-362. 

Wuitine, J. W. M. Becoming a Kwoma. New Ha- 
ven: Yale Univ. Press, 194]. 

Wonne, J. W. M. A brief theory of purposive ac- 
quired drives. Unpublished manuscript, 1948, 
Wnrrisc, J. W. M., & Cmo, I. L. Child training and 
personality. New Haven: Yale University Press, 

1953. 

Wuer, G. C. Projection and displacement: a cross- 
cultural study of the expression of aggression in 
folktales. Unpublished doctoral dissertation. Har- 
vard University, 1952. 


CHAPTER 3 


Cognitive Theory 


MARTIN SCHEERER* 
University of Kansas 


Cognitive theory might be expected to deal 
with the problem of how man gains information 
and understanding of the world about him, and 
how he acts in and upon his environment on 
the basis of such cognitions. Such expectations 
need not carry epistemological meaning; cog- 
nitive theory could still be a legitimate branch 
of psychology as an empirical science whose 
task is to discover the processes which underlie 
cognitions, to determine the conditions for their 
occurrence, and their function in behavior. 
Disagreements regarding the psychological laws 
of cognition would then reflect natural family 
quarrels to be resolved by experimental test. A 
closer inspection of contemporary systems in 
psychology presents, however, a different pic- 
ture. Not only does cognition play a less note- 
worthy role in many of their formulations, but 
it also fails to appear as a commonly shared 
problem. Influential schools seem even to ques- 
tion whether cognitive processes have to be pos- 
ited at all in a scientific psychology. This seems 
to be particularly the case in modern behavior 
theory and in kindred stimulus-response ap- 
proaches. Here cognition is neither a psycho- 
logical fact nor a problem for investigation; nor 
does it become a postulated behavior variable. 
For those, however, whose views are oriented 
around holistic, "molar" or personalistic prin- 
ciples, cognition constitutes a major problem 
area and also an essential explanatory construct 
(cf. the approach to social psychology by Krech 
and Crutchfield, 1948) . 


COGNITION AS AN ISSUE 


The primary issue, therefore, seems not to be 
how cognition plays a role in behavior but 
whether cognition plays a role in behavior. This 
formulation might lead to the objection that it 
creates artificial issues and straw men where only 
different methods, interests, and objectives pre- 
vail. Spence (1950, 1951) for one, has increas- 


*My sincere thanks go to Dr. Beatrice Wright, 
Aaron Hershkowitz, and Dr. Lee Meyerson for their 
patient and constructive criticism of the wording of 
this chapter. 


ingly insisted that the difference between S-R 
and Cognitive Theory is chiefly one of emphasis. 
The first focuses on observable or hypothetical 
S-R connections, the second on reported or in- 
ferred perceptual organizations as mediators 
between stimulus patterns and behavior. Thus, 
Spence compares S-R theory with Perception or 
S-S theory, which emphasizes Sign-Significate 
concepts. It could also be argued that some of 
the recent studies on communication, though 
chiefly emphasizing S-R aspects, do measure cog- 
nition (G. Miller, 1953). Information is here 
determined by the number of possible response 
outcomes, e.g., telling a child to find a cookie 
by lifting one of sixteen boxes delimits the re- 
sponse to sixteen possible outcomes; telling him 
the cookie is under one of four boxes on the left 
reduces the sixteen to four alternatives. Infor- 
mation is defined "'probabilistically" by the 
number of reduced alternatives necessary to 
control the number of possible outcomes. Im- 
portant as these studies are for measuring in- 
formation in terms of message (S) and behavior 
(R), they do not actually deal with cognition as 
a psychological problem. Thus, information 
“is not synonymous with 'meaning' . . . the 
amount does not specify the content, value, 
truthfulness, exclusiveness, history, or purpose 
of the information. . . . Nothing is said about 
whether the information is understood or be- 
lieved" (G. Miller, 1953, pp. 3, 4). 

Even if such differences in approach were 
taken to be merely matters of emphasis, the 
question still remains why one approach so 
studiously avoids semantic reference to cogni- 
tive factors while another seems to consider it 
indispensable. Is it because cognition hypoth- 
eses are often couched in apparently introspec- 
tive terms? 

The influence of operationism and behavior- 
ism. The rising prominence of neopositivism 
might be thought responsible for the above 


"differences in emphasis" between S-R and Cog- 
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nitive Theory. In physics, Bridgman's recogni- 
tion that concepts such as length or distance do 
not mean the same when used in different sys- 
tems led to the method of "operationism" 
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which stipulates that the observational or ex- 
perimental operation should be specified in the 
definition of every concept. In this sense con- 
cepts do not denote absolute or existential prop- 
erties but are constructs of the observer. Stevens 
(1939) has most eloquently championed this 
method for the discipline of psychology; its 
propositions can have validity only if anchored 
in concrete operations which can be performed, 
are repeatable and public. This rule might have 
fostered a tendency to interpret reference to 
cognition as reference to "private" data of ex- 
perience. Thus it has been charged that such 
terms as insight, understanding, conceptual 
thinking, phenomenal field, and self-image are 
“mentalistic,” i.e., contaminated with the sub- 
jectivity of consciousness. 

Yet operationists have not concurred in a com- 
mon verdict against "phenomenological" de- 
scription. Its use in terms of verbal report is 
legitimate according to Boring (1945) and 
others, e.g., when subjects discriminate between 
stimuli. Stevens holds that even " ‘mentalistic’ 
concepts like percept, image, and idea can be 
operationally defined." And Spence (1944), 
though skeptical of the predictive value of the 
phenomenological method, accepts its use of 
verbal reports about perceptual contents, for 
instance in the experiments on figural after- 
effects by Kóhler and Wallach. These kinds of 
“perceptual responses” can lead to “response- 
inferred" theoretical constructs such as life- 
space. On the other hand, there is still an un- 
settled debate over the question of how to deal 
in operationism with the basically private mode 
of direct experience in its unverbal, preverbal, 
or verbal aspects (Bridgman, 1945; Skinner, 
1945; Pratt, 1945a, b). This uncertainty might 
account for the constraint in the research 
interests of operational psychologists who have 
rarely concerned themselves with clinical and 
social problems. But it does not explain their 
self-imposed restrictions in the use of cognitive 
constructs within other problem areas which are 
less person-centered as, for example, sense per- 
ception or animal and human learning. The 
notable exception of postulated "cognitive maps 
in rats and men" in Tolman’s "operational be- 
haviorism" demonstrates that such intervening 
variables in learning need not be objected to 
on operational grounds. 

Operationism as a method, therefore, was not 
the exclusive source of resistance to a more 
universal inclusion of cognitive factors in the 
making of psychological theory. Rather, this 
resistance among psychologists dates back to the 
impact of Watson's Behaviorism. In the "Zeit- 
geist" of Positivism and Reflexology, psycholog- 


ical data became more and more limited to the 
observable facts of overt behavior. Only when 
questions of consciousness and thinking could 
be translated into motor terms was their status 
as legitimate scientific problems recognized. 
Paradoxically, as Hilgard (1948) pointed out, 
it was those students of the doctrine of “associa- 
tion of ideas" who became the advocates of 
equating ideas and consciousness with motor 
activity. These tenets prepared the ground for 
the ready acceptance of Neo-Positivism, which 
tended to anchor operational methods in physics 
and mathematics as the foundation for a “unity 
of science.” The influence of Neo- or Logical- 
Positivism is reflected in the modern behavior 
theories of Hull and Skinner. Both formulate 
physical stimulus-response relationships either 
in terms of mathematical or empirical func- 
tions. Basically both attempt to make their con- 
cepts synonymous with concrete operations, in 
conformance with the new concept of the “unity 
of science." 

Boring (1945) aptly summed up this entire 
historical development and the common philo- 
sophic directive of Neo-Behaviorism and Neo- 
Positivism in the following statement: “All posi- 
tivism from Mach to Bridgman has stressed the 
advantage for clarity of reducing dubious con- 
cepts to empirical ultimates — to the data of 
observation (Mach) or the processes of observa- 
tion (Bridgman) ,” (p. 244). For most SR 
theorists, empirical ultimates are defined as be- 
havior data linked to associative neural connec- 
tions. Thus, empirical ultimates came to mean 
physical response units, and the processes of 
observation, physical measurements. This iden- 
tification made cognition, because it eludes 
physical concepts, a rather “intangible” prob- 
lem. Cognition theorists are no less empirically 
minded in the search for empirical ultimates. 
But for them these ultimates include phenom- 
enal data as long as they are tied to objective 
procedures, Whether or not such data of direct 
experience correspond to the underlying proc- 
esses, and whether or not they are ultimately 
reducible to accompaniments of nerve impulses, 
they are held to be a primary source of informa- 
tion. It is granted that the examination of di- 
rect experience cannot be the sole method of 
psychological investigation. Phenomenological 
description is nevertheless considered a neces- 
sary step in the study of such problems as the 
physiological correlates of emotional contents. 
This step seems also necessary for the discovery 
of “rationalization,” where the discrepancy be- 
tween introspective report and actual behavior 
is crucial, Asch's comment on the functional 
relationship between experiencing and acting 
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typifies the entire position: “To limit investiga- 
tion to the observation of action alone would be 
to ignore the paramount fact that the actor is 
constantly registering in awareness what is hap- 
pening to him and that this alters his subsequent 
acts" (1952, p. 68). 


PREMISES DIFFERENTIATING S-R AND COGNITIVE 
THEORY 


A further methodological difference between 
representative S-R and cognitive theorists con- 
cerns the selection of empirical ultimates as to 
their complexity — whether physical or non- 
physical. The S-R advocates reason that it 
would be "premature" to study the more com- 
plex from the start; one can deal later with 
complex phenomena by treating them as a com- 
bination of the simpler. S-R connections are 
selected for study because they are considered 
the simplest behavior units. Most cognition ad- 
vocates, on the other hand, question the ade- 
quacy of beginning with simple S-R connections. 
They reason that it would be “premature” to 
base explanatory concepts on such connections 
because these have been isolated from the total 
complex behavior. It is asserted that behavior 
is characterized by organization, and it is 
doubted that a theory of total behavior patterns 
can be based upon a mere combination of pre- 
viously isolated simpler ones. A qualitative 
analysis of the perceptual aspects of the stimulus 
and of the response in their total setting should 
precede quantification. It is further asserted 
that cognition is a part of behavior organiza- 
tion; therefore, it also plays a role in deter- 
mining the S-R units which cannot be dealt with 
by themselves, 

The holistic or molar aspect of organization 
has been made a critical focus of theory and 
method by such men as Stern (1938), G. W. All- 
port (1937), Goldstein (1939), Angyal (1941), 
Tolman (1932), MacLeod (1947), Sloane (1945), 
Leeper (1951), and, of course, the founders of 
the Gestalt School. They have pointed out sev- 
eral relevant preconceptions which have con- 
tributed to the development of S-R theories 
without the postulation of cognitive factors. 
Not all of these preconceptions have been made 
explicit. They become apparent when S-R con- 
structs (reflex, drive, habit, etc.) are scrutinized. 
Each of the aforementioned authors has ana- 
lyzed one or another of these premises. Their 
criticisms of these preconceptions can be sum- 
marized in the following statements, without 
regard for their historical origin or order. 

Reductionism: the "attempt to explain a com- 
plex interrelated whole in terms of its simpler 
elements or parts, or in terms of elements be- 


longing to a lower level of phenomena" (Sloane, 
1945, p. 217). (This takes on different forms, 
which Sloane considers to be the fallacies of 
isolation, constants, mathematical summation, 
origin, and of metaphor or analogy.) 

The atomistic reductive bias: the belief that 
something small is more fundamental than 
something large (MacLeod, 1947). 

The genetic bias: the belief that that which is 
genetically early is more fundamental than that 
which occurs later (MacLeod, 1947) . 

Peripheralism: "a viewpoint or faith that 
holds that psychological processes fundamen- 
tally may be conceived as relatively simple or 
mechanical links between peripheral events or 
receptor stimulations at one end and effector 
activities at the other end. It recognizes the 
existence of nervous processes . . . but it is con- 
vinced that the apparent complexity of the nerv- 
ous processes must ultimately be reducible to rel- 
atively simple connections" (Leeper, 1946, p. 3). 

The reinforcement bias: the assumption of 
specific primary drives or derived needs which 
activate behavior; that no learning occurs with- 
out the reduction of these drives or needs (G. W. 
Allport, 1937; Goldstein, 1939). 

Clearly, these preconceptions overlap and are 
interrelated. When taken together they reflect 
an elementarism which, according to cognitive 
theorists, is incompatible with (he: concept of 
organization that includes cognition as a central 
factor. As Allport expressed it, "Addiction to 
machines, rats, or infants leads us to overplay 
those features of human behavior that are peri- 
pheral, signal-oriented, or genetic. Correspond- 
ingly, it causes us to underplay those features 
that are central, future-oriented, and symbolic" 
(1947, p. 190) . In addition, Allport speaks here 
of a "machine model" that is being followed by 
psychologists. This may be exemplified by the 
“robot” paradigm of Boring (1946) to which 
he tried to assimilate human behavior, and by 
Hull's (1943, p. 384) definition of the organism 
as a “more or less successfully self-maintaining 
mechanism” — mechanism being defined as 
“physical aggregate.” 

The foregoing critical formulations of S-R 
premises have, in turn, been criticized as ex- 
pressing bias by attaching emotional values to 
certain methods rather than attaching value to 
scientific objectivity (Marx, 1951; Spence, 1948). 
What is more crucial, however, is that these for- 
mulations bear on major issues involving cog- 
nitive theory — issues that have arisen in actual 
research. In what follows these issues will be 
discussed in the context of three problem areas: 
molar-molecular, conation and cognition, cog- 
nitive development and genetic reductionism. 
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MorAR-MOLECULAR 


Tolman (1932) introduced the terms molar 
and molecular to contrast the descriptive units 
of his behaviorism with those of Watson. Wat- 
son's approach is called molecular because it 
defines behavior in terms of its strict under- 
lying physical and physiological details (i.e., 
receptor-conductor-effector processes per se). 
For Tolman, behavior is an emergent phenome- 
non that has descriptive properties of its own, 
namely, molarity qua behavior-acts as whole- 
responses. All molar properties of behavior acts 
must have physiological correlates that will have 
to be discovered. But it is claimed that at the 
present state of our knowledge behavior acts 
have qualities different from the physiological 
facts which make them up, and therefore cannot 
be deduced directly from such molecular ele- 
ments. At the same time, Tolman pointed out 
that Watson himself did not remain consistent 
in his molecular definition of stimulus and re- 
sponse. On the one hand, Watson defined the 
stimulus in terms of light waves, sound waves, 
or gaseous particles, etc., and the response in 
terms of smooth or striped muscle twitches. 
On the other hand, he also had to introduce 
such concepts as “situation” on the stimulus side 
and "act" on the response side, e.g., talking, 
writing a letter, or bricklaying. Since these 
concepts are really molar, Watson “dallied with 
two different notions of behavior, though he 
himself has not clearly seen how different they 
are” (Tolman, 1932, p. 6). For Watson, situa- 
tions and acts were, of course, reducible to a 
complex group of stimuli and responses. But, 
as Tolman argues, the respective integrations 
present properties not reducible to a mere enu- 
meration of the elements, e.g., of movements 
qua movements of an “act.” 

Elaborating the molar conception in terms of 
observable acts, Tolman developed "objective" 
criteria of purposive behavior which may be 
summed up in the following propositions: (1) 
Behavior is a type of commerce with the en- 
vironment that can take place only in a whole 
organism and not in specific sensory and mo- 
tor segments which are insulated each from the 
other. Molar behavior thus requires mutual 
interconnections between all parts of the or- 
ganism. (2) The complete identification of a 
single behavior act requires an account of “the 
getting to or from a specific goal-object.” (8) 
This involves the selection of appropriate 
means-objects or shortest routes relative to the 
direction and distance of the goal-object. The 
last two propositions lead to the concept of 


docility which corresponds to the observed fact 
that while the organism persists through trial 
and error in reaching the goal, it benefits from 
such trials in that it can select and retain the 
more efficient means to that end. 

On this basis cognition and purpose become 
"objectively definable" behavior variables. Cog- 
nition is defined as the initial orientation to- 
wards a goal-object in terms of distance, direc- 
tion, and valence character, and as the selection 
of a means-object or path in terms of its suitabil- 
ity for reaching the goal. Cognition is further 
definable as the behavioral change upon attain- 
ing the goal and as the variation of behavior 
if the goal does not prove to have the initially 
perceived valence, Correspondingly, purpose 
becomes objectively definable by the fact of con- 
tinuously differing reactions with an invariant 
terminal direction as, for example, in detour 
behavior. Inasmuch as the foregoing propo- 
sitions are interrelated, purpose and cognition 
are considered to be complementary. Subse- 
quently, Tolman (1936) accommodated his 
system to operational thinking by defining cog- 
nition as an "intervening variable," a variable 
which mediates between the stimulus situation 
(the independent variable) and the resultant 
behavior act (the dependent variable) . 

Independently, contemporary authors have 
also employed molar principles, although they 
referred to molarity by using other distinctions 
or terms (Krech and Crutchfield, 1948; Stern, 
1938; Allport, 1937; Lewin, 1935; Goldstein, 
1939). Goldstein has been in the forefront in 
emphasizing the holistic dependency of all be- 
havior acts, including the simplest reflex. He 
has, moreover, asserted more explicitly than 
has Tolman that the concept of organismic part- 
whole relationship involves two factors, The 
first is the hierarchical organization of func- 
tions. Cognitive processes are considered to be 
a more differentiated function on a higher level 
than most motor behavior. They are therefore 
assigned the role of a central determinant in 
behavior, particularly in the human organism. 
The second factor is that cognition can become 
independent of purpose qua concrete action 
goals, so that it can represent the environ- 
ment more broadly and more abstractly, e.g. 
through concepts. "Molar" psychologists also 
differ with Tolman in his original equating of 
molecular events with physical correlates of 
behavior. They have introduced models of 
physiological and neural processes which are 
not molecular but have molar characteristics 
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themselves. Thus Goldstein (1939) applies the 
concept of figureground to the excitation 
spread in the nervous system; Lashley (1942) 
speaks of excitation ratios and directional pat- 
terns of neural processes; the Gestaltists’ theory 
of isomorphism postulates brain field processes 
of physical Gestalt character; and Krech (1950) 
adopts the concept of “open systems" advanced 
by the biologist Von Bertalanffy (1950). Krech 
hypothesizes unitary dynamic systems in the 
brain field which underlie the “phenomenal 
pluralism" of perception, emoting, and think- 
ing. The common denominator for all cog- 
nitive psychologists seems, nevertheless, to be 
the acceptance of the tenet that behavior or- 
ganization is molar and that cognition is an 
indispensable ingredient of this organization. 

Objections have been raised that the concept 
molar has been applied too loosely, so that its 
criteria overlap and cover a multiplicity of 
meanings. In their critique Littman and Rosen 
discuss seven such meanings and suggest that 
the terms molar and molecular be abandoned, 
since "it does not seem possible to purge them 
of their confusing connotations” (1950, p. 65) . 
According to their criticism, it is not made suf- 
ficiently clear whether molar refers to the quan- 
titative or the qualitative nature of events, i.e., 
to the size or kind of units. A case in point is 
the difficulty of deciding where Hull's system 
belongs in the molar-molecular framework, a 
difficulty also mentioned by Hilgard (1948, p. 
82ff) . It is argued, moreover, that the criterion 
of molarity is applied both to total behavior 
and to level of organization. Thus, the cri- 
terion of new emergent qualities which cannot 
be derived from or reduced to simpler elements 
has been the identifying feature of total be- 
havior; it has also been used to identify higher 
levels in relation to lower levels within total 
behavior, and so forth. Finally Littman and 
Rosen express concern about the tendency to 
make the descriptive criteria of molarity “exis- 
tential” properties instead of “instrumental” 
criteria of empirical method. No issue can be 
taken with this last epistemological reservation, 
but it actually has no bearing on the crucial 
issue of the usefulness of the molar-molecular 
distinction. 

Each of the above criticisms seems to result 
from failure to recognize or to formulate sys- 
tematically the criteria of behavior organiza- 
tion. First, as Littman and Rosen themselves 
point out, there is no longer a cogent reason 
to dichotomize physiological and behavior 
phenomena by positing the first as molecular. 
We have already referred to the attempted ap- 
plication of molar constructs to physiological 


events. Secondly, the criticism that quanti- 
tative and qualitative criteria of the descriptive 
units are not distinguished in the use of “mo- 
lar" stems from a preconceived alternative 
which misses the very nature of whole-part or- 
ganization. Cognitive theorists do not derive 
molarity from the size of the descriptive unit, 
but derive its "size" from the organizational 
structure or qualitative context to which the 
unit belongs. In speech, for example, the first 
word “do” spoken in the phrase “How do you 
do?" has a part (molecular) function, whereas 
a bride's "I do" at her wedding ceremony has 
a molar function. In ordinary conversation 
neither the listener nor the speaker registers 
lip movements in the forming of words, where- 
as for a lip-reading deaf person, they are molar 
events. Furthermore, in a neurological frame 
of reference, the movement sequences of the 
speech musculature may be defined, following 
Lashley, as an expression of a "generalized 
schema" of action which predetermines "the 
sequence of specific acts” (1951, p. 122). 

From these paradigmatic instances, it be- 
comes clear that the molar-molecular distinc- 
tion varies with the context or the organiza- 
tional level selected for study. In many scientific 
disciplines the shifting frames of reference for 
these distinctions have been well accepted; viz., 
in one case the astronomer focuses on lawful 
regularities in the motion of planets without 
concern about their molecular structure, in an- 
other case he may focus on this structure, for 
example, through spectroanalysis. In percep- 
tion Koffka has illustrated this dependency of 
units on the selection of the level of analysis. 
A triangle composed of dots constitutes à whole 
(molar) in relation to its sides, the sides in turn 
are wholes in relation to the dots. Size is here 
conceived as the relation between the enclosing 
and the enclosed units, ie. as a functional 
whole-part relationship. In this connection 
Angyall (1941) has given four possible divi- 
sions of a whole such as a tree: (1) a random 
piecemeal cutting leading to fragments; (2) a 
division according to previously fixed prin- 
ciples extraneously imposed, e.g., dividing the 
tree into inch cubes; (3) a division by abstrac- 
tion of distinguishable properties, e.g. color, 
weight, etc.; (4) a division according to struc- 
tural articulation, following the structural 
and functional whole-part organization, eg., 
branches, roots, capillaries, leaves, etc. (cf. also 
Goldstein, 1939) . 

The emphasis on proximal or distal determi- 
nation. Psychologists have defended elemen- 
tarism by claiming that total concrete events 
are never repeated exactly, but only certain 
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features of them. Therefore even the holist 
has to abstract and fractionate in order to ar- 
rive at uniformities or laws (Spence, 1948). 
Which of the isolated features lead to concepts 
that are significant depends, according to the 
elementarists, on the pragmatic success in the 
discovery of laws. Spence therefore feels that 
one can make use either of “a proximal or dis- 
tal account of the stimulus situation, and be- 
havior may be described either in terms of 
movements (muscular patterns) or in terms of 
gross achievements," depending upon which 
kinds of laws one prefers to find (1948, p. 71) . 
Heider (1939) defines the distal and proximal 
account of action and perception in similar 
terms, yet he maintains that whether proximal 
or distal events are made the focus of the de- 
termination is crucial for psychological theory 
(p. 391). 

Spence's position presents two problems 
for consideration. The first is that not all 
proximal regularities are representative of the 
underlying organizational determinants. Thus 
Kóhler in his critique of traditional introspec- 
tive analysis has pointed out that it can destroy 
the functional dependency of perceptual parts, 
for example in the use of the reduction screen, 
with the result that a certain brightness or form 
is considered the “true” regularity of the retinal 
image as a proximal event. In contrast to this 
isolating procedure the holists attempt to de- 
termine the genuine parts of wholes versus 
"fictitious" elements. The second problem is 
whether the proximal account of the stimulus 
situation can be consistently carried through 
without contaminating this account with distal 
reference. 

In S-R theory the tendency is to place the 
focus of environmental determination proxi- 
mal in space and time and to avoid distal 
determinants. This “tendency to glorify the 
skin” leads to a quasiproximal rigor. One in- 
advertently substitutes stimulus concepts which 
refer to distal objects or determinants. Kóhler 
(1947) speaks of the "experience error" where 
the psychologist describes stimuli in terms of 
the experiential distal object, though his in- 
tention is to describe proximal stimuli. Heider 
(1939) speaks of a disguised focus and analyzes 
three such typical cases in S-R theory. 


The first he borrows from Tolman's critique of 
the conditioned response formula. The two stimuli 
and the two responses are not stimuli and responses 
in any strict physiological sense, but are environ: 
mentally defined affairs. Thus the visual, tactual, 
and olfactory stimuli such as food and string will 
differ from occasion to occasion and remain identical 
only because of their environmental meanings. The 


responses of eating and string-pulling “do not cor- 
respond to specific and invariant sets of muscle con- 
tractions but are only identifiable through successive 
times in terms of environmentally nameable ‘manipu- 
lations' " (p. 396). This involves a distal account. 

A second case is Skinner's description of the rat 
pressing the lever. Although the interest here is to 
arrive at proxima] stimulus response determinants, it 
is admitted that “the number of distinguishable acts 
on the part of the rat that will give the required 
movement of the lever is indefinite and very large." 
"Therefore, instead of the intended proximal events 
there is substituted by Skinner "a class which is suf- 
ficiently well defined by the phrase ‘pressing a 
lever." This phrase denotes distal determination. 

Heider's third example concerns Hull's account of 
the habit family hierarchy which is to “provide the 
basis for a purely physical theory of knowledge." 
The problem is: Habits can vary in situations in 
which there is more than one action sequence that 
will lead to the attainment of a particular goal or 
subgoal. The solution: There is a habit family hier- 
archy based on past learning of many different action 
sequences, for example, of different paths in dif- 
ferent mazes. What makes the different action se- 
quences belong to one family and thereby inter- 
changeable is an identical element, namely, the same 
goal reaction as a "fractional anticipatory goal reac- 
tion." Heider asks whether these goal reactions are 
identical from the point of view of proximal deter- 
mination. From the position of the locomoting 
organism, e.g, a rat in a maze, each detour within 
the repertoire of the family hierarchy follows proxi- 
mally different S-R action sequences. Therefore, 
proximally viewed, the end place of each detour in 
that repertoire cannot be identical with the learned 
end place in the current maze. Hull does not take 
this into consideration, and declares that the antici- 
patory goal reactions are identical when the inter- 
changeable paths terminate in a goal which is in the 
same environmental place. The identical element is 
thus distally determined, although the theory is bascd 
on proximal S-R. sequences. 


The proximal-distal issue has been dealt with 
in such detail because of its deeper implications, 
some of which Heider points out. Insofar as 
S-R theory attempts to explain behavior with- 
out cognitive factors, it is faced with these basic 
problems; (1) to establish a system of proximal 
determination independent of distal environ- 
mental determinants; (2) to derive from this 
system the observed fact that there are behavior 
acts which correspond to the objects, namely, 
the distal stimuli in the environment; (3) that 
this relevant, object-adequate behavior is not 
due to cognitive commerce with the object but 
only appears relevant and has no intrinsic rela- 
tion to the properties of the distal stimuli. The 
last problem is usually solved by a hypothesized 
process of environmentally imposed “selection,” 
as Heider calls it. From an infinite number 
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of possible associative bonds or conditioned 
reflexes between any stimuli and any responses, 
only certain bonds will be favored in experi- 
ence. This position is apparently shared by 
reinforcement theorists and Ames (1951) in his 
experiential purposivism. 

A further consequence of the quasi-proximal 
rigor is reflected in the conceptualization of 
social behavior. The definition of attitude ad- 
vanced by Doob (1947) and based on Miller 
and Dollard includes hypotheses such as these: 
Attitudes are similar to other learned implicit 
cues or drive-producing responses along a gen- 
eralization gradient. An attitude has cue value 
in the same sense as counting the number six 
is a response to the cue of five, and in turn 
acts as a cue to elicit seven. Thus an attitude 
can act as a stimulus in that it evokes learned 
anticipatory goal responses which have the 
effect of tension reduction or reward. The dis- 
tal references which are here implicit but not 
accounted for are concealed in the assumed 
learning processes on which the attitudes are 
based. The learning as well as the evocation of 
the attitude by cues must have a perceptual and 
contextual reference. Chein (1948) therefore 
raises this question: Is it the perceptual re- 
sponse to the proximal cue which is tension 
reducing or is it the content of the perception? 
The perceptual content of what leads to reward 
in relation to persons or objects, e.g., to ap- 
proval or prestige, transcends the presumed 
proximal “stimulus patterns.” Thus cognitive 
processes qua distal references are concealed 
behind proximal determinants. 


WHAT IS A STIMULUS COGNITIVELY CONSIDERED? 


From the cognitive point of view, cognition 
begins with perception. Koffka (1935) asks 
why things looks as they do, and goes on to say, 
“Gf things look as they do because they are what 
they are, then perception would not contain in 
its very makeup a cognitive problem” (p. 78) . 
Inspection of any dotted outline figure demon- 
strates that there is a tendency toward unit 
formation, towards organized togetherness. This 
patterned continuity does not correspond to the 
stimuli themselves inasmuch as they are dis- 
crete points in objective space. Perceptually, 
however, the dots belong together as parts of 
the contour of a figure. Usually two adjacent 
spatial regions share a continuous physical bor- 
der. Perceived contour, however, always has a 
“one-sided” directional quality; it does not out- 
line the shape of the ground but of the figure; 
it encloses and separates the figure area against 
a surrounding field seen as unformed ground 


which extends behind the figure. These and 
other countless observations of perceptual unit 
formation have led to the conclusion that our 
senses do not directly mirror the physical ob- 
ject in the "geographic environment.” Per- 
ceptual unity does not exclusively depend upon 
object unity. The dotted configuration or ap- 
parent motion is a classical case in point. Con- 
versely, object unity is not always mirrored in 
the same perceptual unity. Camouflage color- 
ing can destroy the real and create perceptually 
different units. It seems true, then, "that things 
are what they are does not explain why they 
look as they look” (Koffka, 1935, p. 78). The 
Gestaltists have been among the first to insist 
that "geographic" objects cannot be directly 
responded to as “they are,” nor act directly as 
stimuli. As in most Gestalt theory, this thesis 
is expounded in the realm of vision. 


When I see a table, this table qua table does 
not affect my senses at all; they are affected by 
processes which have their origin in the sun or an 
artificial source of light, and which are only modi- 
fied by the table before they excite the rods and 
cones in our retinae. "Therefore, these processes, 
the light waves, and not the geographical objects, 
are the direct causes of our perceptions . . . (Koff- 
ka, 1935, p. 79) 


We thus have to recognize a distinction be- 
tween the physical objects, i.e., “distal stimuli," 
and the direct cause of our perception, the 
proximal stimuli which impinge on local re- 
ceptors. 'The proximal stimuli, e.g., light rays, 
air particles, belong to what Heider (1927) 
called the "medium" which is interposed be- 
tween the distal object and the sense organ. 
The medium possesses neither organized inte- 
gration nor segregation into specific units. “The 
waves of light . . . do not contain as such the 
slightest indication of the fact that some are 
reflected by parts of one physical object and 
others by objects in its environment" (Kóhler, 
1947, p. 161). It is true that to a continuous 
distal object there corresponds an equally con- 
tinuous area on the retina. Yet the elements 
within this retinal image are qua stimuli “func- 
tionally as independent of one another as any 
one of them is of an element outside the image" 
(Kohler, 1947, p. 162) . We have, then, the para- 
dox that our percepts must depend on proximal 
stimulation, since we cannot perceive the dis- 
tal objects directly, yet it is these objects that 
determine our orientation and behavior. More 
concretely, when we inspect a planet through 
a telescope, do we perceive the light waves, 
ie, the proximal stimuli, or the distant 
planet directly? Actually, we see neither, we do 
not see stimuli but see because of stimuli, What 
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we perceive is an organized pattern occasioned 
by the distribution of proximal stimuli, a pat- 
tern which happens to have some correspond- 
ence to the distal physical object — The 
phenomenal thing, e.g., the planet, results from 
a process within the organism which may be 
called representation of distal objects. The se- 
quence of events may be summarized as follows: 
distal source (physical object) , medium, distri- 
bution of proximal stimuli, sensory organiza- 
tion, response to the product of organization. 
'The product of organization is the phenomenal 
unit formation which represents the distal ob- 


t. 

This formulation leads to the question as to 
what determines the organizational character 
of a percept. It has been dealt with by the 
Gestaltists in two ways: first, they postulate a 
dynamic self-distribution of nervous excitations 
which is triggered off by the proximal stimuli. 
This culminated in Kóhler's theory of iso- 
morphism, which assumes a formal correspond- 
ence between ` brain-field patterns and 
phenomenal patterns; in the last analysis the 
latter are believed to derive from the former. 
In other quarters of psychology, the isomorphic 
hypothesis is not held essential for the ac- 
ceptance of perceptual organization as a fact. 
Although the data on neural correlates of per- 
ception favor a molar interpretation, other 
"nonmosaic" alternatives to strict isomorphism 
have been entertained, e.g., in Lashley's theory. 
Second, the Gestaltists examine the proximal 
stimulus conditions themselves as to their pos- 
sible relevance for perceptual patterning. In so 
doing, Kóhler concludes: "Although the local 
stimuli are mutually independent, they exhibit 
formal relations such as those of proximity and 
similarity" (1947, p. 167). It is clearly implied 
that the Wertheimer factors give rise to per- 
ceptual unit formation. The organizational 
response is induced by the relations among the 
independent local stimuli in terms of similarity, 
proximity, common fate, and good continua- 
tion. On this basis the phenomenal characteris- 
tics of the perceptual field have been further 
explored and experimentally substantiated. 
The well-known demonstrations of figure- 
ground articulation by Rubin and others, of 
whole-part dependency in terms of units and 
subunits, of enclosing and enclosed structures, 
have been almost universally accepted. The 
cognitive significance of these perceptual prin- 
ciples may be summed up in the proposition 
that phenomenal organization is a cohesive 
structured field and that the units in this field 
represent distal objects of the geographic en- 
vironment. 
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A further question concerns the behaviorally ob- 
served fact of relative correspondence between the 
representation of the distal object and the physical 
object itself. How does this take place? It cannot be 
due to a oneto-one correspondence between distal 
and proximal stimuli or between proximal stimuli 
and cognitive representation. It is well known that 
for each distal stimulus there exists practically an 
infinite number of proximal stimuli — a fact well 
attested to by the work of Ames (1951). The seat 
of a chair remains rectangular in its perceived shape, 
although its retinal image is rarely so, The form of a 
plate appears constant, although its retinal image is 
nearly always distorted into an oval. We know that 
a man thirty feet away produces a smaller retinal 
image than at three feet, but the perceived size 
remains the same. We are referring, of course, to the 
phenomena of size and form constancy, which con- 
tradict the “traditional” constancy hypothesis that 
to every local stimulus change a corresponding 
change in the sensation occurs. We know, moreover, 
that a change in distal stimuli, even if reflected in 
proximal stimulus variation, may not lead to a 
corresponding change in the percept. A white sur- 
face under low illumination may reflect less light 
into the eye than a black surface under strong illu- 
mination, yet the white surface may continue to look 
white. These well-investigated phenomena led to the 
conclusion that changes in proximal stimulation, 
regardless of whether they are accompanied by 
changes in the distal stimulus object, need not pro- 
duce corresponding changes in the appearance of 
the phenomenal thing. The phenomenal thing is 
conservative to the extent that it preserves largely 
the constancy of the real physical object despite 
variations in proximal stimuli. It has also been 
recognized that the phenomenal thing can undergo 
alteration without variation in either the distal or 
proximal stimuli. Examples of this are optical illu- 
sions and ambiguous figures in which the percept 
can be reorganized under the influence of internal 
conditions, e.g., learning, acquired set, analytic atti- 
tude, sensory fatigue, and satiation effects. 


In repudiating the traditional constancy hy- 
pothesis of a strict correspondence between 
local stimulus and sensation the theory has to 
postulate that something of the physical object 
is preserved which makes, on the whole, for its 
adequate representation. For one thing, the 
“mosaic” of retinal projection, through con- 
vergent refraction, for example, duplicates to a 
large degree the geometric relations among the 
various points of the surface of the distal object. 
This is possible because the proximal stimulus 
distribution, i.e., the reflected light rays, follow 
these formal geometric relations along the 
object (Kohler, 1947, p. 167). In the words of 
Koffka, the perceptual representation of an ob- 
ject “is a dynamic map of the distant stimulus, 
when and inasmuch as the proximal stimulus 
distribution possesses such geometric charac- 


teristics as will produce a psychophysical or- 
ganization similar to the one of the distant 
stimulus object" (1935, p. 659). To explain 
the phenomena of size, form, and color com- 
stancy which seem to militate against this 
assumption, the concept of "invariance" is in- 
troduced, Invariance refers to two factors. 
First, that the visual response organization is 
oriented towards the total stimulus field so that 
a given part of the field will always be re- 
sponded to in relation to the rest of the field. 
Second, that this relationship will tend to re- 
main invariant. For example, in the case of 
the constancy of a white surface under low 
illumination, with further light reduction the 
white surface will still tend to appear white 
rather than gray, while the illumination of the 
surroundings will begin to appear darker. In- 
variances of this kind have been investigated 
by Wallach (1948). Boring (1952) has dealt 
with invariances in perception from an opera- 
tional point of view. Stevens (1951) places the 
constancy phenomena in relationship to the 
problem of transformation, in which certain 
relations remain invariant. Cassirer (1944b) , 
in a brilliant discussion of the pertinent psycho- 
logical data, related these constancies to the 
theory of groups, again stressing invariant 
functions in the transformation. The contro- 
versy between experience advocates and the 
Gestaltists has not been resolved in regard to 
this point. The latter, however, insist that the 
invariance ultimately rests on the fact that no 
local stimulation can determine the correspond- 
ing excitation by itself, but only in connection 
with the totality of the stimulus field. This 
implies that the dynamics of visual processes — 
whatever their neural characteristics are — can- 
not be based on point-for-point local excita- 
tions. These dynamics must be based on 
"translocal" functions of excited receptor areas 
and translocal nervous processes. 

A noteworthy attempt to deal with the prob- 
lem of invariances has been advanced by Bate- 
son (1951). In line with Gestalt theory, he 
derives the phenomenal thing from figure- 
ground processes of reconstructing external 
relations. Bateson asserts that external events 
are different from internal events and that a 
translation from the former to the latter must 
take place. He treats this translation with the 
concept of codification, which is defined as the 
substitution of one type of event for another. 
For an event to be codified two conditions must 
be fulfilled. First, codification must be sys- 
tematic. "Whatever objects or events or ideas 
internal to the individual represent certain ex- 
ternal objects or events, there must be a sys- 
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tematic relationship between the internal and 
the external, otherwise the information would 
not be useful” (p. 168). 
has to preserve the relationships for which it 
substitutes, “While it is imposible for a man 
to have inside himself a tree corresponding to 
the external tree that he perceives, it is possible 
to have internal objects or events so related 
to each other that their relations reflect relation- 
ships between parts of the external tree". (pp. 
169-170). This is a novel approach to the 
problem of cognition in terms of information 
theory: representation becomes synonymous 
with codification. 

It follows from the foregoing analysis that 
there is not one answer to the question, "What 
is a stimulus?," but that we have to distinguish 
at least three. From the point of view of im- 
mediate psychological reality the stimulus is 
the phenomenal "thing" as it represents the 
geographic object — whether faithfully or not. 
From the point of view of the underlying and 
mediating processes the stimulus is a total, 
proximal field distribution as it elicits trans- 
local excitation patterns in the central nervous 
system. From the point of view of the geo- 
graphic environment, the stimulus consists of 
those physical units as they affect the organism 
through the medium of proximal stimulus rela- 
tions. Cognition on these sensory levels may 
then be defined as the perceptual representa- 
tion of distal objects; it takes the form of 
phenomenal organization which is centrally in- 
terposed between the source of stimulation and 
the behavioral adjustment. Consequently, cog- 
nitive representation is not determined by the 
“stimulus” alone but always in connection with 
the organizational and other response disposi- 
tions characteristic of the person. 

Thus cognitive theory differs from S-R mod- 
els on several counts. First it neither makes 
proximal factors the focus of determination 
nor does it introduce distal factors in disguise. 
Second, the theory does not derive the organ- 
ism's coordination to the geographic world from 
associations between individual stimuli and spe- 
cific responses. Third, it includes a distal de- 
termination in its conception without omitting 
reference to the mediating processes. It pro- 
poses a field-determination which is neither 
distal nor proximal alone, but includes both 
these factors as aspects of the total field deter- 
mination. 


"PERCEPTUAL REPRESENTATION AND PHENOMENAL 
FIELD IN OTHER THEORIES 

We are now prepared to examine the exten- 
sion of the representational character in phe- 
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nomenal organization to other areas of 
cognition. First, it seems apropos to sample the 
manner in which the configurational tenets 
have been assimilated by other contemporaries, 
both with and without S-R leanings. For the 
Gestaltists the total field, in the widest sense, 
consists of the geographic environment and the 
psycho-physical organism within it. Only a 
portion of this total field is psychologically rep- 
resented; this is the phenomenal (or psychologi- 
cal) field of direct experience or awareness. It 
includes distal representation and the phenome- 
nal self. The behavioral environment, or life 
space, is more inclusive than the phenomenal 
field, and extends beyond momentary awareness. 
In turn, the behavioral environment corre- 
sponds only to that segment of the wider geo- 
graphic environment that affects behavior. 

Hull (1945) has acknowledged configura- 
tional precepts with his concepts of “stimulus 
patterning” and "afferent interaction." The 
latter is peripheral in nature, not really in- 
volving central processes. According to Hil- 
gard, "the postulate of neural interaction . . . 
represents an honest recognition that the 
problem of patterning in perception is a genu- 
ine one. The postulate does not solve the 
problem .. ." (1948, p. 82). It relegates the 
issue to the nervous system, where every com- 
bination of afferent impulses can produce 
patterns, but the theory fails to specify in what 
way and under which conditions. 

Hebb (1949) grants phenomenal organiza- 
tion insofar as he accepts án unlearned "primi- 
tive unity" as the “distinctiveness of figure from 
background." Such a visual boundary reaction 
even occurs in rats reared in darkness (p. 20, 
fI). Distal representation is thus envisaged. 
In fact, cues from the distant environment are 
shown to govern primitive space perception 
more than nearby objects (p. 92 ff). Learning 
and "nonsensory factors," however, are called 
for in the discrimination of forms, i.e., pattern 
differentiation as “identifiability.” The learn- 
ing presumably consists of a visual tracing along 
the figure boundary and a corresponding build- 
up of neural cell assemblies in the brain. Hebb 
supports this hypothesis by drawing on von 
Senden’s (1932) case reports of congenitally 
blind patients. Postoperatively, they seemed to 
identify shapes through learning. This in- 
terpretation has been challenged by Michael 
Wertheimer (1951), who re-examined yon Sen- 
den's data for such evidence. It is significant 
that Hebb's theory, in contrast to a peripheral 
connectionism, does allow for configurational 
and central factors of representation. 

Gibson (1950, 1951) adopts figure-ground ar- 
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ticulation in his phenomenological analysis of 
the visual world, but derives it from tridimen- 
sional “solid” vision. In explaining the linkage 
between phenomenal space and the distal 
stimuli, however, he considers it unnecessary to 
invoke a mediating sensory organization. The 
latter “becomes gratuitous” because the proxi- 
mal stimulus distribution and the distal object 
have properties which are strictly correlated 
(1950, p. 216, 61 ff.). A specific order of light 
energy gradients on the retina, rather than the 
formed image, is coordinated to the surface and 
depth characteristics of the physical object. 
Such "ordinal stimulation" is used to account 
for direct visual form perception as well as 
the constancy phenomena. This leads to a dis- 
tinction between the "literal visual world" 
which is formed by the ordinal stimulation, and 
the "schematic visual world" which is a cen- 
trally mediated representation. The latter con- 
sists of the abstractions and meanings which 
the individual imposes upon his literal per- 
ceptions. The schematic view is synonymous 
with cognitive structuring, which is attained 
through learning and preverbal conceptualiza- 
tions. It culminates in verbal symbolism (1951, 
p. 102 f£). 

Brunswik (1936, 1952), although objecting 
to the "thing language" in psychology, sub- 
scribes to distal representation consisting of 
patterned cues and "sign Gestalten." Cognition 
is treated as the functional achievement of at- 
taining objects or final goal effects through 
perceptually guided behavior. From his eco- 
logical studies on perceptual constancy Bruns- 
wik formulates a probabilistic approach to the 
"gross organismic coming-to-terms with the 
environment," "Cognitive correctness" must be 
defined in “terms of over-all statistical correla- 
tion between classes of variables" instead of 
single judgment measures (1952, p. 23). A 
"psychology in terms of objects" is proposed, 
i.e, an empirical inventory of the kinds of goal 
effects and the probability of their attainment. 
Brunswik's system differs from Gestalt theory 
in that mediation problems are not considered 
essential, and the details of actual sensory, 
nervous, or motor phenomena are deliberately 
disregarded. "One and the same means-object 
may be represented at different times by very 
different stimulus configurations. And one and 
the same goal may be reached equally well by 
very different kinds of movements and means- 
object manipulations" (1936, p. 125). 

Brunswik (1952) spelled this out by present- 
ing a lens model of molar behavior. The distal 
stimulus is called the initial focal variable, and 
the accomplished result the terminal focal 
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variable. The mediation between the two varies 
according to the circumstances and is therefore 
“multiple” or “vicarious” or “interchangeable” 
in terms of a family hierarchy of cues and hab- 
its. The cognitive terminal focus can remain 
substantially invariant. Perception, because of 
its probability character, becomes a "quasi- 
rational system" and the lens paradigm a con- 
ceptual tool for "the operational definition of 
meaning" (1952, p. 20 f£). 

Other authors also retain the phenomenal 
field concept without following Gestalt theory 
in its specific tenets of mediation. For example, 
the postulate that "the relationship between the 
behavioral and the physiological field is cru- 
cial" (Koffka, 1935, p. 53) has not been adopted 
by Lewin. Using the concepts of the life space 
and psychological field, he formulated his field 
theory. But he did not include in his system 
mediating physiological processes, which he 
considers among "alien factors." Lewin devel- 
oped principles of behavior in which cognitive 
representation was objectively dealt with (cf. 
Chapter 4). 

Snygg and Combs (1949) lean heavily on 
the concept of the phenomenal field and the 
characteristics of figure-ground organization. 
However, these authors advance a different con- 
ception of life space. It is not an objective 
construct, as in Gestalt and field theory. The 
phenomenal field which is posited coincides 
with subjective awareness. Aside from con- 
sciousness, no other psychological events are 
deemed necessary. The problem of unconscious 
processes or of a behavioral environment ex- 
tending beyond awareness is relegated to 
hypothetical degrees of conscious clarity. Pre- 
diction has to be based upon the understanding 
of the phenomenal field of the behaver, as per- 
ceived by him at that moment. This under- 
standing can be accomplished by a method of 
inference or reconstruction of the behaver's 
field which must include his “phenomenologi- 
cal needs" and "the projection of the future 
field." The observer does this by asking typi- 
cally, "Under what circumstances would I have 
done that?" Here the subjectivity and unique- 
ness of the representations in the person's 
phenomenal field is the paramount material for 
the psychologist. All influences outside the 
phenomenal field belong to the realm of an 
"objectivistic" psychology which is rejected. 
Reference to the organism as a biological sys- 
tem and to the geographic environment is 
avoided. The total field "“. . . is made up of the 
individual's meanings of events past, present, 
and future" (p. 248). The person's behavior 
is therefore completely determined by his "pri- 
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vate" world and conscious frame of reference. 
"This poses the problems of how each phenome- 
nal field can correspond to reality: how it can 
have access to other persons' private worlds, 
and how commonality with others is possible. 
The problems are not solved by postulating 
an objective geographic or a real behavioral 
environment that is shared by all human organ- 
isms. The answer is given in two parts: (1) 
There is some connection or relative corres- 
pondence between the phenomenal fields of 
different persons which is based upon com- 
monly shared experiences in a given culture 
or subculture (p. 108). (2) "From a phenome- 
nological point of view ‘objective’ facts are 
derived from the phenomenal fields of a num- 
ber of observers" (p. 249 n). These facts 
"represent the least common denominator" of 
the differing phenomenal fields (l.c). The 
individuals world therefore consists of the 
"meanings" which the "objective facts" have 
for him. The core concept of this system is the 
phenomenal self. It “is the only frame of 
reference which the individual possesses" (p. 
79). The maintenance and enhancement of 
this self is the "prime objective of the indi- 
vidual's existence." 

The theory, then, establishes the psychologi- 
cal realities by borrowing from Gestalt and field 
theory the organizational characteristics of cog- 
nitive representation. The role of cognition is 
markedly tied to an egocentrism of perspective 
and is placed within a framework of cultural 
relativism. For the upholder of phenomeno- 
logical tradition this method would qualify as 
a rather idiosyncratic version. He may wonder 
about the procedure of establishing the non- 
subjective reference of experience (e, "objec- 
tive reality") via a consensus reached through 
nose counting of phenomenological fields. And 
he could doubt whether the proposed method 
of inference is truly phenomenological, since 
Snygg and Combs reject introspection by the 
behaver as an instrument of reconstruction. 
Introspection, they believe, distorts and modi- 
fies the phenomenal field, whose "indistinct 
characteristics . . . cannot be verbalized and 
therefore cannot be communicated except by 
inference from their behavior correlates" (p. 
36). 

MacLeod (1951, 1947) follows more closely 
in the footsteps of the Gestalt school and Hus- 
serl's early theorizing. He extends phenomeno- 
logical technique to the sphere of social psy- 
chology. Psychological phenomenology is not 
a school or a system but an approach. “A phe- 
nomenon is by definition an appearance. Psy- 
chological phenomenology is the attempt to 
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suspend biases and to observe and describe 
faithfully that which appears” (1951, p. 226) . 
“The phenomenological question is simply, 
‘What is there?, without regard to Why, 
Whence, or Wherefore” (1947, p. 194). The 
method is claimed to be objective. Without 
previous assumptions it describes what is given 
in direct experience — things as they are. 
Moreover, phenomenology is assigned the 
propaedeutic task of defining psychological 
data, deciding which are important for theory, 
and setting the initial problems. From there 
on, scientific constructs and explanations 
should be built which may even include physi- 
cal correlates. This differs from the finality 
of Snygg and Combs’ subjectivistic premises and 
their procedure of inferential reconstruction. 
For MacLeod, too, perceptual representation is 
the point of departure, inasmuch as he accepts 
the structure of the phenomenal field as given, 
and the content as including things and per- 
sons in relation to the behaver. The relation- 
ship is not solely subjective in nature. To 
speak of the phenomenal world as though it 
were only a private world tends to create con- 
fusion. “There is nothing in phenomenological 
analysis which requires us to believe that the 
self is the only, or even the main, point of origin 
of vectors” (1951, p. 236) . The category "mine" 
characterizes only some of the components of 
the phenomenal world. “The bulk of the 
phenomena are not mine at all, any more than 
they are yours. You and I observe and describe 
the same object . . . for each of us there are 
some phenomena of a private or subjective 
nature but together we live in a public world, 
a world which contains people as well as things" 
(p. 237). The danger of confusion, it may be 
reasoned, arises as follows: One begins with the 
undisputed fact that ultimately all experience 
depends upon an organism, a person. Direct 
(or immediate) experience is therefore only 
subjective, and objective reference can be estab- 
lished only indirectly or by inference. Con- 
sequently, private and immediate experience 
become synonymous, The phenomenal field has 
no directly given objectivity, ie, objective 
contents or reference. Operationists have not 
made things clearer when most of them label 
direct experience as private and at the same 
time admit that all science ultimately rests on 
this mode of experience (Boring, Pratt, Skin- 
ner, Feigl, 1945) . MacLeod no less than the 
Gestaltists insists that the very nature of dircct 
experience itself has subjective and objective 
reference; it already has a discriminable public 
and private character. To paraphrase Mac- 
Leod, there is, for example, an undeniable 
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difference between color and pain. Usually the 
color is “out there” and a property of an ob- 
ject. It remains there in my absence. It is not 
a property of me. In contrast, the pinprick on 
my hand is mine; it is felt in me and not in the 
pin. I cannot push it out, making it inde- 
pendent of me. Objectivity and subjectivity 
are phenomenal facts of the same order and 
validity as the attribute of brightness or of 
loudness. They are “properties of an organized 
perceptual field in which the points of refer- 
ence are selves (subjects) and objects . . ." 
(1947, p. 200). From a cognitive point of view, 
the self and the object are part of the same 
perceptual field. However, the self is an ever- 
present anchorage point, with the emotional 
quality of selfhood feelings. Only under par- 
ticular circumstances (eg. primitive or early 
developmental stages, or border states of con- 
sciousness) does the phenomenal field lose this 
dual representational reference of self and not- 
self. 

If one accepts this analysis, serious doubts 
arise concerning the taboo placed on direct 
experience as unreliable material for psychol- 
ogy. If the immediately given already contains 
discriminable objective and subjective charac- 
teristics, one cannot call all direct experience 
private. Moreover, even the "private data" may 
be accessible and amenable to psychological 
investigation. To relegate what is private to 
the realm of the scientifically irrelevant may 
leave out what is psychologically relevant. A 
distinguished physicist, widely respected among 
psychologists, has this to say: 


Simple observation shows that I act in two 
modes. In my public mode I have an image of 
myself in the community of my neighbors, . . - 
in the private mode I feel my inviolable isolation 
from my fellows. . . . To ignore this difference, 
or to set up the thesis that the difference is unim- 
portant until it is proved, is opposed to the entire 
spirit of the operational approach. . . . The op- 
erations by which I now deal with the "public 
for me" are qualitatively different from the opera- 
tions by which I now deal with the "private for 
me." . . . Since one aspect of the structure of my 
universe is the operational difference between 
mine and thine, I must make my report in a lan- 
guage which recognizes this operational duality. 

. It may be that eventually we shall be able 
to take account of the operational dichotomy in 
the universes of each one of us in some simpler 
way. But until that time, it seems to me that by 
ignoring the dichotomy psychology is engaging in 
an unnecessary gamble and is probably riding for 
a fall. (Bridgman, 1945, p. 282 ff.) 


Bridgman asserts further that it depends 
upon the subject matter of a discipline whether 
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the private and social modes of individual be- 
havior should be recognized and insisted upon; 
while the social mode may be ignored in phys- 
ics, this is not the case in psychology. 

A methodologically interesting contribution 
to this question has been made by Wood 
(1940). He discusses the confusion between 
direct experience and private experience which 
has led to a misidentification of phenomenology 
as introspection. To clarify this situation, 
Wood introduces the distinction between in- 
spection and introspection. The first is an 
elementary form of cognition which is im- 
mediate. It is a primary experiential activity 
directed toward its appropriate object and at 
the same time it is not self-cognizant, i.e., not 
self-reflective. Introspection is a secondary sub- 
sequent cognitive activity in which the primary 
cognitions are reflected upon. The difference 
between these two cognitive modes may be il- 
lustrated as follows. Perceiving the action of a 
person is inspectional cognition, since I do not 
include my seeing in the perceptual act: the- 
action-of-a-person-seen. Introspection enters 
when I reflectively perceive the total situation 
as  myself-in-the-act-of-perceiving-the-action-of- 
the-other-person. Wood claims that inspec- 
tional data correspond to direct experiences, 
are accessible to all, and are essentially the same 
data with which phenomenology and Gestalt 
psychology deal in perception. Introspectional 
data are private, inasmuch as they are accessible 
only to the introspectionist himself. In this 
sense, introspectional cognition is a construc- 
tional knowledge “of a peculiar class of objects, 
namely, those belonging to the continuum of 
the introspectionist’s personal experience. In- 
trospection is dependent upon inspection, for, 
although introspection involves and presup- 
poses inspection, the converse is not true — 
inspection may or may not be introspective in 
character” (p. 227). In this connection it is 
noteworthy that Brunswik, notwithstanding his 
objectivistic functionalism, concedes this role 
to direct experience. “Tt must be stressed, how- 
ever, that act-psychology and phenomenology 
are important, and perhaps indispensable pre- 
liminaries to molar research of the objective 
kind” (1949, p. 59). 


MANIFEST ORGANIZATION AND COGNITION 


Employing the principles of phenomenal 
unit formation as the basis of distal representa- 
tion, Gestalt theorists and those of similar 
orientation have developed the special charac- 
teristics of the phenomenal field. This field 
always represents something — some content in 


103 


direct experience. Since direct experience has 
molar and relational characteristics, we may 
speak of the “manifest organization” of the 
phenomenal field. Manifest organization is 
awareness of behavioral objects and of their 
relatedness to each other and the relation be- 
tween their properties. Without manifest or- 
ganization, awareness would be a mere sequence 
of experienced events. In the words of Koffka, 
“|, . there would be no forces. One billiard 
ball would run, come in contact with another, 
stop, and the other would begin to roll. Two 
trains would collide, leave the tracks, and cars 
turn turtle and become wrecked; another mere 
sequence” (1935, p. 383). This is the kind of 
world that many S-R theorists, like Hume be- 
fore them, posit. The contiguity of events in 
space and time leads to the mental habit of 
uniting these sequences as cause and effect. The 
experienced “causal union” becomes felt neces- 
sity only if the same contiguities always occur. 

Phenomenal causality, however, according to 
the Gestalt laws of unit formation, is part of 
the manifest organization through which the 
why and how of objects affecting each other 
can be directly experienced. Duncker (1945) 
and Kohler (1947) have made searching exami- 
nations of the characteristics of phenomenal 
causality. Both discuss its intelligible aspects 
and its cognitive relations with thé objective 
environment via the behavior space. There is 
first the functional aspect of objects, which is 
learned and grasped through action, e.g, the 
function of the hammer in driving the nail. 
The experienced functions are not arbitrary 
but “ground consequence structures” in which 
the properties of the object are intrinsically re- 
lated to its use and effect. There is, further, 
an intelligible connection in the experienced 
correspondence between the function of objects 
and their material or their form. The material 
fits to the functional “valence” of things accord- 
ing to whether the material is light, heavy, soft, 
hard, flexible, or immobile. Correspondence 
of form and function is seen in the rolling of a 
ball, the notches on a key matching the casing 
of the keyhole, and the shortest detour conform- 
ing to the contour of an obstacle. Intelligible 
connections are also seen in the correspondence 
between the phenomenal properties of cause and 
effect, Although these phenomenal properties 
overlap, they may be enumerated. 


e I Positional correspondence, as in (a) spatial 
proximity: the effect is perceived close to the place 
of its origin, e.g., the sound is heard where the bell 
rings; or (b) temporal proximity: the effect follows 
closely its antecedent, e.g, the landscape is illumi- 
nated after the flash of lightning. 
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Il. Qualitative correspondence, as in (a) con- 
gruency: the effect is in harmony with its cause, e.g., 
the trace in the sand fits the form of the object, as in 
footprints, or the sound of knocks corresponds with 
the rhythm of the knocking; or (b) similarity or 
identity of content: one event passes from one me- 
dium to another but retains its character, e.g., the 
heat of the stove becomes the heat of the room, the 
wetness of the rain the wetness of the pavement, the 
blackness of the pencil point the blackness on the 
paper. 

Phenomenal causality therefore appears to be de- 
pendent on the objective contiguity of events. Here- 
by results a phenomenal coincidence which is not 
accidental or arbitrary. The factors of proximity, 
identity, similarity, and good continuation as they 
are inherent in the particular contents of events be- 
come codeterminants of the phenomenal coincidence, 
so that a selection occurs. Cause and effect depend 
upon positional and qualitative correspondence; in 
every case of direct causal experience this functional 
connection stands out perceptually from all other 
conceivable simultaneous relations and events. 


This analysis is, in effect, the reverse of 
Hume's position. Because in experienced causal- 
ity the time and place of the cause "coincide" 
phenomenally with the time and place of the 
effect, there is sometimes a tendency to link 
causally those events which have no causal con- 
nection save contiguity. The mere coincidence 
of a door slamming behind a person in a dark 
hallway and the hallway light going on at the 
same time can be to him a compelling impres- 
sion of causation without intelligibility. The 
factor of proximity is preserved, but the func- 
tional connection between cause and effect is 
phenomenally absent. In contrast, it is ever- 
present in our actions of manipulating objects 
where we are the causes of the change in the 
object, e.g, when we break a stick. It is there- 
fore not surprising that Kóhler (1947) applied 
the term “insight” to the manifest organization 
of phenomenal behavior and also of the ego-field 
structure in general. Both can have the charac- 
teristics of “awareness of determination.” The 
cognitive aspects of manifest organization can 
include the awareness of the dynamics of our 
own action: not only do we act, but we know 
why we act, and why our acts have effects. 

A commonplace instance of phenomenal caus- 
ality in behavior is action in terms of means-end 
relationships and detour-paths to goals, i.e., 
“this leads to that.” Ever since the experiments 
by Kóhler, Yerkes, and others with chickens, 
chimpanzees, and children, the principle of 
detour has been widely accepted. We have here 
an affirmation of Tolman's position that molar- 
ity, goal directedness, and cognition are neces- 
sarily interdependent. From any segmental 


point of view, each single movement away from 
the goal would actually lead away from it unless 
it is phenomenally related to the total detour 
path as part of the whole. It is in this context 
that Tolman and Brunswik (1935) analyzed the 
"causal texture of the environment" and ex- 
tended the concept of representation. In the 
organism's environment certain objects may 
function as the local representatives of other 
objects; the latter then are considered the enti- 
ties represented. The function of local repre- 
sentation in the causal texture of the 
surroundings has two subvarieties. The first is 
the case of tools or paths which are “behaved to" 
only because of their role as means-objects. 
‘They function “as ‘local representatives’ where- 
by to reach or avoid such and such more ulti- 
mate, ‘represented’ positive or negative goals" 
(1935, p. 45). "The second is the selective per- 
ceptual response to proximal stimuli (‘‘light- 
wave bundles and the like") as cues or signs, 
ie. local representatives, of the distal objects 
or events. The latter may also function as repre- 
sentatives for still other objects or events and 
may then come to serve as a cue or "sign Gestalt" 
for a causal relationship. For example, "the 
retinal effects on the child's eyes serve as local 
representatives, signs, for the facial expression 
of (another) individual. And this facial ex- 
pression, as a local representative, serves in its 
turn as a sign . . . for the will, good or bad, of 
the other individual" (1935, p. 51). This ap- 
proach essentially parallels the Gestalt tenets of 
phenomenal causality but introduces one im- 
portant variation. The parallel lies in the 
assertion that in the environment there exist 
causal couplings in which different events are 
regularly dependent upon each other; that be- 
cause of this, organisms come to accept one 
event as the local representative of another 
event; that they thereby come to "behave" ap- 
propriately to the more distant, "represented" 
situation or event. The variation lies in the 
assertion that the causal connections thus repre- 
sented always have some degree of equivocality, 
so that a probability factor is attached to the 
local representative. The organism approaches 
the environment with certain hypotheses and 
expectancies about the most probable causal 
relations and constancies. 


APPARENT BEHAVIOR: COGNITIVE STRUCTURING IN 
SOCIAL PERCEPTION 


During the last decade the social implications 
of cognitive structuring in phenomenal causality 
have been investigated by several authors. Mi- 
chotte (1950) performed a series of experiments 
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using simple geometric figures, e.g., two colored 
rectangles moving towards or away from each 
other, under varying conditions of speed, direc- 
tion, and distance. The subjects were presented 
with a variety of combinations: 


1. Object A moves toward and reaches B, which 
is stationary. When A moves rapidly, the im- 
pression is that it strikes B; when 4 moves slowly, 
it appears to gently join, touch, or unite with B. 

2. After A reaches B and stops, B begins to 
move. If A’s approach is more rapid than B’s 
subsequent motion, B's “moving away" seems to 
be caused by A, sometimes interpreted as a re- 
action of anger. If, however, A’s approach is less 
rapid than B’s subsequent motion, then B’s move- 
ment appears autonomous, only released by the 
contact with A, and gives the impression of flight, 
sometimes interpreted as a fear reaction of B. 
This occurs when the duration of the contact is 
short. When the contact lasts longer, “the im- 
pression produced is often interpreted in the 
sense of a momentary agreement between two 
accomplices who meet, but which gives place to a 
disagreement, followed by separation.” 

3. Object B begins to move after being touched 
by A and the two continue to move together at 
the same speed and in a common direction. If 
there is a short pause after contact, the impression 
is one.of friendly “getting together,” the unity 
being accentuated by the similarity of the speed 
and direction of the subsequent movement — the 
“law of common fate.” If there is no pause, and 
A approaches B slowly, with rapid acceleration 
during contact, A appears to carry off B by brute 
force. If, under the same conditions, the direction 
of A's approach is reversed during contact, 4 may 
appear to snatch B back with him (“kidnap- 
ping"). 

Michotte considers these impressions to be 
Gestalten, namely, specific perceptual kinetic 
structures related to the action of successive 
stimuli on the eye. The lessening of the dis- 
tance between the retinal images of the two 
objects causes the impression of approach, but 
this is much more than a simple response to dimi- 
nution of distance. The perception is selective be- 
cause in a complex visual stimulus field, the 
lessening of distance between two objects is neces- 
sarily accompanied by increasing distance between 
the moving object and others: 

"Yet the impression of ‘going toward B’ is 
never accompanied by the simultaneous impres- 
sion of ‘going away’ from these others. . - . There 
seems to be some sort of incompatibility between 
the simultaneous presence of impressions of 'go- 
ing toward’ and ‘going away from’ in the case of 
the same object. It is something analogous to the 
distinction between ‘figure’ and ‘background,’ 
where contour belongs exclusively to ‘figure’.” 
(1950, p. 120) 


According to Michotte, the organizational fac- 
tors of perception are autochthonous unit for- 
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mations which vary in strict correspondence 
with changes in the stimulus conditions, e.g., 
rate of movement, direction, and distance. Thus 
the category of causation is believed to have 
the same immediacy as shape or figure-ground 
relations. Perceiving movements as properties 
of figural units, we refer them to definite origins 
and so connect consequences with antecedents 
in terms of a constant object passing through 
and producing change. Specific sequences of 
motion are thus perceived as "functional con- 
nections.” These experienced causal contexts 
are also described as felt connections by Mi- 
chotte, who thereby bridges the traditional 
separation between cognition and emotion. The 
perceptual response includes similar somes- 
thetic and motor reactions in the "affected" 
observer, as in the experienced expressive move- 
ments of other persons. Studies pointing in a 
similar direction, linking cognition and emo- 
tion, have recently been made by Werner and 
Wapner (1952) . The interrelation between the 
tonic state of body equilibrium and perceptual 
forces in the visual field has been experimentally 
explored, leading to a sensory-tonic and viscero- 
tonic theory of perception. The visual dynamics 
of faces or triangles pointing right or left when 
presented in a dark room affected the sensory- 
tonic equilibrium correspondingly. A right- 
pointing profile or triangle was measurably 
displaced to the right of the median plane of 
the observer, etc. This approach opens an im- 
portant new avenue to establishing empirically 
the intraorganismic processes attending the per- 
ception of "physiognomic" or "demand quali- 
ties." 

All such findings have far-reaching applica- 
tions to the perception of social action. Mi- 
chotte's subjects spontaneously used not only 
the language of causation but also spoke in 
terms of human emotions and attitudes. They 
described the inanimate figures in such phrases 
as "It is as though B was afraid when 4 ap- 
proached and went off .. „ A joins B, then 
they fall out, have a quarrel, and B goes off by 
himself," and so on. 

Heider and Simmel (1944) independently 
arrived at very similar results after showing a 
movie to several hundred college students. The 
movie presented different geometric figures, e.g., 
a small triangle, a large triangle, a circle, and a 
rectangle in varying combinations and move- 
ment sequences of changing speed and direc- 
tion. The subjects' verbal descriptions showed 
a striking preponderance of animistic responses, 
describing the figures not only as human but 
also in terms of interpersonal relationships such 
as rivalry between two men about the girl, or 
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family conflicts among mother, father, and 
child. The second overwhelming tendency was 
to structure the movement sequences into units 
which comprised a meaningful action in which 
the three forms interacted in a causal and 
motivated fashion, e.g., chasing, attacking, flee- 
ing, and so on. These actions also became 
related to the fourth figure, the large rectangle, 
part of which opened and closed during the film. 
Over ninety percent of the subjects described 
this rectangle as a house which the large triangle 
angrily destroyed at the end of the story. Heid- 
er’s analysis demonstrated once more that the 
relations of temporal succession, spatial proxim- 
ity, good continuation are decisive stimulus 
conditions for the causal and interpersonal 
perceptions involved. 

In all of these experiments the influence of 
past experience upon the perceptual reactions of 
the observers may appear obvious. However, 
if the explanation of directly apprehended phe- 
nomenal causality were to be derived from past 
experience alone, it would have to account for 
the lawful stimulus response correlations ob- 
tained by Michotte and Heider. Of course, the 
reported “animation” points to the similarity 
between the movement forms of the abstract 
figures and the previously experienced move- 
ment forms of concrete persons, Yet to derive 
animation from this similarity does not answer 
the primary problem: What are the original and 
relevant stimulus conditions for the perception 
of human action, and how do these phenomenal 
unit formations arise which are germane to 
perceiving others? Heider (1944), Asch (1952) 
and MacLeod (1947) have each dealt with these 
questions in rather similar ways. 

Under everyday conditions objects are not 
usually experienced as agents which actively 
cause events (except in animism). Objects are 
related to each other in causal determination. 
The cigarette scorching the tabletop is not seen 
as the cause of damage, but its burning (and 
being left unattended) is so seen. The laws of 
unit formation operate differently when we per- 
ceive the behavior of objects and when we per- 
ceive the behavior of persons. In human action, 
we perceive the person as the causal agent or 
source from which the action issues. The person 
and his action perceptually form a specific 
causal unit in which the person is the author 
of the action and its effect. The perceived di- 
rection and attainment of his action depends 
upon an organization of movements according 
to spatio-temporal laws of proximity, similarity, 
and good continuation in varying degrees, Inas- 
much as the act and its accomplishment fit the 
perceived direction, as, for example, in attack 
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or in pointing to an object, we also perceive a 
congruence between the motive, goal, or inten- 
tion of the behaver and the course and terminus 
of his activity. These principles apply just as 
much to misperception as to the correct per- 
ception of human actions. 

Heider (1944), following the leads of W. 
Stern (1938) and Werner (1940), analyzes 
different levels and modes of perceived personal 
causation ranging from the more primitive 
(magic) to the more complex. Different cog- 
nitive structures are involved in such cases as: 
(1) The primitively experienced union of the 
doer and the deed — the person who inad- 
vertently steps on my foot is the cause of my 
pain. (2) The more differentiated causal rela- 
tionship between the doer and the deed — the 
dentist who unintentionally causes me pain is 
not perceived as the cause of my pain. "This also 
has significance for the relationship between 
frustration and aggression. Aggression is more 
readily evoked and more intense if the frustra- 
tion is attributed to a person rather than to 
impersonal causes. The application of these 
observations to other interpersonal relations, 
such as forming impressions of personality, has 
a wider orbit than can be discussed here (Asch, 
1946; Mensch and Wishner, 1947; Chapter 17, 
this work). 

The general maxim should be stressed that by 
virtue of these specified unit formations the 
"other" comes to be represented as a causal 
entity in the phenomenal field of the person. 
Thus “the paramount fact about human inter- 
actions is that they are happenings that are 
psychologically represented in each of the par- 
ticipants" (Asch, 1950, p. 142). In the person- 
to-object relation, representation is one-sided, 
whereas in the person-to-person relation, it takes 
place on both sides and is interdependent. Psy- 
chological interaction between person and per- 
son occurs "between entities both of which are 
systems which perceive, feel, and understand." 
This leads to the concept of the “mutually 
shared field," which includes both the common 
behavioral environment and the psychological 
perspectives of the interacting behavers as the 
objective sphere of interaction. Even the most 
simple behavioral interplay, let alone coopera- 
tive acts, requires a reciprocal representation in 
which "different and complementry actions are 
executed simultaneously with reference to each 
other" (Asch, 1952, pp. 163, 175). In his analy- 
sis, MacLeod (1947) points to the phenomeno- 
logical fact that a person has a unique status 
among other objects of the psychological field, 
that by being apprehended as an "other self," 
he is more or less "like me" and is at the same 


COGNITIVE THEORY 107 


time apprehendable as an "object" which can 
generate its own purposes. Our dislike of a 
person will be felt as caused by, not as causing, 
the perceived purposes or properties of the 
person. 

The fact that these principles are still in need 
of empirical elaboration is recognized by Mac- 
Leod, who urges the use of improved methods 
of observation, especially of everyday life situa- 
tions. A fruitful approach in this direction may 
be the dividing of the behavior continuum ac- 
cording to criteria derived from the principles 
of unit formation in social perception, a method 
proposed by Wright and Barker (1951). Action 
units are defined as molar behavior episodes 
oriented toward a molar environment, ie., to- 
wards situations. The molecular aspect of ac- 
tions, such as steps in walking, are defined as 
subordinate and mediating part processes. 
Among other criteria, behavior episodes have 
the following molar and whole-part charac- 
teristics: invariance of direction; relative in- 
clusiveness or size which varies with the subgoals 
and subunits; the span of the person's own 
behavior perspective in his current goal pursuit. 
These criteria are further refined with regard 
to relative potency of the phases of an episode, 
its transitive and consummatory stages, the de- 
gree of completion or incompleteness of episodes 
and their overlapping. Related molar criteria 
for perceived behavior units have been devel- 
oped by Krech and Crutchfield (1948, pp. 30 fF). 

All the foregoing discussions of social per- 
ception share this in common: they deal with 
the stimulus as a cognitive representatior. More- 
over, this representation can be conceived as 
causing behavior, and the recognition of this 
can lead to predictions. Here again cognitive 
and behavior theorists are divided on a funda- 
mental issue of method. Cognitive theorists not 
only recognize the relevance of the behaver's 
phenomenal field for all psychological theory, 
they also consider it a cause, i.e., an intervening 
variable or an explanatory construct of his ac- 
tions under certain conditions. Therefore psy- 
chological understanding and explanation can 
coincide for cognition theorists, who in such 
cases would invoke causal principles for their 
conceptualization. For the S-R advocates with 
strong operational leanings, such as Kendler, 
this leads to the danger of not clearly distin- 
guishing "the requirements of scientific ex- 
planation from those attributes of propositions 
which are capable of instigating in some a feel- 
ing of ‘understanding’ (the ‘aha’ phenome- 
non)" (1952, p. 274). We must conceive the 
intervening variables “as economical devices to 
order experimental variables in relation to the 


dependent variables." Psychologists are criti- 
cized for going beyond the operational meaning 
of the intervening variables by reading into 
them surplus, particularly causal, meanings. 
“Most, if not all psychologists were confronted 
with the question ‘Why?’ prior to their exposure 
to scientific training" and their "prescientific 
explanatory techniques have lingered on and 
been confused with scientific explanation" (p. 
274). Apparently causal understanding has 
been placed outside the realm of science and 
into that of "personal cogitation" and "intui- 
tively satisfying explanation." Ritchie (1953) 
satirically criticizes Kendler for this proposed 
“circumnavigation of cognition.” We may ask 
whether the following instance can be concep- 
tualized as an "intelligible" causation from 


Kendler's point of view. When an observer 
sees a motorist avoid a child on a highway, he 


will experience the same causal connections as 
the driver who perceives the child and avoids it. 
Here is an experience which causes behavior 
and is perceived as such by an observer. For 
Kendler, causal explanation is to be substituted 
by either formulating empirical relationships 
with predictive capacity or constructs of deduc- 
tive capacity. This rationale is a direct out- 
growth of Feigl’s proposal that “low-grade” 
explanations in science be replaced by “high- 
grade" explanations, even though it is ad- 
mitted that psychology has yet achieved only 
a few of the “low-grade” (Kendler, 1952). Feigl 
(1951) considers “high-grade” explanation that 
which relegates macroscopic events to under- 
lying microphysico-chemical events. The prob- 
lem here is: How can the logical positivist bridge 
the gap between psychology and physics without 
interposing the organism as a system? Failure 
to recognize the primacy of functional and 
structural organization in an organism inevi- 
tably leads to an uncalled-for reductionism 
which skips the entire dimension of biological 
data. Rather than prematurely delegating 
causal thinking in science to the realm of the 
subjective, prescientific, and “low-grade,” there 
exists the need to account scientifically for the 
very phenomenon or causal thought. This need 
has been accepted by the cognitive theorists. 
Duncker (1945) has analyzed the conditions for 
partial and total insight in physical science. He 
shows that there are ultimate irreducible facts 
which are not intelligible. However, the formu- 
lation of laws which relate these facts to each 
ogher often do qualify as intelligible. They 
must even have logical consistency, particularly 
if they are couched in the language of the hypo- 
thetico-deductive method so ardently and rightly 
advocated by Feigl (1951). 
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The assertion that truth, understandability, 
and logic are reducible to mere intersubjective 
consensus because science must be “public” has 
been challenged by Bridgeman (1940). 


The process that I want to call scientific is a 
process that involves the continual apprehension 
of meaning . . . accompanied by a running act 
of checking. . . . This checking and judging and 
accepting, that together constitute understanding, 
are done by me and can be done for me by no 
one else. They are as private as my toothache and 
without them science is dead. . . . When I fol- 
low the marks on paper that constitute a treatise 
on symbolic logic, I am not willing to call my 
activity scientific unless I can say at each stage 
of the process “I understand." ... (1940, pp. 41, 
46) 


WHAT IS A RESPONSE COGNITIVELY CONSIDERED? 


It seems difficult to define the relevant stimuli 
for social perception and thinking in physical 
terms alone. They have meaning. S-R theorists 
typically refer present meanings to past asso- 
ciations. The manner in which cognition 
theorists have dealt with the relationship be- 
tween response and past learning differs from 
SR conceptions. Both schools agree that pres- 
ent responses can always be influenced by past 
experience. They disagree on how the past 
learning occurred and how it affects the current 
response. To begin with, cognitive theorists 
have not separated stimulus and response in 
the traditional way. The stimulus is phenome- 
nally represented. through configurational re- 
sponses to the relations between proximal 
stimuli. The response is therefore already a 
process of spontaneous organization. In the 
case of learning the response is also viewed as 
relational rather than specific. For this reason 
is it advisable to examine first the treatment of 
the response in the pertinent areas of learning 
and problem solving. 

"There has been a tendency to exaggerate the 
reservations, voiced by Gestalt theorists, about 
the role of past experience, As Wertheimer 
(1945) emphasized, the issue is not present struc- 
turing versus past learning, but rather how 
learning takes place, whether primarily through 
associative couplings or through organizing the 
relationships inherent in the content. It is held 
by cognitive theorists that genuine learning oc- 
curs through structuring and restructuring of 
the psychological field. In this respect, learning 
and problem solving have cognitive processes in 
common. 

Thus, maze learning by animal or man is not 
viewed as a stimulus response linkage in the 


sense of peripheral receptor-effector connections. . 


Rather, the learning is viewed as the acquisition 


of central representations of layout, place rela- 
tions, and directions, viz., reaching the goal by 
swimming after having learned to reach it by 
running, or reaching it after severance of the 
kinesthetic tracts. The experimental fact that 
new adequate goal paths can be selected when 
those learned previously are blocked led Tol- 
man to propose the “cognitive map" as an 
intervening variable. Once acquired, such cog- 
nitive structures (sign Gestalten, field expectan- 
cies, etc.) are available for the organism. They 
can be activated when future needs arise. These 
representations can subsequently be reorganized 
when detours are demanded. 


Light is thrown on the organizing processes in 
learning and problem solving by the experiments of 
Köhler and others with primates and children (Kóh- 
ler, 1925; Alpert, 1928). A gap between the reach 
of the arm and the food lure is filled. A stick is 
"seen out" from its configurational context and its 
current properties are "recentered" into "rake-in and 
arm-extension” properties. One trial alone is suffi- 
cient for learning. Also transfer to other objects 
with gap-filling and rake-like qualities occurs spon- 
taneously when a stick is not available. This mili- 
tates against the interpretation that specific 
associations are formed in the original experience. 
The achievement is rather rooted in the learning of 
functional connections between object properties and 
action. The understanding of such causal structures 
involves a certain cognitive generality. The famous 
double-stick solution of Sultan occurred later with 
other sticks of new diameters. The transfer consists 
in the carry-over of the experienced joinability of 
hollow sticks which can be wedged into each other. 
Kóhler's theory of a predominantly visual structuring 
has been supplemented by Birch. He showed that 
chimpanzees ‘without any previous stick experience 
could not solve the raking-in problem within thirty 
minutes. After having played with sticks they 
achieved solution within twenty seconds, The 
interposed experience was carefully controlled. The 
animals were observed to use the sticks as extensions 
of the arm for poking or making contact from a dis- 
tance, but they never used them to sweep objects 
into reach (Birch, 1945, p. 375). The use of the 
stick for raking is therefore a "new" pattern of re- 
sponse in which the preceding stick experiences are 
reorganized. This stick experience may be necessary 
action-wise, but again does not consist of specific, 
“contiguously learned items of experience” (p. $81) . 


Concerning the use of previous knowledge in 
problem solving by human subjects, the issue 
is the following: Can insight be reduced to a 
response which repeats an identical or partially 
identical element of past experience, or repeats 
identical reactions to equivalent stimuli, as Bor- 
ing (1946) and others claim? In the first place, 
it has been shown that past experience often 
hinders solution in problem solving, to wit, 
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Duncker's (1945) investigation of “functional 
fixedness,” and Maier's (1930, 1931, 1933) re- 
search on "direction" in reasoning. Second, 
Maier (1945), in a variation of his experiments 
on hatrack construction, has challenged spe- 
cifically the doctrine of identical elements and 
of equivalence: reproductive thinking, defined 
as a transfer of previous knowledge, is only 
operative when the identity or equivalence is 
discovered phenomenally. "Objective identity 
is wholly inadequate." In productive thinking 
not only the present but also past experiences 
can be changed and restructured. Stimulus 
properties which have not appeared before may 
appear under the stress of a problem situation. 
According to Maier, this militates against the 
traditional assumption that only the perception 
of the present thing may change during learning 
or reasoning; that the memories of this percep- 
tion, however, cannot be modified and reorgan- 
ized because they are static. 


With his experimental design Maier did not probe 
beyond reproductive thinking in which subjects had 
to discover an identity between an item in their 
past life experience, e.g., a hatrack, and a presently 
available item, e.g, a clamp on a pole. Nor did he 
investigate productive thinking beyond the reorgani- 
zation of the memory for a specific item into a dif- 
ferent familiar item; e.g., a clamp which first had 
been used to join two poles for a supporting column 
later had to be recentered into a hook, creating a hat- 
rack, Maier's definition of productive thinking could 
be widened to include the creation of a new aspect 
which is not contained in a specific past experience. 
Wertheimer's (1945) analysis has strongly suggested 
such genuine creative structuring in the instances of 
young Gauss and Galileo. 

Restructuring may also transcend the discovery of 
a specific familiar item. In the solution of social 
problems, for example, a re-examination of the cur- 
rent goal or intention may be required. This specific 
goal is taken out of the present context in search for 
the broader underlying intention. The discovery of 
the "true" nature of the goal can produce a novel 
organization of the means for its attainment. Wer- 
theimer (1945) devoted a chapter to the conflict 
which two boys encountered in playing badminton 
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because the older one was so superior to the younger 
that the latter refused to continue the game. The 
solution: the older discovers that when both join 
in the common goal of keeping the bird up in the air 
as long as possible the game can remain enjoyable 
for both. The boy re-examined the need for his 
present competitive goal and found that the under- 
lying intention or true goal of the game was to have 
fun together. This leads to the recentering from 
"playing to win" to “playing cooperatively.” 

In another type of productive thinking an un- 
familiar problem may be reorganized in a novel way 
for the utilization of past learning. This is the case 
when a child has once grasped the structural prin- 
ciples for determining the area of a rectangle. He 
may then find the area of a parallelogram by discov- 
ering that there is "too much on one side" and 
"something missing on the other." He reorganizes his 
percept and "sees out" the superfluous triangle on 
the one side and fits it to the other side of the paral- 
lelogram in such a way that a rectangle is created. 


Transfer effects in terms of relational prin- 
ciples have been experimentally demonstrated 
in Katona's (1940) experiments on card tricks. 
The group which learned by understanding 
could subsequently solve both the original and 
new card tricks, while the group that learned by 
drill showed poorer retention on the old tasks 
and failure on the new ones. An important 
feature of this, as well as many other studies of 
this kind, is that even the S's who demonstrated 
"understanding" were not readily able to verb- 
alize the organizational principle involved. 
Experiments by Szekely (1940, 1950a,b) on the 
creative restructuring of latent knowledge spe- 
cifically suggest the presence of implicit rela- 
tional thinking. A similar conclusion was 
drawn by Leeper (1949, 1951) in reviewing the 
work on determining tendencies and concept 
formation. He indicates that there is an inti- 
mate connection between the cognitive struc- 
tures of mental set, imageless thought, transfer 
in problem solving, and certain processes of 
generalization. A common denominator of these 
processes is that they usually are outside of 
direct experience and consequently unverbal- 
ized. 


SILENT ORGANIZATION: A COGNITIVE DETERMINANT OF RESPONSE 


The inference that only certain aspects of the 
cognitive organization may be “manifest,” i.e., 
in awareness, poses a new problem. Do “non- 
conscious” processes have representational 
characteristics, and if so, how? There are nu- 
merous instances that indicate the functional 
dependence of the phenomenal field on a frame 
of reference which transcends it in space or time. 
In the case of the perceived upright, for ex- 
ample, Witkin (1949) has demonstrated the 


influence of spatial framework which is not part 
of direct experience. Likewise, the speed and 
direction of perceived motion may be deter- 
mined by a spatial frame of reference which is 
beyond awareness (Brown, 1931; Duncker, 
1929). The classical studies on successive judg- 
ments of weights have established the effect of 
a temporarily operative frame of reference not 
manifest in the subjects experience (Wever 
and Zener, 1928). Sherif (1936) made these 
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findings the point of departure for his well- 
known theory of social norms. Sherif and 
Cantril (1947) reviewed the experimental evi- 
dence for the effect of such frameworks on 
psycho-physical judgments in various modali- 
ties. This evidence was utilized in developing a 
psychology of attitudes originating in percep- 
tion and governed by frames of reference. Re- 
cently Helson (1947, 1948) has attempted to 
subsume a wide range of similar phenomena 
under the rubric of “adaptation level” as a 
unitary explanation. 

The Gestaltists have introduced the concept 
of “silent organization” to account for factors 
outside of awareness which influence behavior. 
Among such factors, Koffka has extensively dis- 
cussed the functional effect of the spatial frame- 
work upon body posture in action and 
locomotion, and upon the localization of the 
self (1935, p. 216 ff, p. 384 ff). In the case 
of silent organization the individual's con- 
scious judgment or performance is thought to 
be dependent upon a nonconscious frame of 
reference similar to the way in which a figure 
depends on the unfocused background. Apply- 
ing the laws of whole-part relationship here as 
well, Gestaltists propose that the phenomenal 
field can become a functional part of the wider 
process of silent organization as a whole. 

Silent organization will be used here in a 
broader sense. It is suggested that various prob- 
lem areas may be dealt with by using silent or- 
ganization as an explanatory construct. This 
construct would cover the formation of frames 
of reference and mental sets, the development 
of both generalizations and concepts in their 
preverbal form, and the carrying over of im- 
plicit organizing principles from one problem- 
solving situation to another, The common 
denominator of these silent organizations is that 
they are not tied to specific individual contents 
but operate as cognitive structures which have 
the character of “typifying schemata,” guiding 
orientation and action, Head (1920), Bartlett 
(1932), Lashley (1951), Piaget (1937, 1952), 
and others have employed the term "schema" to 
designate essentially the same integrative proc- 
esses. If we accept this concept, it follows that 
silent organization has representational charac- 
teristics. This representation does not consist 
of specific phenomenal objects. It is a trans- 
phenomenal organization which relates spatial 
or temporal units of the behavioral field into 
an integrated structure — the schema. Likewise, 
relations between ideational or action units of 
behavior can attain schematic representation. 

Such schemata can exercise a striking influ- 
ence on individual contents of perception and 
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thought. Two experiments, among many others, 
may serve as illustrations. In Luchins’ (1942) 
study, the S’s had to solve the task of obtaining 
a certain quantity of water with the use of three 
containers of different capacities. For example, 
the S is told: you have jars of 18-, 43-, and 10- 
quart capacity; get 5 quarts. After having solved 
several such problems, most S's developed a set 
to subtract from one of the larger containers. 
The last problem was: given 5-, 15-, and 20-quart 
jars, get 5 quarts. Certain S's did not realize 
the direct availability of the 5-quart jar. Instead 
they adhered to their stereotyped procedure of 
subtraction. They measured 15-55 or 20-5-5-5 
or 20-15. It is not only noteworthy that their 
schema was more general (subtraction) than a 
specifically fixed sequence, but also that it liter- 
ally prevented the S’s from cognizing that 5 
quarts is 5 quarts even though they were able 
to perceive the 5 and use it for their schematic 
pattern of solution. 

An illuminating case in which schemata of 
everyday life are tapped is the experiment by 
Bruner and Postman (1949). The S's were 
tachistoscopically presented with playing cards, 
some of which had color and suit reversed, e.g., 
clubs red and hearts black. The recognition 
threshold for the incongruous cards was signifi- 
cantly higher than for ordinary cards, ‘The 
incongruous cards also required a greater num- 
ber of prerecognition trials, However, when a 
subject had once recognized an incongruous 
card, the threshold for later incongruous cards 
was substantially reduced. Once the possibil- 
ity of suit-color reversal was recognized, a shift 
in cognitive structure took place. We have here 
an instance of the strength of a schema and its 
reorganization under the influence of changed 
information. Bruner, without using the term 
"schema," speaks here of a "hypothesis" which 
"is best likened to such terms as determining 
tendency, set, Aufgabe, predisposition. It may 
be regarded as a highly generalized state of 
readiness to respond selectively to classes of 
events in the environment" (1951, p. 125). 
Again, "a specific hypothesis is not an isolated 
expectancy . . . but rather relates to more in- 
tegrated systems of beliefs or expectancy about 
environmental events in general" (ec, p. 127): 
Bruner emphasizes that such hypotheses are de- 
pendent upon a "cognitive map." Tolman 
(1951) , in his latest formulations, has extended 
the map into a cognitive matrix which is a larger 
system of hypotheses, similar to what we have 
here called schemata. 

The attempt to subsume a whole range of 
psychological phenomena under the construct 
of silent organization as a system of schemata 
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may be open to criticism. It may be said that 
what is gained in the inclusiveness of the con- 
struct is outweighed by its vagueness. Of 
course, the various types of schema building are 
still in need of empirical clarification. The core 
value of this conception, however, is that it pre- 
pares the ground for a broader unitary theory. 
A starting point in this direction would be the 
proposition that the phenomenal field and silent 
organization share representational processes 
which can also extend into memory function 
itself. This is recognized in one way or another 
by contemporary theorists. 

Tolman (1951) contends that perception in- 
cludes more than what is merely present to the 
senses; it often encompasses “familiar objects 
on the wall behind my back . . . For I might 
well be found to be immediately ready to behave 
to all these other objects.” Moreover, the ‘“‘be- 
havior space” contains not only objects but also 
their spatial and temporal relations and direc- 
tions. Even mechanical, aesthetic, and language 
dimensions may be present. Inasmuch as any of 
these govern immediate action, they are "per: 
ceived, whether or not (in introspective terms) 
the actor is then and there consciously aware of 
them" (1951, p. 296 ff). Tolman also speaks 
of acquired "typed (not concrete) images of ob- 
jects . . . conceived to be arranged along func- 
tionally defined ‘generalization dimensions’ ” 
(pp. 290 ff). These would correspond to the 
ideational schemata under discussion. 

In the last analysis all these formulations 
point to a need for linking the psychological or 
behavioral descriptions to physiology of brain 
processes and traces. The postulate of schemata 
as silent cognitive representations strongly sug- 
gests that certain relationships between the en- 
vironmental field and the trace system must be 
preserved. It is not necessary here to speculate 
on the ultimate nature of this relationship, 
whether it is, for example, isomorphic. It is 
important to realize, however, that a mosaic and 
connectionist theory of the physiological mech- 
anisms involved seems least adequate. Von 
Bertalanfly, who refers to the work of von 
Holst, Lashley, Krech, Goldstein, and others, 
concludes: “The fact that only one experience 
is in the focus of consciousness at a time seems 
to indicate that its physiological correlate ex- 
tends over the whole ‘brain field.’ If the mosaic 
theory were correct, obviously any number of 
excitations and corresponding experiences could 
be coexistent . . . If, however, form perception 
is a system process, dynamically ordered and 
extended over larger brain areas, the aftereffect 
of excitation will consist not in leaving traits 
[traces] in individual cells, but in an alteration 
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of the brain field as a whole” (1951, p. 29). In 
a similar vein, Koffka maintained: “there is no 
break between the conscious and the uncon- 
scious, rather have part-processes of total strains 
of events the additional property of being con- 
scious” (1928, p. 68). Antedating his later 
distinction between manifest and silent organi- 
zation, he states: “Conscious processes, depend- 
ent upon unconscious conditions, arise out of 
unconscious processes, terminate in such, and 
leave behind new unconscious conditions” (p. 
68). Various theories of trace systems have simi- 
lar implications, namely, that traces are organ- 
ized or related and not aggregates of elements. 
Emphasis upon such physiological mediation is 
urged upon us precisely because silent organiza- 
tion poses the problem of how a transphenome- 
nal representation codifies certain external as 
well as past events. In this area, the intriguing 
formulations of Bateson (1951) have been 
referred to earlier. 


BODY IMAGE AND EGO AS COGNITIVE ORGANIZATION 


We have thus far considered chiefly the cog- 
nitive relationship between external and in- 
ternal events. In dealing with frames of refer- 
ence and schemata which mediate response, an 
important internal framework has not been 
mentioned. The ego is not only cognitively 
represented in the phenomenal field, but it also 
exerts an influence in terms of silent organiza- 
tion. The Gestaltists, however unsystematically, 
have treated the ego in several ways. First, the 
ego is included in the phenomenal field as a 
part or a subsystem. Thus it becomes a per- 
ceptual object of a special kind. It is that part 
of the behavioral world which is “in between 
the front and the behind.” It is therefore a sys- 
tem of spatial coordinates which operate as an 
experienced reference point in the perception of 
direction. In that sense the ego differs from 
other perceptual objects because there is always 
a “mine” relationship involved, e.g, my hand; 
to the left or right of me; up and down, etc. 
This is an aspect of the “personal dimensions 
of space,” as William Stern called them (1938, 
p. 95). 

Second, the ego functions as a silent organiza- 
tion in being a schema of orientation in the 
behavior space for localizing parts of our bodies, 
or for localizing our bodies as a whole in sur- 
rounding space. The body image as analyzed 

eby Head (1920) and further elaborated by 
Koffka (1935) and Schilder (1935) is an in- 
structive paradigm of cognitive representation 
without awareness. They arrived at similar con- 
clusions. The body image constitutes a central 
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integration of cues and traces from diverse sense 
organs into a schema, a plastic model of our- 
selves. In particular, Koffka adopted Head's 
view as to the importance of the memory trace 
system, which forms a silent background or 
frame of reference in the conscious perception 
of body posture and position. According to 
Head it is not the image, visual or motor, which 
is the standard by which postural changes are 
appreciated. 


Every recognizable change enters into con- 
sciousness already charged with its relation to 
something that has gone before. . . . Just as ona 
taximeter the measured distance is presented to us 
already translated into shillings and pence, so the 
final product of spatial changes rises into con- 
sciousness as a measured postural appreciation. 
For this standard against which all subsequent 
changes of posture are measured before they enter 
consciousness, we have proposed the word “sche- 
ma." (1920, pp. 60 f., pp. 723 f£) 


Both Head and Koffka, therefore, made a dis- 
tinction between process and trace, i.e., the 
schema building and the schema built. The lat- 
ter is the unconscious standard in the form of 
traces representing the model of the body 
against which the current process of postural 
change is appreciated. 

We cannot pursue here the finer elaboration 
of the postural and surface models of the body. 
Suffice it to say that both are interconnected and 
related to the ego. The surface model develops 
into a more constant and value-laden represen- 
tation of the body. In this connection, it should 
not be overlooked that Head's theory stems from 
extensive empirical research in neurology. 
Koffka made these functional schemata, which 
are models of ourselves, the starting point for 
his own theory of a permanent ego substratum 
or ego system (1935, pp. 516 f£) . An organized 
trace system develops which integrates ego- 
related experiences (i.e., "mine" experiences) . 
Only the experiences which refer to the ego or 
ego system acquire self-character. They form 
the subjective pole of memory as against the 
objective pole of an environmental trace system. 
The so-called “ego-trace column" is the auto- 
biographic frame of reference which temporally 
localizes every experience as to present, past, and 
future, It preserves the continuity of the “I” 
experience, and makes it possible to register 
cognitively the changes which I, myself, under- 
go, as well as the changing events in my life 


space. This ego frame of reference can operate, 


silently and also can become represented in 
awareness, e.g, when I represent to myself an 


event in my past. ; 
The difference between the environmental 
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and the autobiographic trace systems finds re- 
markable support in the studies by Claparéde 
and his co-workers (1911) on the disturbed ego 
reference in the memory of patients with 
Korsakow syndrome. These patients show no 
recognition of persons or recent events but have 
functional retention of environmental happen- 
ings. They know their way around in the hos- 
pital and are well oriented as to the function 
and rank of the hospital personnel. When faced 
with their doctor such patients usually fail to 
recognize him, yet they will be able to "guess" 
his name from a list of names. Both Koffka and 
Claparéde conclude that recognition requires 
the cognitive reference of "me-ness," i.e., the 
experience of “familiar to me." 

Other aspects of the ego substratum as seen 
by Koffka are the motivational content and di- 
rection in self-referent behavior and the "execu- 
tive control of “my” actions These are 
exemplified in the emotions of self-interest and 
in the experiences of wishing, willing, intend- 
ing. Here again only certain experiences and 
emotions are referred to the self. The support 
for this construct of ego structure is found, for 
example, in experiments on level of aspiration, 
on the Zeigarnik effect and on the completion 
of interrupted tasks. All these findings are taken 
to indicate tensions which originate dynamically 
in the ego system and often belong to the dimen- 
sion of success and failure feelings. Such ten- 
sions are therefore distinct from neutral, 
environmental forces. In this context, attitudes 
become forces issuing from the ego and they 
can affect the phenomenal field. 

Important as these conceptions are as a point 
of departure for formulating the relationship 
between cognition and conation, Gestaltists 
have not formulated a systematic theory of per- 
sonality, ego function, and motivation. Nor 
have they delved into the processes which me- 
diate between motivation and representation. 
Aside from an incomplete definition of attitude, 
they have not really shown how attitudes influ- 
ence the "autochthonous" Gestalten in the 
phenomenal field. The reason for this may lie 
in the fact that the Gestalt school has limited 
its objective to first establishing formal prin- 
ciples of representation in the organism-environ- 
ment relationship. 'The cognitive aspects of 
stimulus and response have been developed in 
terms of organizational structures which repre- 
sent either the distal or the internal environ- 
ment. Such an approach may be considered a 
basic mapping out of what Gibson (1951) has 
called the "world of literal perception." In 
contradistinction, the “world of schematic per- 
ception" has not been explored. The latter, ac- 
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cording to Gibson, includes the significance of 
objects, their meanings and symbolizations as 
the person acquires them in his particular cul- 
tural milieu, The Gestaltists have not at- 
tempted to deal with the acquisition. and 
function of symbols, concepts, ideational con- 
tents, and language, nor with the manner in 
which all these are reflected in the phenomenal 
field and in motivated behavior. Koffka ac- 
knowledges this gap. 


The environment in which we behave is not 
only the actually and palpably present behavioral 
environment, but also the environment which 
we "merely" imagine or think of. This environ- 
ment is responsible for the greatest achievements 
of mankind. Without it neither the sciences nor 
the arts would be possible. Closely connected with 
this environment is our language and the similar 
symbolic functions which we have developed. And 
an ultimate explanation of the problems of 
thought and imagination will not be possible with- 
out a theory of language and the other symbolic 
functions, But we shall exclude the study of lan- 
guage from our treatise. (1935, p. 422) 


Another admitted gap is the social and value- 
oriented aspect of behavior. Efforts have been 
made to implement Gestalt theory in this direc- 
tion. With regard to value, Kóhler (1938) has 
presented a novel approach of combining the 
phenomenological analysis of "requiredness" 
with the formal principles of isomorphism. Asch 
(1952) has extended the Gestalt approach with 
his cognitive analysis of human interaction. 
However, in these later efforts, symbolic be- 
havior and its relation to motivation are secon- 
dary. 

Problems which arise in this connection may be 
typified by observations which Bruner (1951) cites. 
It is only with the greatest difficulty that one can 
teach or "force" the Chukchee Indians "to carry out 
anything resembling an adequate sorting of the 
Holmgren yarns” because of their extremely limited 
color nomenclature. “Yet these reindeer-herding 
people can and do apply more than two dozen names 
to the task of distinguishing the patterns of the rein- 
deer hides" (p. 136). The suggested distinction. be- 
tween literal and schematic perception is here quite 
apparent. : 

"The influence of cultural and personality factors 
on cognition, and vice versa, has been the focus of 
numerous experiments of the so-called “New-Look” 
type. The study of the "Honi" phenomenon by 
Wittreich (1952) is a striking illustration because, 
like many of its kind, it demonstrates that cognitive 
and motivational factors intertwine but does not 
explain how. In the Ames (distorted) room, the S 
commonly sees the room as normal, while a face will 
shrink or enlarge in size depending upon the window 
through which it is seen. When faced with his own 
spouse, however, the S reports the room as distorted 


D 


113 


rather than the face of his marital partner (Honi 
phenomenon). The spouse's face appears less 
changed from its normal size than is that of a 
stranger. Similar findings were obtained when the 
marital partner was compared with a stranger walk- 
ing through the room. Strangers of both sexes were 
used. Twenty-four of the thirty-two S's were married 
less than a year and the Honi phenomenon was 
predominant among them. Familiarity and fre- 
quency as an explanation were ruled out, since there 
was a negative correlation between length of mar- 
riage and the measured size of the Honi effect. 


The examination of the cognitive aspects of 
the response points to the problem of silent rep- 
resentation as a behavior determinant. The 
decisive difference between the cognitive and 
S-R approach as to the nature of traces may be 
thus summarized. The associationists conceive 
them as static and specific, e.g., habits which 
can be combined in various families. Cognition 
theorists, on the other hand, conceptualize 
traces as representing relations or systems of re- 
lations. In the habit theory, items of knowledge 
cannot be modified and new problems are solved 
by reinstating identical or fractional identical 
past responses. The organization hypothesis 
permits the restructuring of past responses in a 
novel way and offers a unitary explanation of 
transfer and productive thinking. It also intro- 
duces a systematic account of how manifest 
responses occur in a nonconscious context. This 
context is a schematic representation operative 
as a frame of reference, preverbal generalization, 
or set, all of which are usually transphenome- 
nal. 

In assessing the limitations of the formal- 
structural approach of the Gestalt school, the 
question has been posed: If acquired cultural 
meanings, values, and motives become part of 
silent organization, how can they be treated 
from a molar point of view? Taking their lead 
from Tolman’s work, Bruner (1951), and Post- 
man (1951) answer this question by putting 
forth the construct of “hypothesis formation” 
as an intervening variable. The individual's 
predispositions to hypothesize are value-laden 
and operate selectively in terms of value con- 
gruence (or incongruence) of the information 
provided by stimulus cues. Yet, it seems that 
the adoption of values is not really analyzed, 
and motivation is treated only as an experi- 
mentally induced or pre-existing variable. Klein 
and Schlesinger (1949) have criticized this and 
similar approaches for neglecting the person as 
a whole. Such methods isolate motivational 
variables and treat them outside of their de- 
pendency on and embeddedness in the indi- 
vidual, There is a manipulation of motives and 
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values in a piecemeal way, implying a constancy 
hypothesis, which tends to de-emphasize indi- 
vidual differences, Eventually cognitive theory 
will have to agree on a conception of personality 
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and motivation which reconciles the above diffi- 
culties arising from “idiographic” and “nomo- 
thetic" emphases. 


CONATION AND COGNITION 


There is general agreement that man is a 
wanting creature. There is disagreement as to 
whether he is a wanting human or a wanting 
animal, Psychologists who prefer an “animal- 
centric” conception have linked all motivation 
to the mechanism of primary and secondary 
drives. In man no place is given to a genuine 
interest in the understanding of his surround- 
ings or for cognitive commerce with it. Instead, 
knowledge is reduced to learning that consists 
of stimulus-response couplings associated with 
the satisfaction of primary physiological needs. 
Any goal beyond the primary needs, as well as 
the means for its attainment, is derived from 
additional associations. In principle, motives 
such as prestige, love, cooperation, and submis- 
sion are responses which have become instru- 
mentally coupled to stimuli or "cues" during 
the process of gratifying the basic needs. Such 
connections between drives and learned be- 
havior are the S-R alternatives to cognitive 
anticipation and structuring of relevant goals 
and to the insight into means-end relations of 
"what leads to what." Although efforts have 
been made to define reward and punishment 
nonsubjectively and operationally (Thorndike, 
1935) , S-R drive theory, in general, has hedonis- 
tic roots. As Postman (1947) has pointed out, 
hedonism is reflected in the current version of 
the law of effect as "primary reinforcement." 
'This is no longer the classical hedonism in 
which pleasure is the motive. Postman speaks 
of “neohedonism,” where tension or discomfort 
is the motive, while pleasure is the result of ten- 
sion reduction, and it is that relief which leads 
to learning. Mowrer (1946), in support of the 
reinforcement principle, equates pleasure, satis- 
faction, and drive reduction. 

Points of dispute between S-R theorists and 
their opponents have arisen in connection with 
such a drive theory. The issues may be crystal- 
lized by considering the following arguments 
raised by various cognitive theorists. 

1. Are all motives ultimately reducible to a 
number of specific organic needs? Is the con- 
cept of tension reduction an adequate model 
for human motivation? 

2. Is primary or secondary need gratification 
the only condition for the attainment of infor- 
mation or knowledge? 

$. Is the “egocentric conception" of motiva- 


tion, as implied in neohedonism, veridical? Does 
it account for both the subjective and objective 
reference of human goals and emotions as well 
as for their cognitive contents? 

It is obvious that these questions are all inter- 
meshed. Any separation would be artificial. 
For reasons of practicality, however, we shall 
have to examine each of them briefly. 


PRIMARY NEEDS, TENSION REDUCTION, AND 
COGNITION 


The premise that the biological needs have 
primacy in man stems from infrahuman and 
atypical observations. By manipulating depri- 
vation states in animals the existence of inde- 
pendent "viscerogenic" drives or needs is in- 
ferred. This is then applied to man, particu- 
larly to the human infant. The apparent po- 
tency of body needs in infancy is not seen as a 
symptom of immaturity and lack of integration. 
Instead, the drive states and release urges in 
the infant become the paradigm for the motives 
of the mature adult, Outside of Freudian ideol- 
ogy this model may have its origin in a mistaken 
notion of Cannon's (1939) theory of homeosta- 
sis. From his experiments it became apparent 
that different tissue needs can disturb the in- 
ternal equilibrium of the body and lead to 
specific "hungers." If internal self-regulation 
itself does not compensate for these deficiencies, 
then the organism has to restore the homeostatic 
balance by interaction with the external en- 
vironment. 

By making tissue deficiency a Stimulus for 
a motivated Response, behaviorists universal- 
ized such an SR relationship for human striv- 
ings. This follows the schema: S — Specific 
organic need state (Tension or Discomfort) 
—> R — motor activity reducing this state 
(Tension discharge or Pleasure). It has been 
said that such a schema applies only to cyclic 
physiological imbalances and their equalization. 
Goldstein. (1939) as well as Maslow (1943a, 
1943b) have argued that human behavior thus 
actuated is atypical rather than normal. It oc- 
curs when the person is governed by acute or 
chronic deprivation. Such lack or frustration 
can lead to an abnormal isolation of part proc- 
esses which then dominate the organism and 
give the impression of powerful motives. Under 
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normal conditions, there are less severe lacks 
and the person shows integrated, cognitively 
directed, behavior. He does not operate on a 
physiological push-button basis but realizes dif- 
ferent purposes and “cognitive needs” (Maslow, 
1948a, 1948b). The transient nature of manifest 
drives stemming from homeostatic imbalances 
is for Goldstein a cogent argument against ele- 
vating them to the status of primary and perma- 
nent motives. The need for salt, for example, 
cannot be considered a true motive, unless there 
is craving for it because of abnormal deficit. 
Activities connected with homeostatic mainte- 
nance are merely instrumental and subservient 
to the all-encompassing drive of the human 
organism — the optimal realization of his 
capacities. The motives corresponding to these 
capacities have a new quality: they can have 
cognitive content and object reference. Thus 
fulfillment of most organic needs enables the 
human organism to actualize his characteristic 
potentialities. Such need fulfillment can even 
become an avenue of self-expression. Sex grati- 
fication, for example, becomes part of realizing 
a capability for the emotion of love. Both 
Rogers (1951) and Fromm (1947) have given 
the actualization of potentialities a prominent 
place. 

Cannon himself was the first to realize the 
restricted nature of the deficiency approach to 
human motives and emphasized the liberating 
function of homeostasis from the concern with 
the details of bare existence for the activity of 
the higher levels of the nervous system. “The 
organism is released from the limitations im- 
posed by internal and external disturbances . . . 
liberated for its more complicated and socially 
more important tasks . . . free to enjoy beauti- 
ful things, to explore and understand the world 
. . . to develop new ideas and interests” (1939, 
pp. 302, 303, 322 ff.) . 

From such a point of view it would follow 
that when bodily needs are satisfied, human 
motivation cannot come to a standstill. There 
is much truth in the adage: the higher the stand- 
ards of living in a given society, “the more 
people eat in order to work rather than work 
in order to eat.” S-R adherents have explained 
such phenomena by grafting upon the primary 
need reduction secondary rewards of coinciden- 
tal associations, any one of which can become 
a substitute incentive for primary needs. 

A further limiting feature of this derivation 
is that it conceptualizes primary or secondary 
need gratification only in terms of antecedents 
and consequences without considering a phe- 
nomenal representation of the need states and 
the end states (cf. Young, 1952). From a phe- 


nomenological point of view, some writers have 
criticized reinforcement theorists, and Freud 
too, for treating motivation primarily as a nega- 
tive force, i.e, defining it as a tension reducing 
vector. On this point there is convergence 
among authors such as Allport (1946a, 1947) , 
Goldstein (1939, 1940) , Schilder (1942), Hart- 
mann (1951). Each, in his own theoretical 
version, has been prompted to give a prominent 
place to positive aspects of both the setting of 
goals and the consummatory phase of their at- 
tainment. In the first place, the individual 
is not always engaged in avoiding (reducing) 
tension. He is just as often creating tension in 
terms of tasks, interests and long range objec- 
tives. Second, the pleasure component in 
achievement is not altogether a relief from ten- 
sion nor is it a mere end state of a former dis- 
equilibrium. More than this, goal attainment 
is usually a cognitive enrichment which leads 
to a changed, differentiated state of the person 
(Allport, 1946a; Piaget, 1951). This new state 
of the person allows for a widened horizon in 
which new goals, previously not realizable, can 
become envisaged. A comparison between the 
drive reduction tenets of the Freudian or the 
S-R type and the cognitive alternatives cited 
here may be illuminating. In the former, mo- 
tives are conceptualized as reactions to disturb- 
ances resulting in a "state of rest.” In the 
latter, motives are not discharges of tension. 
The person spontaneously creates objects of 
endeavor and activates purposes mediated by 
cognition. Furthermore, the person does not 
remain static but progressively changes with 
each goal attainment. 

The productive possibilities in treating moti- 
vation in this framework may be illustrated by 
the thought-provoking formulations of Schilder 
(1942). He asserts that living organisms, par- 
ticularly the human, have the drive not to 
remain static in “completeness,” but to go “from 
completeness to incompleteness to new com- 
pleteness.” Applied to the sex drive, for ex- 
ample, "the psychic expression of sexuality is 
incompleteness and the necessity for contact 
with another individual. Sex is, therefore, one 
specific expression of the ‘social’ characteristics 
of life” (p. 280). What is completeness in pros- 
pect of a goal becomes incompleteness after 
reaching it, because the person has now become 
more differentiated. One reason for the change 
in the person following attainments is suggested 
by Maslow (1948b), in agreement with the 

* Gestaltists. There is a nonarbitrary relationship 
between the person and the satisfying object, in 
that it has intrinsic appropriateness. This im- 
plies that some contents of the environmental 
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goal interact with the behaver in an objectively 
fitting way. He must acquire an internal repre- 
sentation of both the means and the attained 
end. In turn, this reorganizes the person and 
his life space. Both Woodworth and Hilgard 
have called attention to a motivational factor 
in perception which is virtually antithetical to 
tension reduction. Woodworth's thesis is: 


Perception is always driven by a direct, inherent 
motive which might be called the will to perceive. 
Whatever ulterior motives may be present from 
time to time, this direct perceptual motive is al- 
ways present in any use of the senses. It is impos- 
sible to look without trying to see or to listen 
without trying to hear. To see, to hear — to see 
clearly, to hear distinctly — to make out what it 
is one is seeing or hearing — moment by moment, 
such concrete immediate motives dominate the 
life of relation with the environment. . . . When 
the goal of such a search is attained, strong rein- 
forcement is revealed by the observer's cry of 
satisfaction and later by his excellent retention 
of the discovered figure." (1947, p. 123) 


Correspondingly Hilgard (1951) speaks of 
two goals of perception — one, to attain stabil- 
ity in the world about us, and two, to achieve 
clarity in what we perceive. Granted that these 
perceptual goals are also attended by satisfac- 
tion and may be embedded in wider ends of 
the person, they are quite remote from any 
association with drive reduction. 

The emphasis on the cognitive facets of moti- 
vation is thus clearly in opposition to primary 
and secondary drive theories. One may epito- 
mize the various arguments in this reasoning: 
the more the human organism is freed from the 
immediate urgencies of tissue hungers, the more 
potent do the inherently human and social 
potentials in motivation become. Thus, Maslow 
(1948a) has proposed a hierarchy of “higher” 
and "lower" needs in which, for example, the 
“living at a higher need level" leads to even 
greater biological efficiency and mental health. 
The fulfillment of the higher needs is closer to 
the actualization of one's capacities than is the 
gratification of lower needs. 

In contrasting drive reduction with capacity 
fulfillment, we have mainly cited the following 
points: the individual strives for cognized ob- 
jects and goals, not for satisfaction itself; having 
attained a specific goal, he sets new ones for 
himself. Nothing has as yet been mentioned 
about motivations which transcend the phe- 
nomenal field or goals which are not represented, 
in it — nonconscious motives. There is actually 
no provision for a distinction between conscious 
representation and unconscious motivation in 


drive reduction theories of the S-R type. As for 
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the Freudian drive-reduction theory, uncon- 
scious longrange persistent strivings in the 
person are converted into repetitions of child- 
hood frustrations. Orientation towards the 
future becomes re-enactment of the past. 
Granted that clinically such motivations are 
often reported, it should not detract from the 
equally legitimate occurrence of future oriented 
behavior. Allport (1947) has given this prob- 
lem recognition with his concept of "intention" 
as a value-directed activity. An intention is a 
guiding idea which may be in or outside of 
awareness. "In my use of the term I do not 
confine intention to reportable purpose. Some- 
times the essential direction of an intention is 
understood well enough by the subject, some- 
times not. If the term is taken to mean both the 
understood and the nonunderstood direction of 
an act... it can serve as a proper designation 
for ‘ultimate’ motives, not merely for proximate 
or rationalized motives" (p. 187n). Allport dis- 
tinguishes intention from "anticipatory goal 
reaction" as well as from "expectancy" because: 


Infant and rodent have immediate goals and 
indulge in anticipatory goal reactions but have 
no directive schemata . . . Men often have values 
without having any specific goal in mind. They 
may have a consistent direction of striving, but 
their goals are either trans ™ or else undefinable. 
All of a rat's, but only digniall bit of human, 
behavior can be characterrled in terms of concrete 
goals whose attainment will de-tension specific 
drives. For the most part, the course of man's 
behavior runs according to certain schemata, or 
in prolonged channels. Only now and then are 
these channels marked by lights or buoys that 
represent specific goals... We may know a 
person's expectancies and even his past rewards, 
and yet be singularly unable to predict or con- 
trol his future behavior, unless at the same time 
we know also his basic intentions, which are by 
no means a stencilled copy of his previous expec- 
tancies and rewards. (p. 184 ff) 


This approach embodies the positive direc- 
tion of motivation and at the same time leaves 
room for its internal representation, which may 
or may not be in direct experience. Allport's 
conception of directive schemata encompasses 
these factors, It is noteworthy that his formu- 
lation connects a motivational vector with silent 
organization. 

Essentially "intention" means “what the per- 
son is trying to do" whether in or out of aware- 
ness. In certain respects we have a supplement 
to this in Klein's (1952) analysis of how the 
person does what he is trying to do. Extending 
his former work on personal styles of the per- 
ceiver, Klein experimentally explored the re- 


COGNITIVE THEORY 


lationship between need tension and cognition. 
He focused on the manner in which the ego 
handles needs, and he used thirst conditions 
as a paradigm for need arousal and ego proces- 
ses: S's were given, among other tasks, a number 
of perceptual problems, when thirsty and when 
sated. As one function of the ego structure the 
experimenter distinguished “cognitive atti- 
tudes” which have regulative and directional 
properties in the control and expression of 
needs. Briefly, his thesis is that people are or- 
ganized differently to deal with needs in that 
they have individual styles of cognitively chan- 
neling them. 

Klein does not equate cognitive controls with 
defense, which for him is not the prototype of 
all control. Defenses are modes of resolving 
conflicts among needs, whereas cognitive atti- 
tudes might even be called “conflict-free styles 
of regulation.” This conclusion follows from 
his experimental findings. The cognitive styles 
tapped in his S's also influenced their perceptual 
structuring of situations where quite "neutral" 
stimuli were involved. Furthermore, the selec- 
tive effect of cognitive attitudes on the S's 
responses was not linked to “‘satisfaction-giving 
objects.” Cognitive attitudes are adaptive in the 
sense that they determine the manner in which 
needs are expressed in conformance with the 
task, ie. reality, but they are not necessarily 
tension reducing. As a matter of fact, one group 
of S's actually was more accurate in its perform- 
ance of tachistoscopic recognition under need- 
tension than when sated. While thirst “lamed 
the overcontrolled" S's, it seemed to “wake up" 
the opposite type of S's, whose overall errors 
decreased in the various tasks. Instead of deal- 
ing with specific contents, as needs or defenses 
do, the cognitive attitudes express themselves 
in formal qualities of behavior, e.g., "the par- 
ticular way that the stimulus field is organized." 
Cognitive controls are indicated by such proc- 
esses as "schematizing or conceptualizing be- 
havior, mapping, absolute and differential 
thresholds, anticipation, attention, concentra- 
tion, etc." In the light of Klein's formulations, 
certain of the experiments on the influence of 
needs upon perception may take on a new look. 
It is his contention that by treating need in- 
tensity as an independent variable and a per- 
ceptual response as a dependent variable, the 
ego structure is overlooked. The contradictory 
reports about the direction of cognition under 
the influence of need states is taken by Klein as 
support for his claim that cognitive attitudes 
are "intervening variables." And these are 
necessary for an explanation of the individual 
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differences and consistencies in such experi- 
ments. 

It is true that the "new look" experiments 
have revived interest in the interrelatedness of 
motivation and cognition. However, they have 
sometimes oversimplified that relationship. 
Cognition was dealt with as if it were object per- 
ception along an external stimulus dimension 
correlated to a motive along an internal stimu- 
lus dimension. The experimentally induced 
distortions are then usually obtained under 
reduced or equivocal stimulus presentations. 
Since the deviations are rather small and incon- 
sistent under these conditions, generalizations 
about a one-to-one relation between need states 
and sensory perception may be premature. Even 
such discrepancies as found in the coin-size 
experiments do not really shatter the veridical- 
ity of ordinary object perception. It has been 
further questioned whether the “vise-like grip” 
of needs significantly distorts our percepts 
(Pratt, 1950; Luchins, 1951; Gibson, 1951). The 
use of equivocal stimulus conditions for tapping 
the personological aspects of perception seems 
to have been more profitable in settings such 
as the following: (1) When the consistency of 
intraorganismic processes of the person's func- 
tioning is investigated rather than a direct 
motive-percept correlation (Werner and Wap- 
ner, 1952; Klein, 1952; Witkin, 1953). (2) 
When the values that are pertinent to a person 
(including self-regard) are studied in relation 
to stimulus material which belongs to the realm 
of learned meanings, symbols, and cognitive con- 
tents (Postman, Bruner, and McGinnies, 1948; 
Postman and Schneider, 1951). 


REINFORCEMENT AND COGNITION 


The doubts cast on primary drive-and-tension- 
reduction as a suitable model for motivation 
necessarily implicate the reinforcement prin- 
ciple. The well-known controversy on latent 
learning is a further expression of this state of 
affairs. The question is whether reinforcement 
is the sole condition for attaining knowledge. 
Both the defenders and critics of latent learning 
have tried to decide the issue, chiefly through 
animal experimentation, Typically, rats first 
explore a maze without getting food reward and 
are then tested for learning this maze with re- 
ward introduced. Since they are superior to a 
control group which had not previously ex- 
plored that maze, advocates of learning sans 
reward see this as a support for their theory. 
Since Tolman's students introduced this method 
as a demonstration of cognitive mapping with- 
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out primary reinforcement, numerous varia- 
tions of the experimental design have been 
carried out, with varied results. 

In evaluating the experimental evidence for 
latent learning, Hilgard (1948) concludes that 
it is “severely critical of the reinforcement prin- 
ciple.” Thistlethwaite reviewed the research in 
this area and accepted a number of experiments 
as “constituting exceptions to the alleged indis- 
pensability of reinforcement in the learning 
process" (1951, p. 125) . The issue has, however, 
not been closed, as the continued debate over 
concealed: variables of noncognitive nature at- 
tests. Thus most investigators agree on the 
superiority of the performance by "latent learn- 
ers," but they disagree as to the drive and re- 
sponse variables involved in the learning process 
itself. 

Tolman’s latest stand on the matter is that 
latent learning is ushered in by a motive, 
namely, a "curiosity" or "placing need." New 
"field relationships" are learned through a mo- 
tivation which is a “relatively ‘disinterested’ one 
provided by a general exploratory need" (1951, 
p. 304). The gratification of such an explora- 
tory motive during latent learning seems to 
parallel the satisfaction which Woodworth and 
Hilgard ascribe to the attainment of clarity and 
stability in perception. For Tolman, such dis- 
interested cognitive mapping can become a 
vehicle for a specifically acute need aroused at 
a later time. 

Notwithstanding the admirable ingenuity 
which has gone into the designing of crucial 
experiments in this area, psychologists have not 
sought to resolve the entire issue of reinforce- 
ment on phyletically higher levels. This would 
entail the difficulty, if not impracticality, of 
testing for reinforcement in man by primary 
need deprivation in an experimentally "pure" 
way. After all, inexperienced human infants 
have never been deprived in learning experi- 
ments, as have been rats. Therefore the general 
application of primary reinforcement to the 
human level is not operational but speculative 
in method. In this regard Harlow's (1953) ex- 
perimental critique of reinforcement deserves 
attention. If primary drive reduction is not a 
necessary factor in the learning of monkeys, 
then reinforcement becomes questionable as 
the sole principle of learning in man. Harlow's 
data indicate: rhesus monkeys learn efficiently 
even though they were fed previously and 
munched stored food during both right and 
wrong responses; they solved complex mechani- 
cal problems spontaneously with no other in- 
centive than the presence of the puzzle; 
investigatory motives can be more potent than 
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hunger tension. As an alternative to primary 
drive-reduction theory, Harlow offers the hy- 
pothesis that responsiveness to external stimulus 
patterns is decisive for motivated learning and 
increases in importance with the complexity of 
organisms. The key to human learning for 
Harlow is not the conditioned response. He 
asks "can anyone seriously believe that the 
insatiable curiosity-investigatory motivation of 
the child is a second-order or derived drive con- 
ditioned upon hunger or sex or any other in- 
ternal drive?” (p. 29). 


SECONDARY REINFORCEMENT AND COGNITION 


The preceding discussion has alluded to the 
principle of secondary reinforcement. It has 
been applied to processes of socialization and 
the acquisition of attitudes, The radical dif- 
ference between cognitive and SR approaches 
to "conditioning" of likes and dislikes may be 
illustrated by three typical instances. 


l. Among the classical studies is M. C. Jones' 
(1924) experiment on the "de-conditioning" of the 
boy, Peter's, morbid fear of rabbits. The schema of 
conditioning the food response to the rabbit is still 
being diagrammed in many textbooks as a paradigm 
of secondary reward learning and stimulus-response 
substitution. In the "direct conditioning" the ex- 
perimenter brought the rabbit as close as she could 
without interfering with Peter's eating. After several 
sessions, Peter was apparently cured, explaining, “I 
like the rabbit." Concerning the substitution theory, 
Tolman has satirically wondered why Peter does not 
now eat the rabbit. Hence a joy-overfood response 
has been "substituted" for the food response itself. 
Even this hypothesis raises questions: Has the origi- 
nal response remained the same and an additional 
response merely become attached to it? Will Peter 
henceforth make one response to "dangerous rabbit" 
plus a more potent joy-overfood response? Along 
the lines suggested by Tolman (1932) it seems pos- 
sible, however, that the original stimulus changed 
from dangerous rabbit to harmless rabbit. If this 
be true, then the role of the eating pleasure was not 
that of a pacemaker but of a peacemaker in à cog- 
nitive restructuring. The joy-over-food was a pleas- 
ant distraction during the rabbit presentations, which 
provided an opportunity to discover that the rabbit 
was harmless. 

This discussion, however, is only based upon the 
idealized version of the Jones’ experiment. Her 
careful protocols reveal important facts frequently 
overlooked. The entire training lasted several 
months, The first seven sessions took place in the 
presence of three other children who “were selected 
carefully because of their entirely fearless attitude 
towards the rabbit" (p. $10). "Peter progressed from 
a great fear of the rabbit to a tranquil indifference 
and even a voluntary pat on the rabbit's back” (p- 
$12). After two months” interruption, his fear re- 
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appeared when a dog frightened both Peter and his 
nurse. Now the method of “direct conditioning” 
was introduced and “occasionally also, other children 
were brought in to help with the 'unconditioning' " 
(p. 318). Jones noted most improvement on four 
occasions. In these, either other children or an 
assistant whom Peter called "Papa" were present. 
The first improvement was recorded after Peter had 
followed and watched a child playing with the rab- 
bit, the second when the rabbit was given to another 
child and Peter wanted it on his lap. These data con- 
traindicate a mere substitution of stimuli or re- 
sponses. The variable, joy-over-food response was 
contaminated by the variable of perceiving fearless 
and unharmed children playing with the “danger- 
ous" rabbit. The introduction of such social inter- 
action makes it doubly necessary to consider 
cognitive factors in Peter's loss of fear. This, of 
course, could be answered by stipulating that he 
learned through "imitation." 

2. In studying imitation in children, Miller and 
Dollard (1941) on the one hand and Asch with co- 
workers (1952) on the other have employed similar 
experimental designs. Both tested how children fol- 
low a leader who found candy in one of two boxes. 
The only difference was that in the experiments 
reported by Asch the "followers" were interrogated 
after each trial ("How did you know which box to 
go to?"). There are important contrasts in the 
respective conclusions. For Miller and Dollard the 
leading child is the cue for the followers response 
of finding the rewarding candy in one of the two 
boxes. By prearranged design the follower always 
found the candy either in the box identical to or 
opposite to the leader's choice. In the identical 
choice the leader is a positive cue for reward; in 
the "opposite" choice the leader is a negative cue. 
No knowledge of the why or wherefore is assumed 
in this learning, and since the prearrangement was 
not known to the followers, conditioning by associa- 
tion without understanding appears to be operative. 
This principle is then universalized for all imitation 
in leader-following (1941, p. 97). Asch, however, 
using the verbal protocols, found that the followers 
think. They make "assumptions that serve as hy- 
potheses, which they formulate, test, revise, reject, 
or confirm" (p. 394); they attempt to reach an 
understanding of the hidden system. When the 
candy in the box was replaced by a rubber ball which 
had to be returned after each trial, the same, if not 
more serious, efforts on the part of the children were 
observed. They apparently were more interested in 
finding a rubber ball not-for-keeps than a candy-for- 
eats. 

An intriguing question arises out of Miller and 
Dollard’s data. Why did the children average only 
one-fourth of the errors when they learned to go to 
the “opposite” as compared to the "identical" box? 
And why did 77% of all children go to the box oppo- 
site the leader's choice in the first trial in all cases? 
Could we not say they reasoned by generalizing from 
past life experiences: “One candy has already been 
taken away by the child ahead of me from this box; 
the candy for me will be in the other box?" 


119 


In comparing the S-R. with the cognitive interpre- 
tation of imitation, Asch criticizes the concept of 
"cue." It is insufficient to call a perceived action a 
cue. One has to state the perceptual conditions un- 
der which certain parts of the action and not others 
are singled out as relevant. This reintroduces the 
factor of social perception and phenomenal causal- 
ity. It also points to the reasoning necessary for 
learning from another person's experiences. This 
applies to the case of Peter as well as to the above 
case, where learning from the leader's reward (which 
is not one's own reward) has to occur. 

3. In the area of attitude formation, cognition 
advocates have increasingly stressed the role of 
central and representational processes. Both McLeod 
and Asch, for example, concur that the ideational 
content of an attitude has to be included in psycho- 
logical analysis. Asch speaks of attitudes as "an or- 
ganization of experiences and data with reference to 
an object" (1952, p. 580). McLeod asserts that an 
attitude has both subjective and objective compo- 
nents. "If we are to characterize adequately an 
attitude towards 'religion, we must specify both 
what is objectively there as symbolized by the term 
‘religion’ and what is subjectively felt in relation to 
that objective structure" (1947, pp. 202-209). More- 
over, attitudes are conceived as value directions 
which are organizationally related to each other in a 
systematic way. Following the reasoning of G. W. 
Allport (1935), critical reservations have been made 
about treating attitudes as particularized responses 
to specific stimuli. Thus McLeod cautions against 
the atomistic tendency to hypothesize a separately 
measurable attitude in correspondence to each name 
which designates a social object, such as race, class, 
nation, etc. 


The aforementioned comments together with 
the recent formulations of Adams (1953) and 
Heider (1953) converge on two characteristics 
of attitude. Cognitively, it organizes and orders 
certain events into a unified view, a representa- 
tional structure. The emotional or subjective 
nature of attitude consists in its functional sig- 
nificance for the person in supporting a stable 
relationship between him and his social field. 
In this role it is interdependent with other atti- 
tudes, forming a system. It is for this reason 
that the attempt to influence attitudes by deal- 
ing with any one of them singly often meets 
with such strong resistance. A change in one 
part of the system “threatens” the entire system, 
so that the resistance of a particular attitude 
derives its strength actually from the whole 
structure of which it is a part. 

In contrast, reinforcement theorists have tried 
to deal with attitudes as specific responses which 
do not differ essentially from any other S-R 
connections that are either closely or indirectly 
associated with reward. Miller and Dollard 
(1941) as well as Doob (1947) are outspoken 
exponents of this trend. Attitude becomes a 
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learned cue which is tension producing and 
evokes responses as anticipatory goal reactions 
which lead to tension reduction or reward. This 
view is reflected in such statements as “Most 
people learn to love those persons, objects, 
places, and conditions which are associated with 
less hunger, less thirst, less pain and fatigue 
or more money and prestige” (May, 1944, p. 
51). We are dealing here with an S-R chain 
of responses in which each response evokes fur- 
ther acts that bring about a reinforcing state 
of affairs. It is true that the origin of many 
motives has been located in the learning of 
subgoal activities while pursuing primary need 
gratification. This does not dispose, however, of 
the role of cognitive processes and changes in 
such learning. For example, that a love attach- 
ment to the mother accrues in the child from 
the discomfort relief by the mother’s nursing is 
a common tenet advanced in this context. Even 
if true, it is difficult to accept the assumption 
that this love attachment remains unmodified 
by cognitions in the maturing individual. 
Otherwise the reversal from the infantile atti- 
tude of loving mother because-she-nurtures-me 
to the adult attitude of nurturing mother be- 
cause-she-needs-me is hard to explain, especially 
when the average adult is not wholly uncritical 
of the average mother. The theory also conflicts 
with the adoption of attitudes which have not 
been rewarded in the past and do not offer re- 
ward in the future, as in the case of conscien- 
tious objectors and other nonconformists. 

Upon closer inspection, the concept of secon- 
dary reinforcement appears to be a rather weak, 
if not elusive, link. There is too wide a gap 
from the presumed S-R bonds during primary 
drive reduction to the more complex, remote 
S-R chains which involve the learning of sub- 
goals and their development into self-sustained 
social motives, 

Whereas G. W. Allport tried to solve the 
problem of an increasing independence of for- 
mer subgoal motives by his concept of "func- 
tional autonomy," Hull tried to replace both 
cognition and motivation by contiguity in asso- 
ciation. Hilgard's judgment is: 


The integration of behavior choices remote 
from a tension-reducing goal object is usually 
accounted for on the basis of some sort of in- 
ternalized stimulus-response sequences, supported 
by secondary reinforcements. Hull . . . finds it 
necessary to assert that secondary reinforcing 
agents need not bring about tension reduction in 
order to be reinforcing. This is almost a com- 
plete concession to cognitive control, and weakens 
greatly the centrality of the reinforcement prin- 
ciple. In describing the essential similarities be- 
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tween secondary reinforcement theory and cog- 
nitive theory it is clearer to state that Hull 
reformulates the cognitive theory than to assert 
that the cognitive theory is just another way of 
talking about secondary reinforcement. (1948, 
p. 345) 


MOTIVES, EGO, AND COGNITION 


It may be said that the doctrine of drive re- 
duction embraces hedonism in the form of ten- 
sion release, while the same doctrine rarely 
postulates an ego that can "enjoy" its need 
reductions. In turn, if an ego or self is posited, 
personality is viewed as being governed by 
self-interest qua tension release. For example, 
in orthodox Freudian thinking, cognition be- 
comes essentially subservient to self-centered 
motives, and genuine motives cannot be induced 
by cognitions. 

The state of affairs may be illustrated by the 
controversy between Allport (1946a) and Mow- 
rer (1946) on the ego and the law of effect. 
Mowrer recognizes the necessity for an ego by 
postulating ego processes in reward learning, 
but would like to translate ego processes into 
“emotional arousal.” Emotional reactions as 
well as “ ‘interests’ are secondary drives and . . . 
as such they can both motivate and reinforce 
behavior, in much the same fashion that pri- 
mary drives do" (p. 327). Thus, interests are 
"covert" emotional responses and are learned 
in essentially the same way as other responses. 
In contrast, Allport considers the ego a funda- 
mental construct for motivation. Motivation 
in learning consists of interests which belong 
to the ego structure. Reward becomes just one 
of many factors for perceiving the consequences 
of an act and for applying that knowledge in 
the future when a similar act or situation occurs. 
Satisfaction or dissatisfaction may at best serve 
as indicators of success, And success is always 
evaluated according to its relevance for the in- 
terests that the ego pursues. In attacking the 
universality of the law of effect and defending 
an antihedonistic conception of learning 
through interests, Allport presents these argu- 
ments: there is no evidence for self-administered 
“covert” reward (ie, "internal O. K. reac 
tions"). The inference of such reward is based 
on a circular reasoning — because a person 
persists in his activity it is concluded that he 
has been rewarded. "Interests often persist in 
the face of continuous dissatisfaction or failure" 
(p. 546). The tenor of Allport's reasoning sug- 
gests that the concept of interest is related to 
that of intention; both constructs share the 
openness of a general directional system, as 
distinct from behavior guided by a specific goal 
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channel. Interests seem to embody two com- 
ponents: a cognitive relationship of the person 
to the environment, and a subjective relation- 
ship to his ego values. The interests are more 
central to the person and therefore lead to his 
active participation in pertinent activities and 
contents. Interest satisfaction in learning is not 
predominantly “emotional” in the sense of ten- 
sion reduction. "Learning proceeds because it is 
relevant to an interest-system: it adds to knowl- 
edge, it differentiates items within the system, 
it broadens the range of equivalent stimuli" (p. 
346). It is therefore the person who learns, 
who is rewarded, and not a single response or 
concatenation of responses; and it is the person 
who utilizes the reward as a cue for his progress 
in the learning activity. 

Once it is accepted that an ego construct is 
necessary to account for both cognitive and mo- 
tivational directions, then the relationship be- 
tween motivation and cognition takes on a new 
aspect. One may now deal with this relationship 
"from above" rather than "from below." In- 
stead of postulating need pushes which drive 
the ego, one can conceive of a less direct, holisti- 
cally mediated determination of behavior. Mo- 
tivation may be mediated via reference to the 
ego and via evaluation by the ego. It follows 
that such mediation has to take place centrally 
and involves internal representation. One of 
the outstanding contributions in this direction 
has been made by Rogers (1951). The point 
of departure for this new emphasis may be 
typified in the following statement: “The only 
channels by which needs may be satisfied. are 
those which are consistent with the organized 
concept of self" (p. 508). 

Basic to his theory is the cognitive relation- 
ship between the organism and the environ- 
ment. Rogers considers an S-R explanation of 
behavior inadequate. Instead, he asserts that 
the person reacts as an organized whole within 
his phenomenal field, which in turn is a per- 
ceptual "map" of the outside world and himself 
in it. He posits figure-ground organization for 
this field. There exists also a wider part of ex- 
periences, connected with the internal and 
external environment, which is outside of aware- 
ness and comparable to the behavioral environ- 
ment of the Gestaltists. The central theoretical 
construct. is the self-concept of the individual. 
Although Rogers elaborates a more precise defi- 
nition of the self, in a general way the self may 
be thought of as all the things the person 
consciously thinks he is. The term “self” there- 
fore is not synonymous with organism, but is 
used to denote "the awareness of being, of func- 


tioning” (p. 498). 
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The structure of the self is formed “as a result 
of the interaction with the environment, par- 
ticularly as a result of evaluational interaction 
with others"; it is “an organized, fluid, but con- 
sistent conceptual pattern of perceptions, of 
characteristics, and relationships of the ‘I’ or the 
‘me,’ together with values attached to these con- 
cepts” (p.498). There are two ways in which 
this process occurs. One way is through "'direct 
organismic valuing,” i.e., through the experi- 
ences which the individual absorbs from his own 
sensory and visceral evidence. This is raised to 
the level of symbolization by attributing the 
experience to himself. For example, there is the 
dawning awareness of the infant, though non- 
verbal, of “I like,” “This is cold and I dislike 
it.” Such direct experiences become part of the 
concept of self. A second way is the introjection 
of experiences or values of others as if they were 
one’s own. This occurs in an effort to keep new 
experiences consistent with the already existing 
self-concept, and is accomplished by distorting 
the meaning or symbolization of the experience. 


For example: A child feels loved by his parents 
because of all the direct evidence which comes from 
being cuddled, talked to, etc. But then the parents 
reprimand the child when he hits his baby brother 
and he is made to feel, “You are not loved when 
you behave this way.” In order to preserve the 
threatened concept of the self as lovable, the child 
denies to awareness the satisfaction that is experi- 
enced in the aggressive act. In place of a more accu- 
rate symbolization of the anger expression as 
satisfying, the experience becomes distorted into the 
symbolization “bad,” “I perceive this behavior as 
unsatisfying to me,” “I do not like to hit baby 
brother." The concept of the self now includes "I 
am lovable,” and the distorted symbolizations "I like 
baby brother,” “anger is bad.” 


Akin to the Gestalt concept of fittingness, the 
principle of congruence or consistency with the 
self-concept is therefore a controlling factor of 
behavior. As an example, Rogers notes the man 
who, possessing a self-concept of honesty, cannot 
seek achievement through means which seem to 
him dishonest. The same tendency can lead to 
distorted symbolization, as in the case of the 
hitherto brave pilot, who, overcome by fear, 
cannot simply flee, as this would be inconsistent - 
with his self-portrait of a fearless person. In- 
stead he falls ill and has to be excused from the 
mission. In cases where the person realizes that 
a behavior is inconsistent with the self-concept, 

sometimes happens when the behavior is 
“forced” by the pressure of strong needs, then 
it is felt as not "owned" by the individual — "I 
really wasn't responsible for what I was doing” 


(p. 509). 
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The representational map of one's self and of 
the environment is best understood from the 
internal frame of reference of the individual 
himself. He is considered the center of his ex- 
perienced, “private” world or psychological 
reality. This reality, which corresponds to the 
behavioral environment, can be responded to in 
one of three ways: “(a) symbolized, perceived, 
and organized into some relationship to the self, 
(b) ignored because there is no perceived rela- 
tionship to the self structure, (c) denied sym- 
bolization or given a distorted symbolization 
because the experience is inconsistent with the 
structure of the self” (p. 503). In his concept 
of psychological reality as dependent on the 
frame of reference of the individual, Rogers 
does not go as far as Snygg and Combs (1949) , 
who dispense with reference to an organism or 
to neural processes in relation to external 
stimuli. The individual can test out percep- 
tions which arise from the interplay between 
external reality and neural processes (p. 486). 
The total psychological reality therefore con- 
sists, for Rogers, of a combination of testable 
and untestable “hypotheses” or cognitions. 

A certain limitation in Rogers thinking 
seems to stem from his tendency to make the 
individual the center of a "private world." One 
can readily see that in a therapeutic orienta- 
tion, understanding the client as being the 
center of his world is one of the essential vehicles 
for clinical action. To form a general theory 
of behavioral reality around this "center" may, 
however, invite an unintended, mistaken em- 
phasis on ego-centered motivation. 

The notion that not all motives or emotions 
have ego reference has found increased back- 
ing (Koffka, 1935; Kóhler, 1938; Allport, 1943, 
1945; MacLeod, 1947, 1949; Asch, 1952). A 
typical statement is one by MacLeod. "If we 
try to envisage all motivation as self-generated 
we are doing violence to some of the more ob- 
vious facts of observation. The vectors may 
emerge from the object just as validly as from 
the self. The self may be remodeled to meet 
the demands of the object just as truly as the 
object may be shaped to fit the needs of the 
self" (1949, p. 65). As an experimental illus- 
tration of what the above writers have in mind, 
we may take H. B. Lewis’ (1944) findings that 
“my” tensions towards task completions, when 
interrupted, can be resolved if a teammate com- 
pletes them. The suggested distinction between 
task and ego orientation opens the possibility 
for dealing with group-centered and other- 
centered behavior as being intrinsically moti- 
vated. No ad hoc or post hoc hypothesized 
mechanisms of ego expansion, identification, 
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etc., would be necessary to account for behavior 
that transcends the orbit of ego-centered moti- 
vation. 

Task orientation as a nonegoreferred direc- 
tion of motivation seems to demonstrate the 
"openness" of the ego to the world, a perceptual 
and cognitive grasp of environmental contents. 
From this point of view, McLeod's proposal 
warrants attention: "what we call motives are 
systems polarized in the dimension of subjec- 
tivity-objectivity" (1949, p. 65). 'The conse- 
quences of such a definition would be that, on 
either pole, the subjective or the objective, 
emotion and cognition are inseparable. This 
raises the question whether the traditional 
separation between conation and cognition, or 
the customary interaction theory, gives a realis- 
tic picture of behavior. Most cognitive theorists 
have moved towards a more unified treatment of 
these functions. "This was already implied in 
the rationale of such concepts as interests, in- 
tentions, cognitive attitudes, self-concept, etc. 
Representational and ideational characteristics 
have been attributed to conations just as much 
as motivational direction and emotional quality 
have been claimed for cognition. Thus, writers 
on this subject nearly all see an inherent con- 
nection between striving, feeling, and knowing. 
(Take, for example, Tolman’s concept of “be- 
lief-value-matrix,” 1951.) Yet the problem of 
how to conceptualize this unity and how to 
demonstrate it is still unresolved. 

The most promising avenue of solution seems 
to come from a phenomenological analysis of 
emotions which is followed up by an empirical 
search for the underlying intraorganismic proc 
esses. Emotions may well turn out to be the 
unique expressions of the unity of cognitive and 
conative functioning. The positive and non- 
arbitrary nature of emotions, as against their 
disorganizing and disrupting influence, has been 
emphasized in recent writings (Leeper, 1948; 
Goldstein, 1951). The most poignant formula- 
tion is given by Angyal, who defines “the feel- 
ing tone of emotions as the experience of the 
state and of the situation of the person under 
the aspect of value" (1941, p. 71). It appears 
that any experience or act of valuing presup- 
poses a cognitive-emotional-motivational ma- 
trix. A stimulating exploration of this matrix 
can be found in Asch's treatment of emotions 
(1952, p. 79 ff.; 109 f£). To have an emotion 
is to perceive agents acting upon us or others, 
ie, to apprehend causal relations. Emotions 
are, as a rule, directed towards their perceived 
source — persons or objects. We are angry at or 
because of, sorry about, happy over, grieving 
for, and so on. We seem to "know" with what 
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emotion to respond to a situation, i.e., emotions 
usually have an appropriateness about them. 
Emotions are the way we react directly to the re- 
lations between given conditions and ourselves. 
"They can represent to us the fate of our goals; 
they are immediate cognitions of the state of 
affairs of our motives. Once these propositions 
are accepted, our perception of emotions in 
others can represent to us their intentions and 
motives in an immediate, understood way. It 
follows that we can emotionally participate in 
other people's fate. Asch's contention that emo- 
tions have a cognitive basis becomes crucial on 
the level where the individual has developed the 
idea of a self. This leads to the generation of 
new and more complex emotions, all hinging 
on the conceived relation between the idea of 
myself and others. Such human emotions as 
pride, envy, gratitude, shame, jealousy, indig- 
nation, remorse, embarrassment can now be 
actualized. Asch makes the interesting point 
that all these emotions presuppose past experi- 
ences and a knowledge of the contents of 
situations. However, the emotions themselves 
cannot really be learned or taught. The person 
has to "generate" the emotion himself on the 
basis of what he has learned as well as under- 
stood about the relevant factors which call for 
gratitude, remorse, and so on. "There are no 
*unconditioned stimuli' to remorse" as there 
perhaps are to fear (p. 81). Hence there can be 
no conditioning to remorse. Remorse, like 
gratitude, goes hand in hand with cognition. 
Similar reasoning has been applied to the emo- 
tions of shame and embarrassment by Buyten- 
dijk (1950) and Nuttin (1950) independently. 

Other phenomenologically oriented thinkers 
have emphasized the object reference of emo- 
tions, as in the case of “physiognomic” experi- 
ences: the feeling of the mood of a landscape, 
its friendliness or unfriendliness. These and 
similar object-oriented emotions may also lead 
to motivation. For example, the individual may 
feel prompted to take action because he is 
“moved” by the experience of pity, sympathy, 
indignation, injustice, etc., in reference to a 
person or a cause (MacLeod, 1947, 1951; Kohler, 
1938) . 

In principle, then, behavior may be con- 
ceptualized as being embedded in a cognitive- 
emotional-motivational matrix in which no true 
separation is possible. No matter how we slice 
behavior, the ingredients of motivation-emo- 
tion-cognition are present in one order or an- 
other. As Adams succinctly expressed it, "For 
there is a conative or dynamic component in 
our most disinterested, scientific, and objective 
cognition, or we would not make the observa- 
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tion. There is likewise a cognitive component in 
our blindest lust or rage, or their expressions 
would not have even the minimum of direction 
that they seem invariably to have" (1958, p. 55) . 
This intimate relationship could not escape the 
profundity of William James: 


The contrast is really between two aspects, in 
which all mental facts without exception may be 
taken; their structural aspect, as being subjective, 
and their functional aspect, as being cognitions. 
In the former aspect, the highest as well as the 
lowest is a feeling . . . In the latter aspect, the 
lowest mental fact as well as the highest may 
grasp some bit of truth as its content . . . From 
the cognitive point of view, all mental facts are 
intellections. From the subjective point of view 
all are feelings . . . And then we see that the 
current opposition of Feeling to Knowledge is 
quite a false issue. If every feeling is at the same 
time a bit of knowledge, we ought no longer to 
talk of mental states differing by having more or 
less of the cognitive quality; they only differ in 
knowing more or less, in having much fact or little 
fact for their object. The feeling of a broad 
scheme of relations is a feeling that knows much; 
the feeling of a simple quality is a feeling that 
knows little. But the knowing itself, whether of 
much or of little, has the same essence, and is as 
good knowing in the one case as in the other. 
(James, 1890, Vol. 1, pp. 478-479) 


In dealing with the cognitive-emotional-moti- 
vational matrix, a kind of leitmotif emerges. 
Even when the unity of this matrix is recognized 
it has a polarity of direction. In James’ terms, 
there are the structural (subjective) and func- 
tional (cognitive) aspects; in the terms of the 
Gestaltists and McLeod there is a subjective 
and an objective pole; in the terms here used 
there is an ego reference and an objective ref- 
erence. 

There has been a tendency to deal with the 
subjective pole in terms of the individual 
uniqueness and privacy of the person's experi- 
ence. Conversely, the objective pole has often 
been equated with the public mode of experi- 
ence, for example, in terms of "intersubjective 
protocols" as a basis for science. In a sense this 
difference is also reflected in the methodological 
debate between the idiographic and nomothetic 
schools of thought in psychology. A develop- 
mental approach which examines how the dif- 
ference between the subjective and objective 
poles emerges may throw light on this problem. 
dt may lead to a better understanding of how, 
for example, the awareness of subjectivity and 
self-reference arises together with the segrega- 
tion of the self from its unreflective embedded- 
ness in a social and behavioral surrounding. 
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Coenitive DEVELOPMENT AND GENETIC REDUCTIONISM 


Genetic reductionism takes two forms. The 
first is phylogenetic; the second is ontogenetic. 
In phylogenetic reductionism, evolutionarily 
higher stages, e.g., complex human behavior, are 
treated as if they did not differ fundamentally 
from lower stages. The difference is quantita- 
tive: the higher levels are more complex only 
in the sense of containing more “elements.” 
No qualitative change is attributed to this in- 
crement of complexity. The tendency therefore 
is to consider human cognition and intelligence 
in terms of an increased repertoire in the num- 
ber of associative connections, and not as a new 
function in the kind or direction of thinking. 

In ontogenetic reductionism, similar additive 
mechanisms are posited for human behavior. 
Adult behavior is said to be derived from early 
acquired reactions. The basic response disposi- 
tions are inculcated during childhood. The 
mature personality becomes a “‘stencilled copy" 
of previously conditioned motives which have 
become attached to additional environmental 
stimuli. In principle, the development of the 
individual does not create new motives during 
various genetic stages. Ideational processes — 
when admitted — are relegated to past learning 
rather than to the developmental emergence of 
capacities for reasoning or to new modes of 
learning. The two types of reductionism are 
usually complementary, and the second form, 
the ontogenetic, is often derived from the first, 
the phylogenetic. This relationship is manifest 
in those approaches to cognitive phenomena 
which have the intent of reducing or relating 
them to S-R laws. 


UNIFORM MODES OF LEARNING 


It is probably no overstatement to say that 
behavioristic learning theories have more or 
less taken it for granted that the laws of learn- 
ing are the same for different phylogenetic levels 
and that the same laws hold for different levels 
of human development. “The extreme form of 
generalization from comparative studies is that 
there are no differences, except quantitative 
ones, between the learning of lower animals and 
primates, including man" (Hilgard, 1948, p. 
329). It has been pointed out by Schneirla that 
this reasoning leads to the fiction of under- 
standing the various learning phenomena as 
based on "homogeneous agencies" rather than, 
on "processes of very different nature still re- 
quiring explanation" (1949, p. 248). Moreover, 
human learning is typically not studied in its 
own form but is extrapolated from lower forms 


— particularly rat learning. The importance of 
differences in capacity and kind of learning has 
been pointed out by Werner (1940), Schneirla 
(1949), Harlow (1953), and Tolman (1949). 
Schneirla has criticized Hull's S-R. doctrine for 
its neglect of such differences and also for its 
reliance on the conditioned response as the 
prototype of learning (1949, p. 267). Serious 
doubts about the universality of the rat model 
and the premise of uniform learning mechan- 
isms in different species have also been voiced 
by Birch and Bitterman (1951). Hull (1952), 
in a posthumous publication, states that most of 
the postulates are based on the behavior of 
lower animals, particularly the rat. "Humans 
have the added capacity of speech, symbolic 
behavior, with its accompanying advantages to 
the higher mental processes. Whether this in- 
troduces any primary behavioral laws remains 
to be determined" (p.4). Hilgard remarks that 
“a peculiar twist is sometimes given psychologi- 
cal thinking which takes the form that a process 
in order to be scientifically reputable must be 
demonstrated to occur in lower animals. If it 
occurs only in man, it is excused away because 
man posseses verbal and related abilities" 
(1948, p. 329). We may make the distinction 
between the veterinarian approach to human 
learning which "puts the rat into man" and 
what Hilgard calls the pharmaceutical ap- 
proach. The latter approach tests infrahuman 
specimens only for their comparability with 
human functions. “Only if a process demon- 
strable in human learning can also be demon- 
strated in lower animals is the comparative 
method useful in studying it" (p. 329). Cog- 
nition theorists have more or less implicitly 
followed this rule. Kóhler's (1925) studies on 
insight in animals is a good example. ‘Tolman’s 
position may be considered ambivalent on this 
score, since he "enjoys sitting on the good old 
phylogenetic limb" (1945). Yet, Spence (1944), 
Boring (1953), and others may not quite trust 
him up there because Tolman defines his ex- 
perimental problems “phenomenologically,” as 
attested by his well-known dictum “If I were a 
rat" (1938). He thus seems to "put man into 
the rat." 

In essence, these issues add up to the broader 
and more fundamental question of a functional 
difference between animal and man. S-R theo- 
rists do not consider this difference a serious 
one. Thus N. Miller, in defending the equation 
between the learning of rats and men, uses 
the analogy, "the obvious differences between 
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the bodies of Rita Hayworth and an artichoke” 
do not dispense with the assumption “that the 
same fundamental mechanism of cell-division by 
mitosis is involved” (1951, p. 376). By com- 
parison, Lashley recognizes the same mechanism 
of forming simple associations in most organ- 
isms. Nevertheless, he maintains that what is 
more important are the differences in organiza- 
tional functioning between lower and higher 
animals. “It is not the fact of learning but what 
is learned that differentiates animals in the 
evolutionary scale. The learning of higher ani- 
mals involves the perception of relations which 
is beyond the capacity of the lower" (1949, p. 
30). 


CAPACITY DIFFERENCES BETWEEN MAN AND ANIMAL 


It is noteworthy that biologists, medical scien- 
tists, and comparative psychologists have more 
emphatically emphasized the capacity distinc- 
tion between man and animal than have neo- 
positivists. Phylogenetic reductionism has been 
called in question by such writers as Novikoff 
(1945) and Goldstein (1939) . Novikoff presents 
arguments that organic matter already consti- 
tutes a new level of organization which requires 
biochemical concepts and which is not reducible 
to molecules and atoms. He postulates that the 
evolutionary process proceeds through a series 
of new irreducible "integrative levels.” Novi- 
koff criticizes the elementarism of traditional 
cell theory employed in explaining the biology 
of multicellular organisms because it does not 
account for the new emergent whole of cell 
organization. Just as any organism at any point 
in the phyletic scale represents a new integrative 
level, so it is itself organized into higher and 
lower levels: “a knowledge of the lower levels 
does not enable us to predict a priori what will 
occur at a higher level. Although some may 
have validity for the higher level, laws of a lower 
level are inadequate to describe the higher lev- 
el" (1945, pp. 214-215). This applies particularly 
to man. According to Novikoff, man's biological 
organism is transformed by its existence in 
society. Under such influence the biological 
becomes psychological. This is made possible 
by man's unique biological constitution, permit- 
ting thought, speech, and the capacity for 
planned action. Goldstein (1939) independ- 
ently follows a similar line of thought. He 
challenges the view that one can benefit by 
exploring the functions of “lower” animals for 
an understanding of "higher" forms because 
the lower ones are presumed to be simpler. 
The concept of simplicity applied to lower phyla 
introduces more difficulties than advantages. 
The behavior of simpler animals turns out to 
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be actually more obscure because it introduces 
different variables than those known to and 
understandable for the human investigator. 

The crucial diagnostic features of homo 
sapiens have led G. G. Simpson to join Julian 
Huxley in objecting to the “nothing but” 
fallacy which dealing with man as an animal 
suggests: “It is important to realize that man 
is an animal, but it is even more important to 
realize that the essence of his unique nature lies 
precisely in those characteristics that are not 
shared with any other animal” (1950, p. 283 
ff.). What are these differentiating characteris- 
tics? Herrick asserts that in man new patterns 
of behavior have emerged which are not known 
anywhere else, and he singles out the following: 
“Symbolic thinking is a new kind of function, 
though the steps by which it was fashioned can 
probably be traced” (1926, pp. 350-351). J. 
Huxley, perhaps more than any other biologist, 
has emphasized that every human activity, in- 
cluding eating, sleeping, and mating, is unique, 
so that “man stands alone" in nature (1927). 
Such biological uniqueness is, for Huxley, 
rooted in man's capacity for conceptual thought. 
This potentiality is the condition for the crea- 
tion of language, the utilization of tools, and 
the forming and transmission of tradition. The 
new type of social integration which emerges on 
the human level is considered by Warden (1936) 
to be cultural rather than biosocial because of 
the presence of language. These arguments can 
be multiplied by quotations from other biolo- 
gists and psychologists who have singled out 
the cognitive aspect of symbols as the mediating 
vehicle in the social behavior of man. Further, 
they have maintained, as Schneirla (1946) has 
shown, that there is a psychological difference 
between the pseudosocial organizations on the 
infrahuman level and the societal organization 
of man. Only human society is characterized 
by symbolic communication. 


SYMBOLIC FUNCTION, CONCEPTS, AND 
PROPOSITIONAL LANGUAGE 


The pertinent investigations of symbolic 
function bring into sharp focus the differences 
between the reductionist and cognitive posi- 
tions. At the one pole we find philosophers 
such as Cassirer (1944a) , who defines man as a 
unique “symbolic animal.” He makes the capac- 
ity for representation ‘through concepts, lan- 
guage, myth, and art the criteria for this 
uniqueness. Man no longer deals with things 
themselves but with self-produced symbols of 
things and of signs. Man alone has created sym- 
bolic systems as a new dimension of behavior. 
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This new reality transcends the visible signals 
of the physical world. It is an ideational, arti- 
ficial medium which man has interposed be- 
tween himself and the things of nature. At the 
other pole are those behavior theorists who 
prefer to equate symbols with signals or signs 
which can be learned according to the principle 
of substitution and stimulus generalization in 
conditioned response terms. 

Among cognitive psychologists it was K. 
Bühler (1934) who most clearly formulated 
the problem of a “darstellende sprache" (repre- 
sentational speech) as different from signals and 
sign language. It was left, however, to the 
neurologists’ study of speech impairment fol- 
lowing brain injury to establish empirically the 
importance of representational language (Jack- 
son, 1915; Goldstein, 1939, 1948; Head, 1926) . 
It was Jackson who first pointed to the im- 
portant difference between emotional and 
propositional speech — a distinction adopted 
by Goldstein, Head, and others. Emotional 
speech is confined to the expressions of subjec- 
tive emotional states. Such are the vocalizations 
and cries of animals, the exclamatory human 
utterances, ie., interjections, curse words, and 
expressions of affect. Propositional language 
exists only when speech is used to designate or 
describe objects and situations independently 
from a subjective state or an action context. 
To propositionalize means making "statements 
about," i.e, using language denotatively. The 
line of demarcation between animal and man 
lies in the animal's inability to propositionalize. 
Says Cassirer, “In all the literature on the sub- 
ject there does not seem to be a single conclusive 
proof of the fact that any animal ever made 
the decisive step from subjective to objective, 
from affective to propositional language" 
(19442, p. 30). 

Language development, whether onto- or 
phylogenetic has been frequently “explained” 
as a transformation from the affective language 
of exclamatory interjection to the propositional 
language of naming and communication. Cas- 
sirer argues "that between interjection and word 
there is a chasm wide enough to allow us to say 
that the interjection is the negation of language; 
for interjections are employed only when one 
either cannot or will not speak” (1944a, p. 117) . 
The forming of words has been explained also 
as a fixation of certain babbling sounds already 
in the repertoire of the infant via circular re- 
flexes and rewarded imitation of heard sounds, 
(F. L. Allport, 1924) . Against this view, Gold- 
stein cites the evidence that the baby's babbling 
contains all kinds of sounds, most of which never 
appear in later language and do not exist in 
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the speech of adults in the child’s surroundings. 
Therefore the child has to learn totally new 
sounds for words and exerts great effort in the 
word stage to learn some of those sounds which 
he previously uttered in the babbling (1948, 
p. 35). Goldstein’s conclusions have been inde- 
pendently amplified and supported in a study 
by Lynip (1951) , who used magnetic recording 
and a newly designed sound spectrograph. Ly- 
nip found that during the first year not a single 
vowel or consonant is produced that is compar- 
able to adult vowels and consonants, and even 
at the end of that period the sounds of the infant 
are "only approximately the equivalent of an 
adult sound" (p. 246). This also holds for the 
early imitative period, even when the naive 
listener has the impression of a reasonably close 
imitation. “The infant's vocalizations have no 
more relation to an adult's words than his leg- 
kickings have to a grown-up's genuflexions" (p. 
247). 

The decisive maturational step from babbling 
to communicative speech is the socially moti- 
vated intention to acquire and understand the 
word in a denotative relationship. L. A. White 
(1949) , Lindesmith and Strauss (1949), Vigot- 
sky (1939), and Werner (1940), among many 
others, have insisted that the meaning of the 
word is symbolic. First, the sound patterns come 
to represent something different from what they 
are as mere sounds. Second, the word is an ex- 
pression of generalizing thought which cul- 
minates in genuine concept formation. Concepts 
are psychologically operative when the invariant 
relationship between the properties of an object, 
an action, or an idea is grasped, and when the 
communality of characteristics that is invariant 
can be abstracted from a variety of changing 
aspects. This process in turn makes the word 
a conceptual symbol. The name for an indi- 
vidual object in daily life does not refer to the 
specific uniqueness of the object; the name sig- 
nifies the object as a representative of a category 
— an exemplification of all the possible varia- 
tions allowed for by its invariant characteristics. 
For example, in the word "green" as applied 
to a given green nuance, we cognize this specific 
green as a representative of all possible shades 
of green. Workers in the field of concept forma- 
tion have further emphasized that the abstrac- 
tion of the invariant is a deliberate, isolating 
activity. A general idea is attained by abstract- 
ing from as well as recognizing the differences 
among the given specific variations (Werner, 
1940; Hanfmann and Kasanin, 1942; Goldstein 
and Scheerer, 1941; Scheerer, 1949). It is ob- 
vious that we are here dealing with the most 
complex form of representation, which, strictly 


COGNITIVE THEORY 127 


speaking, consists of a hierarchy of symbols: the 
sound stands for a word, the word stands for a 
conceptual range, and the perceptual object 
stands for a class (E. Heidbreder, 1945a, 1946a, 
b). 

No discussion of word meaning should omit Helen 
Keller's well-known account of her sudden discovery 
that everything has a name. This discovery was not 
just hitting upon a mechanical rule, but an insight 
into the representational function of symbols as a 
general principle. Up to the crucial waterpump 
incident Helen had learned with difficulty to imitate 
the forms of the touch alphabet and to connect them 
with objects touched — very much as association 
theory would depict the learning of words as signs 
or cues. Following her insight, she speedily acquired 
many times more words than she had heretofore 
learned by rote, and spontaneously asked over and 
again for the names of objects she touched. Shortly 
thereafter she had the first “conscious experience of 
an abstract idea," namely, "think" (H. Keller, 1931, 
pp. 21-30). 

Word meanings, however, do not stand alone. 
They are combined in sentence structure, which 
is the mature mode of communication, i.e., 
propositional language. According to Jackson, 
“The unit of speech is not the word but the 
proposition formed by the interaction of all the 
words used” (cf. Head, 1915, p. 10). The inter- 
action is possible only when a relation among 
words obtains which creates a mew meaning. 
And this new meaning is not based upon the 
mere addition of individual word meanings, but 
rather on their organization into a new whole. 
This reasoning substantiates thé view that 
propositional language and syntactical structure 
are complementary. Recently Lashley (1951) 
made a trenchant analysis of the processes which 
underlie the syntactical order of language and 
the serial order of motor behavior. He disputes 
the "associative chain theory" of sequential ac- 
tion, according to which movements are asso- 
ciatively linked together so that one movement 
is the response to the preceding movement and 
the stimulus for the following movement. Ac- 
cording to Lashley, the order of muscle innerva- 
tions in speech or skilled movements (such as 
typing) is not a temporal sequence of direct 
associative connections. The pronunciation of 
the word "right" involves the same motor ele- 
ments as the word "tire" but in reverse order. 
In speech such movement elements occur in so 
many permutations that individual associations 
between each of them is improbable. Corre- 
spondingly, in typing a word, finger strokes oc- 
cur in all sorts of combinations, so that no single 
letter invariably follows another specific letter. 
Whether, for example, following g, "gh, ga, or 
gu is written, depends upon a set for a larger 


unit of action, the word" (1951, p. 116). The 
same holds for the spelling of a word, as well 
as for the relation of a word to the sentence. As 
it sounds, "the word ‘right,’ for example, is 
noun, adjective, adverb, and verb, and has four 
spellings and at least ten meanings" (p. 116). 
Its adequate usage and placement in a sentence 
will depend on the total sentence meaning and 
not on any temporal associations between the 
elements within the word or between this word 
and others. The temporal order must therefore 
be imposed upon the elements of speech by some 
preceding organization. It consists of the set or 
idea to be expressed and of a general schema 
which simultaneously predetermines the tem- 
poral sequence of the specific acts. 

To summarize, propositional language has 
these characteristics. It is conceptual; it is sym- 
bolic; it is not a temporal chain of associations. 
The preconceived schema of the sentence as a 
whole determines the relational meaning be- 
tween the words. Functionally, language is used 
to convey meaning, detached" from a subjective 
perspective and divorced from an immediate 
actionsituation. Language is denotative and 
can signify the "real" as well as the possible, and 
also the "unreal" S. Langer has thus captured 
thé essence of propositional language qua “state- 
ments about": 


Animal speech never has a structure. It is merely 
an emotional response. Apes may greet their 
ration of yams with a shout of "Nga!" But they 
do not say "Nga" between meals. If they could 
talk about their yams instead of just saluting them 
they would be the most primitive man instead of 
the most anthropoid of beasts. They would have 
ideas and tell each other things, true or false . . . 
they would make plans and invent laws and sing 
their own praises as men do. (1949, p. 11) 


ABSTRACT ATTITUDE 


Although there are certain differences in em- 
phasis between the theories of Cassirer, Langer, 
Head, and Goldstein, there is agreement among 
them on this point. Propositional language 
expresses reflective abstraction from the "im- 
mediate" by representing it through symbols. 

The more these conclusions were supported 
by the findings on aphasics, the clearer it became 
that the representational processes which under- 
lie thinking in verbal symbols extend beyond 
the sphere of language proper. Head (1926) 
himself recognized that the acts of more complex 
symbolic formulation require a greater degree 
of abstraction in their execution. His examina- 
tion of aphasics disclosed that the use of lan- 
guage as well as thinking can undergo a 
characteristic change. The patients usually pos- 


128 


sess emotional speech and many speech automa- 
tisms. However, they cannot employ language 
intentfully in the propositional sense. A word 
just uttered in an action context may fail them 
when it is demanded outside of that situation 
“in the abstract.” It is possible for them to ask 
for a cigarette or say, "It is five o'clock," but 
not to name subsequently a cigarette or clock 
upon request. When examined for his con- 
ceptual grasp of words, a patient may respond 
as follows: a knife presented with a fork is 
named "knife"; if presented with an apple, it is 
called an "apple parer"; with a pencil, a pencil 
sharpener, etc. Although he used the seemingly 
general word "knife," the patient actually gave 
an individual usage name each time, unable to 
say "knife" in a “categorical” sense (Goldstein, 
1940). Another patient "does not understand 
the word ‘manager’ used as a generalization but 
can use it correctly in designating the man he 
worked for" (Werner, 1948, p. 274). It further 
turned out that cortical dysfunction, with or 
without aphasia, tends to impair a range of per- 
formances. The patients may be unable to 
demonstrate an action out of context, e.g. 
threatening without being angry, or to repeat 
untruths such as "I can write with my right 
hand" when the right side is paralyzed, etc. 
They may fail to sort objects or colors according 
to a guiding concept and can only group in 
terms of realistic use or perceptual impressions 
of belongingness (Goldstein, 1936, 1939) . 

The extensive case material has suggested two 
generalizations. (1) In the cortically injured, 
the meaning and use of language tend to be 
restricted to a simpler, cognitive scope of im- 
paired conceptual thought. (2) The personality 
of the patient as a whole can become constricted 
to a primitive level. A pertinent distinction 
between abstract and concrete behavioral levels 
or attitudes has been introduced by Gelb and 
Goldstein (1925) and their co-workers (Gold- 
stein and Scheerer, 1941; Hanfmann, Rickers, 
and Goldstein, 1944; Scheerer, Rothmann, and 
Goldstein, 1945) and others. According to the 
experimental and clinical findings of these 
authors, normal behavior manifests a coordina- 
tion of the abstract and concrete attitudes. The 
performance changes in the brain-injured are 
symptomatic of a de-differentiation of the higher 
functional levels involving "abstraction." Ab- 
stract behavior detaches the self from direct 
“outer or inner experience," e.g., from the de- 
mand qualities of phenomenal objects and im- 
mediate thought contents. The normal person 
can therefore impose mediating ideational struc- 
tures on the given situation, transcending its 
immediate realness and uniqueness. Symbolic 


HANDBOOK OF SOCIAL PSYCHOLOGY 


representation, active planning, and shifting are 
possible. The highest degree of abstraction is 
the reflective awareness of the cognitive activity 
itself, enabling the individual to give a verbal 
account, e.g., of his conceptualization — which 
also can be formed less abstractly. E. Heid- 
breder (1945a,b, 1946a,b) developed a com- 
parable theory from her extensive experi- 
ments on concept formation. She, too, makes 
the distinction between higher and lower forms 
of human cognition, e.g., the conceptual and 
symbolic as against the perceptual-manipulatory 
grasp of concrete things. 

Abstraction has volitional intent whenever 
reflective awareness is present. We may trans- 
late this behaviorally into self-instruction, 
whereby the individual wilfully assumes a 
certain set corresponding to the “aufgabe.” 
The task-execution can then take place with 
situational support or in a habitual fashion 
without step-by-step reflective guidance, until 
a shift is necessary. This relationship between 
self-instruction via abstract attitude and con- 
crete enactment may be assumed for a wide 
range of actions, including, for instance, the 
taking on of varying roles in social behavior. 


SYMBOLS, SIGNS, AND SIGNALS 


Cognitive interpretations of symbolic and ab- 
stract behavior contrast sharply with two cur- 
rent theories, of which one reduces symbols to 
signs and the other tends to reduce concepts to 
stimulus generalization. The first is reflected 
in the attempts of Morris (1946), followed by 
Seward (1948) and Osgood (1952). The second 
originates directly in the Hullian school. Morris 
admits some difference between signal and sym- 
bol but subsumes both under the concept of 
sign. “A symbol is a sign produced by its inter- 
preter which acts as a substitute for some other 
sign with which it is synonymous; all signs, not 
symbols, are signals” (1946, p. 25). The differ- 
ence between a symbol and a signal is chiefly 
that the signal is provided by physical events in 
the environment or in the organism, The patter 
on the roof is a signal of rain, the heartbeat may 
be the signal of excitement. In contrast, the sym- 
bol is a sign provided by the organism in the ab- 
sence of a signal (or object), and substituting for 
it. The word rain, for example, would substitute 
for the signal of rain, or rain itself. The be- 
havioristic derivation of symbol is further e: 
tablished by the assertion that the sign “is a 
preparatory stimulus which in the absence of 
stimulus objects initiating response sequences 
of a certain behavior family causes a disposition 
in some organisms to respond under certain con- 
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ditions by response sequences of this behavior 
family . . .” (p. 10). The notion of behavior 
family is a special case of Hull’s habit family 
in that the response sequences terminate in simi- 
lar goal objects for similar needs. 

The crucial issue between Morris and cog- 
nitive theorists such as Suzanne Langer concerns 
the nature of the difference between signals or 
signs on the one hand and symbols on the other. 
For Langer (1951) signals and signs are synony- 
mous and differ from symbols. Signals evoke 
actions but a symbol does not necessarily evoke 
an action appropriate to the presence of its 
object. The word “Napoleon” does not call out 
specific behavior towards Napoleon but a 
thought about him. “In talking about things 
we have conceptions of them, not the things 
themselves; and it is the conceptions not the 
things that symbols directly ‘mean.’” Again, 
“signs announce their objects . . . whereas sym- 
bols” lead us to “conceive their objects" (Lan- 
ger, p. 61). Morris, on the other hand, considers 
the difference between signal (or sign) and 
symbol one of degree. He says, “The difference 
is not between 'announcing' an object in the 
case of signals and of ‘conceiving’ an object in 
the case of symbols, but in the degree of absence 
or presence of the supporting conditions under 
which dispositions to behavior issue in overt 
behavior” (1946, p. 52) . In reply to the example 
of Napoleon, Morris argues that one need not 
act any more in response to the word symbol 
of "Napoleon" than to the signalling pattern 
of raindrops on the roof. It suffices that the dis- 
position to behavior is evoked. One wonders 
whether this argument would hold if the words 
“peace” or "democracy" or “justice” were heard 
instead of Napoleon. The presumed “response 
disposition” must then become so general and 
abstract that the concept becomes diluted to a 
behaviorist’s version of an idea which can no 
longer be a response substitute for the behavior 
per se, be it overt or covert. Morris himself 
does realize certain limitations in his theory 
of sign response. He admits that animals seldom 
produce the signs which influence their be- 
havior, Human language symbols “characteris- 
tically do this, and to a surprising degree. Here 
is a basic difference between men and animals, 
and until behavioral theory develops a semiotic 
adequate to this difference it will remain what 
it is today: a careful study of animals and a 
pious hope for a science of the human person” 
(p. 198). 

The psychologist Osgood (1952) has extended 
Morris’ formulations to a strictly behavioristic 
derivation of meaning and concept. First, Os- 
good adds S-R learning to Morris’ notion of 
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behavior disposition. Second, he invokes the 
construct of partial identity between the be- 
havior disposition elicited by the sign and the 
previous behavior elicited by the object itself. 
“Words represent things because they produce 
some replica of the actual behavior towards 
these things as a mediation process” (p. 204). 
Therefore a stimulus pattern which is not the 
object is a sign of the object if it evokes a medi- 
ating reaction which is a fractional part of the 
total previously evoked behavior by the object. 
This mediating reaction is further defined as a 
selfstimulating one that leads to responses 
which would not occur without the previous 
association between sign and object. Productive 
as these propositions are, they clearly serve to 
rescue Morris’ concept of behavior disposition 
from its dangerous flirtation with “idea” and 
confine it to the Hullian “pure stimulus act.” 
Osgood's explanation rests upon a fractional 
identity of responses without really accounting 
for the psychological character of the identity. 
We have here the same situation as in some cases 
of problem solving, and of concept formation 
experiments, where the crucial feature is the 
discovery of the so-called identity. Moreover, 
cognition theorists may query, with G. W. All- 
port (1937): “How identical is an element?" 
'To what extent does the presumed identity 
require the very process of abstraction which 
the fractional identical response is supposed to 
replace? There remains the further challenging 
psychological question of whether these self- 
produced signs of signs — the symbols — are 
ever invented without preceding object associa- 
tions. Does not the language of scientific con- 
structs, of mathematics, of art symbols, or of 
games such as chess testify to at least some modi- 
cum of creativity? It is difficult to decide 
whether the notion that the organism’s re- 
sponses furnish stimuli which are the “surro- 
gates for ideas” could endow the “pure stimulus 
act" with original creativity. Hull (1952) has 
further refined the concept of "pure stimulus 
act,” especially with regard to human behavior. 
In this respect, Hull now accepts both “molar 
foresight or foreknowledge" as well as purpose 
in Tolman's sense, and even insight (p. 150 ff.) . 
However, the "identity of fractional antedating 
elements" does remain the basis of such cog: 
nitions (cf. also p. 325). 

While S-R theorists such as Seward (1948) are 
willing to grant propositional symbols to man 
alone, they are at the same time eager to grant 
other symbols besides words to animals. Con- 
ceding that there can be no debate concerning 
human speech, Seward excludes this phenome- 
non from further discussion. Instead, he sets 


130 


out to prove that animals can produce symbols 
and that therefore “the conative and cognitive 
processes of humans and other species belong on 
a continuum varying only in complexity” (p. 
293). 
Seward also interprets Morris’ exposition of 
a symbol as a surrogate response. But it is not 
a behavior disposition, it is an “intermediate 
reaction” which substitutes for a sign by pro- 
viding an internal stimulus for a response. The 
concession is made by Seward that even such 
nonverbal symbols occur in the central connec- 
tions between S and R. “Stripped of all sub- 
jective implications it is a behaviorist's version 
of an idea” (1948, p. 282). Since this central 
process concerns something perceptually absent 
and is at the same time a stimulus for behavior, 
it would be a typical case of a distal referent in 
disguise. If that should be denied, the central 
process must be a circumlocution for an image. 
The generality of this image can again not be 
likened to a conceptual symbol. Cassirer has 
clearly expressed this in discussing the theory 
of groups: 


The concept cannot be represented in the form 
of an image but only in the form of a rule... 
Concepts are psychologically actualized by "sche- 
mata," not by images... The rule may, in 
simple and exact terms, be defined as that group 
of transformations with regard to which the varia- 
tion of the particular image is considered. (1944b, 


p.22). 


THE COGNITIVE PROBLEM IN STIMULUS 
GENERALIZATION AND TRANSPOSITION 


Another attempt to bring the representational 
character of generalizing into the orbit of S-R 
laws is more directly connected with the Hullian 
postulate of generalization. Spence (1937, 1942) 
made an ingenious use of this postulate in rein- 
terpreting the configurational evidence for 
discrimination learning and transposition. Ac- 
cording to Gestalt theory, transposition re- 
sponses as well as many reactions to "equivalent 
stimuli" are governed by the relations between 
the stimuli rather than by their absolute proper- 
ties. Similar to melody transposition, relational 
dominance is demonstrated in the type of train- 
ing Yerkes and Kóhler introduced. For example, 
a small and large box are presented and the 
choice of the larger rewarded. A still larger box 
is then paired with the rewarded and transpo- 
sition tested; further stimulus pairs are added, 
increasing the size each step (the original size 
ratio is preserved and positional factors are 
controlled). With infrahuman subjects up to 
the chimpanzee the range of transposition is 
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limited in the number of steps the test pairs 
can be removed from the training pair. An ac- 
cepted explanation of this limit is still wanting. 
Lashley (1938) suggests a perceptual interpreta- 
tion. Test pairs requiring far transposition may 
obscure the relational and emphasize the abso- 
lute character of the stimuli. He offers the 
analogy of a pinhead and a wagon wheel, which 
do not belong to the same size dimension as a 
dime and a quarter. Extreme stimulus sizes will 
have absolute properties outweighing the rela- 
tional ones. For Spence (1942) this analogy 
lacks predictive value, since no quantitative 
specification can be deduced from it. He ad- 
vances an alternative nonrelational hypothesis 
to cover both transposition and its range. Con- 
sonant with S-R tradition, he posits an indi- 
vidual response to each of the two absolute 
stimuli in the original training pair. In essence, 
his explanation amounts to an assumed rein- 
forcement effect of the rewarded absolute 
stimulus (“excitatory potential”) and an in- 
hibitory effect of the nonrewarded absolute 
stimulus (“inhibitory potential’). To each of 
these two stimuli corresponds a generalization 
gradient. Since the two gradients must overlap, 
this will result in a differential between the 
excitatory and inhibitory potentials for any 
given pair of stimuli along a continuum of 
transposition (e.g., size). By mathematically 
constructing the curves for both gradients, 
Spence tries to account for the empirically found 
limits of transposition — confirmed with his 
chimpanzee subjects. 

The issues in this controversy clearly carry 
far-reaching implications for cognitive prob- 
lems, and for concept formation in particular. 
The presumed stimulus generalization is, of 
course, patterned after Hull's conditioning the- 
ory in which Pavlov's temporary irradiation 
effect becomes something more permanent. In 
the early stages of conditioning to a tone, for 
example, the generalization covers a wide range, 
so that the gradient shows little or no slope 
from the point conditioned, i.e, many other 
tones will elicit the C-R. Only through con- 
tinued reinforcement of the specific stimulus, 
usually by contrasting it with the "irrelevant" 
stimulus, will the range of generalization narrow 
and a gradient develop. The question is raised 
by Lashley and Wade whether “many instances 
of so-called generalization consist of nothing 
more than failure to observe differences” and 
failure to “form effective associations with spe- 
cific aspects of the stimulus” (1946, pp. 82-85). 
Consequently, such a “generalization by de- 
fault" requires the positing of relational per 
cepts rather than absolute stimulus responses, 
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since the latter depend on discrimination of the 
individual stimulus. The finer details of the 
Lashley-Hull controversy need not concern us 
here. The basic question of generalization by 
default seems to have been lost in the shuffle. 
Since it remains an unresolved problem it 
should invite reexamination. This need has 
been recognized by some S-R exponents. Mow- 
rer and Lamoreaux (1951) have called attention 
to the fact that true conditioning may occur in 
one trial. The conditioning is to the general 
features of the situation as a whole. Only after 
further trials do responses become specific. This 
latter, they claim, is discrimination learning, 
and should not be confused with conditioning 
proper. Nevertheless, the tacit assumption of 
behavior theorists seems to be that stimulus 
generalization, transposition, and "genuine" 
generalization in concept formation are basi- 
caly equivalent. "This assumption may be 
premature if not unwarranted. 


A case in point is the experimentation of M. 
Kuenne (1946) and Albert and Ehrenfreund (1951), 
who tested preschool children at progressive mental 
age and chronological age levels on a size transposi- 
tion problem. The experimenters adopted Spence's 
gradient of generalization for predicting the range 
of transposition. The more advanced groups suc- 
ceeded significantly better on a far transposition 
than the less mature groups. The investigators noted 
that the successful children were those who spon- 
taneously used "verbal cues” ("it's always the smaller 
one"). It is significant that the transposition results 
of the nonverbal and younger children were con- 
sonant with Spence's gradients. The gradient for the 
verbal children, on the other hand, remained in 
principle horizontal, indicating complete transposi- 
tion throughout. This deviation from the Spence 
gradients is attributed to verbal cues, which facilitate 
the far transposition. The psychological nature of 
these verbal cues is not further elucidated, but the 
term "verbal cues" bespeaks an associationistic con- 
ception of language acquisition. 


A hypothesis of lower and higher processes to 
cover stimulus generalization, limited transpo- 
sition range, and the findings on far transposi- 
tion has been offered by Scheerer (1949). For 
the lower processes he posits, in the same vein 
as Lashley, that in generalization by default 
there is a lack of discrimination, a generality 
of response based on inarticulate differentiation. 
For example, in early conditioning the response 
is made to sound, brightness, form, i.e, to an 
unanalyzed whole impression against a homo- 
geneous ground. "Since no other specific attri- 
butes are herein discriminated, the same 
response can occur within this undifferentiated 
range of stimuli" (1949, p. 249). Perceptually, 
other sounds, brightnesses, or forms have the 
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character of likeness or sameness at this stage. 
Correspondingly, in limited transposition, the 
differential characteristics between the original 
pair and the new pair are not cognitively regis- 
tered, but the preserved relations of brightness 
or size lead to a concrete impression of likeness. 
Kóhler, in a personal communication to Wap- 
ner, indicated that transposition alone does not 
constitute evidence for insight. Kóhler doubts 
whether "an animal who chooses relationally 
and therefore tends to transpose, need be aware 
of the fact that his choice is relative" (Wapner, 
1944, p. 51). 

Such an impression of likeness may be illus- 
trated on the human level by the fact that most 
people do not hear any difference when a 
melody is transposed, up to a certain point; 
“it’s just the same’ to them — there are no 
differentials for them from which to generalize” 
(Scheerer, 1949, p. 249). This also holds for 
many other Gestalt transpositions, e.g., triangu- 
lar or square forms, where the total impression 
of pointedness or angularity may persist without 
cognitive discrimination of dilferences. A gen- 
erality such as this, based on global likeness, 
underlies the so-called stimulus generalization 
as well as the limited range of tested transposi- 
tion. This likeness will break down when the 
perceptual disparity increases so that other 
absolute stimulus characteristics become domi- 
nant. lt is at this point that higher-order cog- 
nitive processes must take over if the factor of 
similarity or relational likeness is to be ab- 
stracted. This process is conceptual in kind 
and qualitatively different from concrete re- 
sponses to perceptual impressions of likeness. 
In the light of this interpretation the difference 
between the verbal and nonverbal groups in the 
Kuenne-type experiment may become explic- 
able. The verbal cues are word meanings, 
expressing a principle. 

Spence is right in demanding quantitative 
predictions as to when the presumed global 
likeness breaks down. The fact that this em- 
pirical question has not yet been answered 
satisfactorily does not decide the superiority 
of absolute over relational hypotheses. A rela- 
tional theory can well account for primary 
stimulus generalization, i.e. the period before 
the conditioned stimulus becomes individuated 
by the reinforcement; it does account for the 
occurrence of primitive discrimination in terms 
of patterned percepts, e.g., figure-ground; and 
tit accounts for the shift from near transposition 
to far transposition by distinguishing between 
pseudo and true generalization. In another 
connection, Maier suggests that the transfer of 
responses to new situations may depend upon 
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either one of the following processes: “ (a) re- 
actions to likeness alone, and (b) abstraction, 
which involves a balance between reactions to 
likenesses and differences" (1941, p. 179). This 
statement is consonant with the point of view 
presented here, that in true generalization the 
differences as well as the communalities are 
discriminated, and the apparent perceptual 
disparities are subordinated to a common prin- 
ciple. Numerous experimental findings on 
concept formation and on the acquisition of 
word meanings support this position (Hanf- 
mann and Kasanin, 1942; Scheerer, 1949; 
Werner and Kaplan, 1950). The conclusions of 
these authors all point in a similar direction, 
namely, to a distinction between reactions gov- 
erned by total impressions which are not ana- 
lyzed and processes of analytic generalizing 
abstraction. 


Recently Gibson (1951) has independently taken 
an almost identical position. In his discussion of 
schematic perception he distinguishes a precon- 
ceptual or preverbal "stimulus generalization" from 
genuine conceptual thinking in verbal symbols. He 
succinctly sums up the decisive developmental dif- 
ferences; "As primary stimulus generalization de- 
crcases, a different kind of generalization increases 
. .. human reactions manifest not only a kind of 
generalization based on ignorance but also a kind 
of generalization based on knowledge" (1951, pp. 
107-108). Scheerer (1953) has extended the notion 
of indiscriminate, primitive generalization to inci- 
dental learning in personality functioning. Acquired 
coping mechanisms of the individual can often be 
traced to an undifferentiated cognition which struc- 
tures aspects of certain situations as the same without 
an awareness of differences (e.g., fears, personal likes 
and dislikes, general means-ends patterns) . 


Maier (1941) and Wapner (1944) have like- 
wise questioned the premise that true generali- 
zation occurs in the more primitive type of 
equivalence and transpositional reactions. Wap- 
ner's experiments proved that partially decorti- 
cated rats show more relational responses than 
do intact animals. “This precludes any possi- 
bility of interpreting the relational aspects of 
equivalent behavior as a higher mental process" 
(1944, p. 49). He concludes that the term "'gen- 
eralization"' makes for a confusion of higher 
mental processes with primitive types of per- 
ceptual reactions. Along with Werner (1940) 
and Goldstein and Scheerer (1941), Wapner 
argues that the end result of a performance may 
in one case stem from the process of abstraction 
while in another the same result may occur ons 
a concrete perceptual level. Achievement and 

rocess therefore must be separately analyzed 
(cf. Scheerer, 1946). The principle of differen- 
tiating achievement and process has been im- 
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plicitly or explicitly adopted by most cognition 
theorists, with the possible exception of Bruns- 
wik. Their emphasis upon the mediating proc- 
esses in cognitive representation allows for 
qualitative distinctions as to the nature of both 
mediation and cognition. In contrast, SR 
theorists, even when they postulate intermediary 
processes, have attempted to conceptualize them 
as uniform, Therefore qualitative differences in 
function according to levels do not play any 
significant role. The S-R constructs of media- 
tion, as far as they are offered, are placed on a 
continuum from simple to complex, differing 
only quantitatively. 


COGNITIVE DEVELOPMENT 


It is an apparent paradox that those who are 
most inclined towards genetic reductionism are 
at the same time those who are least sensitive to 
or cognizant of the problems of developmental 
change. It is for this reason that cognition is 
neither given a special place in development nor 
is it treated developmentally, in terms of genetic 
stages. Investigators concerned with such ques- 
tions have been critical of an additive concep- 
tion of development which the terms "simple" 
and "complex" often imply. Goldstein (1936) 
marshalled data which show that the criterion 
of difficulty of performance cuts across a nu- 
merical scale of simple and complex. The 
lifting of a single finger is more difficult than 
the extension of the whole hand. It requires 
the voluntary isolation of the digit. It is easier 
to "scratch" one's nose or hit an object with a 
finger than it is to point to the nose or the object 
with the same finger in the act of "showing 
where.” The latter requires a symbolic func 
tion. Typically, the pathologically concretized 
person may reproduce from memory a house 
composed of many sticks. Yet he fails to repro- 
duce correctly the position of a single stick 
which is not tied to a concrete object. This 
requires the grasp of spatial directions in the 
abstract. The implied hierarchy of functional 
organization calls for a new criterion of simple 
and complex that is not merely additive. Such | 
a criterion is the degree of differentiation which 
is the precondition for qualitatively different 
levels of performance. The explanatory fruit- 
fulness of this construct for explaining func- 
tional differences has been attested in biology 
as well as in developmental psychology (Weiss, 
1939; Child, 1924; W. Stern, 1930; C. H. Bühler, 
1930; Werner, 1940; and Piaget, 1937) . 

Perhaps the most systematically formulated. 
theories of mental development have been those 
of Piaget and Werner. We are singling these 
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out because both made the emergence of cog- 
nitive processes the focus of their research. 
Both have gathered their material independ- 
ently, and neither is reductionistic. Werner's 
earlier formulations on this point are readily 
matched by the recent statement of Piaget: “It 
is not a matter of reducing the higher to the 
lower . .." The task is rather to "establish a 
reciprocal relationship between the two terms 
which does not at all destroy the originality of 
the higher term” (1952, p. 8; cf. Werner, 1948, 
pp. 5, 22). The genetic interrelationship be- 
tween lower and higher levels is conceived by 
both as a progressive sequence from the lower 
to the higher. Each new level has an organiza- 
tional structure of its own which forms a stage. 
Thus each level is a “whole” in terms of func- 
tional organization. It emerges from the preced- 
ing one but differs from it; what was a whole 
on the lower is now integrated as a subordinate 
part of the higher level. The genetic concep- 
tion here advanced supports and complements 
the hierarchy of mental functions previously 
discussed. In this connection the groundwork 
laid by Piaget and Werner must be considered 
in any theory of cognition because they en- 
deavor to trace the characteristic representa- 
tional form and content of each stage as well as 
the course of cognitive development as a whole. 
Despite differences in terminology many of their 
conclusions are mutually translatable without 
major distortion. In view of this congruency 
in their thinking we shall select portions of their 
research so that they will supplement each 
other. 

Piaget’s importance for cognitive theory lies 
in the following keynotes of his system. There 
is first his explicit postulate of cognitive struc- 
tures as mediating processes in perception, act- 
ing, and thinking. Distal achievement, in 
contrast to Brunswik, is studied in terms of the 
particular intervening organismic activities. 
Second, Piaget recognizes these mediating proc- 
esses chiefly as representations. Third, he im- 
plements this conception by genetically tracing 
the emergence of representations through onto- 
genetic stages. He follows the intellectual 
evolution from the cradle to maturity by em- 
pirical samplings at each stage. This develop- 
ment is treated in a framework of biological 
principles. Lue 

Mental processes grow out of basic organismic 
processes which lead to cognitive structuring in 
accordance with the adaptation of the organism 
at a given stage. Adaptation consists of an 
interplay between assimilation and accommoda- 
tion tending towards equilibrium. Physically, 
assimilation means that the organism absorbs 
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substances and changes them into something 
compatible with its own substance. Behaviorally 
it means that the organism, instead of submit- 
ting passively to the environment, modifies it 
by imposing certain structures of its own on it. 
Accommodation signifies the action of the en- 
vironment on the organism, which becomes 
affected or changed accordingly. However, the 
outer stimulations as such cannot have a direct 
impact. They can only modify the process of 
assimilation. Translated behaviorally, this 
model can be exemplified in sensory-motor co- 
ordination. Manipulation contains the factor 
of assimilation in grasping the object, and the 
factor of accommodation in adjusting the pre- 
hension to the form and resistance of the object 
grasped. With regard to personality function- 
ing, the same principle holds for the accom- 
modation of the individual to the social influ- 
ences and pressures of his environment, e.g., 
norms, conventional ideas, and “obligatory 
truths" (1950, p. 159). The model also implies 
a changed end-stage in the person. This is typi- 
fied when in vision a peripheral object is fixated. 
The eye movements change the initial state of 
lacking clarity to an end-stage of optimal clarity 
— the object is “assimilated,” while accommo- 
dation to it takes place simultaneously. Cor- 
respondingly, the assimilation of information 
or new ideas and goal attainments results in a 
change of the person. This important aspect of 
personality development has been emphasized 
by G. Murphy (1947) and by G. W. Allport 
(1937), by Goldstein in the notion of enrich- 
ment, and by Schilder (1942) in the theory of 
completion states. 

The equilibria involved in these adaptive 
cycles are considered quasi-stable systems. ‘They 
have a notable similarity to current theories of 
self-regulation, feedback, and to the reflex 
scheme proposed by Koffka (1935). For Piaget, 
the adaptive equilibria in mental functions dif- 
fer for each developmental stage, so that general 
adaptation is never complete but continues. 
Such stages as "sensory-motor intelligence," 
"preconceptual" and "intuitive thought" are 
quasi-equilibria, each at a specific level of cog- 
nitive development. The relative incomplete- 
ness of adaptation at each stage may be equated 
to immaturity, i.e., to the level of maturation at 
which the balance between assimilation and 
accommodation is excessively tipped in either 
direction. Methodologically speaking, Piaget 
designs his experiments and observational pro- 
cedures so as to detect the errors in the child's 
performance on tasks and to analyze the proc- 
esses which underlie the solution or nonsolu- 
tion. In principle, the errors are conceptualized 
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as the inability to “decenter,” and the develop- 
mental progress is conceptualized as an in- 
creased ability to decenter. The meaning of 
“decentering” may be appreciated by first think- 
ing of the tendency in the young child to become 
“centered” or rigid. The nine-month-old baby 
can reach for a milk bottle hidden before his 
eyes under one of two pillows. If then the bottle 
is hidden, in his view, under the other pillow, 
the baby will repeat reaching for it under the 
first pillow. In other words, certain aspects ofa 
percept, an action sequence, a situational per- 
spective from one point of view can become so 
dominant that a child is “fixated” and cannot 
shift orientation. Only by such a shift, i.e., by 
decentering, can the present cognitive structure 
be changed and widened to include other and 
new perspectives. (The similarities to Gold- 
stein’s concept of ability to shift and Wertheim- 
er's recentering are obvious.) 
The progressive decentering which Piaget de- 
scribes in his analysis of stages may be sys- 
tematized as follows: 


I. Sensori-motor intelligence (birth to 114 years) . 
Reflexes, e.g, grasping and sucking, are the earliest 
“perceptual-motor schemata” (1950, p. 100; 1952). 
Through decentering they come to form "parts" of 
primitive action wholes or “habits.” These simple 
schemata are progressively decentered and integrated 
into new complex units, those of voluntary action 
and means-end behavior. Thus grasping is used 
first to produce rattling when a string is pulled, later 
to remove a screen which partially hides an antici- 
pated goal, finally to obtain absent objects. The 
last act marks the beginning of representation and 
the notion of external permanence. Delayed action, 
detouring, and imitation appear. The functional 
character of things governs this stage. Facts are not 
noted for their own sake but only in relation to 
needs (autism). Make-believe play is restricted to 
using one’s “body to produce an action foreign to 
the present context,” as in feigning sleep. 

IL Preconceptual thought (114 to 4 years). The 
schemata of make-believe play become real symbols 
when dissociated from action with one’s own body, 
eg., putting a doll to bed. This “representative 
imitation” coincides with speech, marking the growth 
of a general “symbolic function” which encompasses 
“symbolic play, imaginal representation, and verbal 
thought” (1950, p. 126). Language is still tied to 
action and desires, as in naming. Names signify 
discovered action aspects of the objects and become 
part of them. Since the child does not invent 
language but is exposed to it, he has to “accommo- 
date” to the existing system of conventional 
meanings without fully understanding them. Thus 
he assimilates word meanings in accordance with the 
egocentrism of this stage. Piagets illustrations. of 
reconceptual thought parallel those of W. Stern 
(1930). " ‘Bow-Wow’ is said to a dog, to a hen, a 
cowbell, cows, guinea pigs, and a cat; subsequently 
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to anything moving from an ant to a tractor in the 
field" (1951, p. 218). In contrast to concepts, "in a 
schema such as ‘bow-wow,’ there is merely a sub- 
jective feeling of kinship between the related 
objects" (1951, p. 220). Such "participations" figure 
prominently in causal attributions as well. A moun- 


tain changes appearance as the child travels around, | | 


it. Unable to integrate the varying perspectives, he 
claims the mountain is changing. Though the idea 
of permanent objects has been formed in the im- 
mediate action sphere, this is not so “where distant 
space and reappearances at intervals are concerned” 
(1950, p. 128). The child “will be just as likely to 
say ‘slug’ as ‘slugs’ . . . without deciding whether 
the slugs encountered in the course of a single walk 
are one individual, a single slug, or a class of distinct 
individuals” (1950, p. 127; 1951, p. 225) . 

Ill. Intuitive thought (4 to 7 years). The use of 
representations for reasoning is limited by per- 
ceptual dominance. When faced with the problem 
of logically integrating several aspects of the same 
object or the same aspect of different objects, the 
child cannot disregard that aspect which is per- 
ceptually obtrusive. Some typical experiments 
illustrate this (1950, pp. 129-135). (1) The child 
puts an equal number of beads into each of two 
vials identical in shape and size. He also acknowl- 
edges the equal amount. Next, vial 4 is emptied 
into a taller and thinner glass, while vial B remains 
unchanged in sight. The child now states that the 
number of beads has changed, even though he avers 
that none have been added or taken away. Some 
children say that the new glass contains more beads 
because it is "higher," others insist that there are 
fewer beads because it is "thinner." Their reasoning 
is centered on what is to them the prominent per- 
ceptual aspect, height or width. Unable to decenter 
either aspect, they cannot grasp the mutual inter- 
dependence of height and width and abstract the 
conservation. of quantity. Similarly, they fail to 
group two different classes of objects in one super- 
ordinate class when the two subclasses are perceptu- 
ally different. (2) Three colored beads are strung 
on a rigid wire partially enclosed by a tube. As 
the beads are pushed through the tube, the child 
is asked to predict the order in which they will 
emerge, and also their order upon coming back 
through the tube. At the preconceptual stage, the 
original order (4BC) could be predicted but not 
the reverse order (CBA), which is now also mas- 
tered. However, when the experimenter rotates the 
wire stepwise by 180 degrees Dé turn), the child 
encounters difficulties. He can discover that a half- 
turn leads to a reversal and a full turn “repeats” 
the original order, but he cannot predict the order 
for many rotations. The reason for success or failure 
is that the child can represent to himself through 
action in thought the order of the beads for a few 
turns of the wire but cannot decenter his percept 
ally dependent imaginary manipulations in order to 
derive the general principle applicable for any num- 
ber of turns. 

IV. Concrete operations (8 to 11 years). In com 
trast to common usage, concrete operations are 


« 
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actions that have become abstract. They are manipu- 
lations in thought, based on inductive abstraction 
from below which overcomes the "intuitive" de- 
pendency on action context and situations. In the 
rotating-wire problem the child can now discover 
the rule that all half-turns reverse the order; all full 
turns repeat the original order. He thus predicts 
any number of turns. Concrete operations are, how- 
ever, limited to manipulating “ideas of objects” 
(1950, p. 139). Logical form is still dependent on 
its concrete content. In the vial problem, the child 
can now abstract the conservation of quantity, being 
able to decenter height or width and combine them 
into a logical structure. But he cannot yet conceive 
the conservation of volume of two identical pieces of 
dough when one is changed in shape. At this stage 
the perceptual potencies of height, width, and length 
still hinder a balanced abstraction so that these 
aspects cannot be conceptually integrated. Cor- 
respondingly, the child cannot deduce the identical 
principle or concept which he discovered and utilized 
with situational support when the same problem is 
presented merely in verbal form (1950, pp. 146-149) . 

V. Formal thought (12 years on). The child can 
manipulate ideas about ideas or, as Piaget puts it, 
“perform operations upon operations.” It is the 
level of abstraction at which reflective reasoning and 
formal logical thinking become possible. It permits 
mobility and reversibility of operations; the ability 
to conceive of the possible and the impossible; the 
formulation of propositions about space, time, num- 
ber, as systems; and the carrying out of hypothetico- 
deductive reasoning which proceeds from assump- 
tions not necessarily related to reality but related 
by logical necessity, independent of evidence from 
experience. Piaget emphasizes the attainment of 
the notion of "conservation" through "grouping 
operations." Conservation is the attribution of per- 
manence to objects and to relations in perception 
and thinking. It develops progressively from the 
perceptual constancies to abstract concepts. For ex- 
ample, conservation of quantity, weight, volume, etc., 
are shown to differ at various stages. Such attribu- 
tions of permanence demand the decentering proc- 
esses which have here been traced. 


We have epitomized the characteristics of 
these stages because Piaget's latest formulations 
in the English language provide the systematic 
basis for the understanding of his former theo- 
ries on causality, language, moral judgment, etc. 
It becomes clear that the given age limits are 
not absolute but only reference points for the 
sequence of development. The stages are fur- 
thermore combined into larger, more inclusive 
phases which have a general cognitive and ego 
organization in common. They are the phases 
of autism and egocentrism, subject of much 
speculation. For Piaget, autism is the assimila- 
tion of reality to organic needs, mainly in the 
sensori-motor stage. A decentering occurs when 

_ the need-satisfying “autistic symbolism” be- 
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comes transformed into representations of 
reality, i.e., when the child increasingly accom- 
modates to the environment. 

Egocentrism, the next phase, is defined as a 
lack of reversibility in operations involving 
cognitive and social perspectives. Piaget himself 
concedes that the term "egocentrism" is an un- 
fortunate choice (1951, p. 285). It does not 
signify self-centeredness but "a quite uncon- 
scious and natural illusion of perspective which 
precedes moral egoism and conscious egocen- 
tricity" (1937, p. 48). In direct experience, the 
child is not conscious of a distinction between 
subjective and objective, hence of his own “sub- 
jectivity" He has no awareness of the private 
or nonprivate character of a point of view and 
therefore cannot differentiate between what 
might be his own or another's frame of reference 
(1950, p. 160 ff.; 1951, p. 290). Since a frame of 
reference is necessary for orientation, a child 
may have the perspective of another without ex- 
periencing it as such. Young children often 
imitate without realizing it, believing they origi- 
nated the action, just as they may confuse their 
own experience with that of another. The 
adaptive balance between accommodation and 
assimilation in egocentrism makes for a step- 
wise differentiation between self, objects, and 
persons. It culminates when persons are seen 
to have perspectives of their own (i.e., cognitive 
representations) . 

In evaluating Piaget's work it must be recog- 
nized that he is stronger on theory and ingenious 
experimentation than on rigorously controlled 
methods. The criticisms of some of his earlier 
findings, especially by Susan Isaacs (1930), 
Dorothea McCarthy (1930) , and others, are well 
known. The chief objections have been: (l) 


. Piaget treats primarily of the rational elements 


in child development and neglects the affective 
and conative dimensions. (2) Piaget does not 
give sufficient weight to learning, experience, 
and socio-environmental factors. (3) His 
“stages” are neither distinct nor unitary func- 
tional levels. They overlap chronologically. 
They distribute over different performance 
areas at the same age. Actually the process of 
development is diversified as well as gradual, 
but not in evolutionary steps. It is conceivable 
that the critical reserve in accepting Piaget's 
theses will be lessened after his more recent work 
has been absorbed into current psychology. The 
documentation in this later phase appears to be 
more exact, although again a larger sampling is 
missing. Dennis, after citing chapter and verse 
of Piaget’s "sins" in methodology, adds to his 
critique this comment: 
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. . . Piaget can neither prove nor disprove his 
hypotheses. That must be done by others . . . 
We turn to Piaget for ideas, not for statistics . . . 
Following an unfavorable first reaction to his 
books, the writer attempted to show how out- 
rageous were some of his views, and, by gathering 
data with the expectation of refuting his theories, 
became convinced that many of them were 
sound.” (1951, p. 161) 


The assumption of developmental levels is also 
shared by Werner. In clarifying misconceptions 
of the term, “developmental level" Werner 
and Kaplan state: “the stage of maturity charac- 
terizing an individual or a group must not be 
conceived statically, as a fixed pattern of 
operation; it should rather be understood dy- 
namically, as a potential range of genetically 
graded functions. Mental growth is definable 
in terms of reorganization due to the inclusion 
of new forms of operations. This implies that 
an individual, depending on outer or inner 
circumstances, may operate at genetically dif- 
fering levels” — i.e., depending on the difficulty 
of the tasks (1950, p. 101). Werner includes 
the emotional dimension in his genetic scheme, 
in accord with his holistic point of departure. 
He derives general laws of development from a 
wide range of empirical data in comparative 
biology, psychology, anthropology, and psycho- 
pathology. At the base of Werner's treatise is 
the formal principle that every genetic sequence 
has an intrinsically required order and direc- 
tion. This developmental psychology does not 
deal with actual historic succession but rather 
with the functional conditions for primitive and 
advanced forms of cognition. A statement from 
Goethe’s morphology, cited by Werner, is 
apropos: 

The more perfect the creature becomes, the 
less similar become the parts to one another. On 
the one genetic pole the whole is more or less 
similar to the parts, and on the other the whole 
is dissimilar to the parts. The more nearly equal 
the parts, the less are they subordinated one to 
another. Subordination of the parts indicates the 
perfect creature. (Werner, 1948, p. 40) 


Developmental direction is thus conceptual- 
ized as a movement from an undifferentiated 
stage towards an increasing differentiation of 
parts; it goes hand in hand with a progressive 
hierarchy of function. In this hierarchy the 
foci of dominance shift at different genetic lev- 
els. At one level the dominant center is a 
perceptual-motor affective orientation, e.g., to- 
wards expressive features of things and persons. 
At a later stage these reactions become inte- 
grated with and subordinated to a new center 
of dominance, i.e., cognitive-conceptual control. 


> 
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Development is analyzed into four aspects, 
which reflect lower and higher degrees of dif- 
ferentiation: 


(1) Format aspects: Diffuse-articulate. Lack of 
articulation into distinct parts makes for diffuseness, 
eg. in primitive perception and thought. Another 
example is noise, or a glissando in music. In con- 
trast a melody has articulation because the parts 
constituting the whole are determinable. 

(2) pynamic aspects: Rigid-flexible; labile-stable. 
Rigidity is a lack of variability in response when a 
shift is called for. In children this lack of flexibility 
may be typified by their sensitivity to change, their 
tendency to all-or-none reactions, their dependency 
on definite order, eg., on eating and dressing se- 
quences and ritualistic modes of action. Lability is 
expressed in the fluidity of behavior, e.g., the dis- 
tractibility of young children, their impulsivity and 
the transient character of their emotions. Stability 
as well as flexibility stem from an increased central 
control of the direction of behavior. Both “preserve 
the functional equilibrium in the face of mutable 
situations” in contrast to lability and rigidity, in 
which the responses are more passively governed by 
the momentary valences (1948, p. 55). 

(8) COGNITIVE-AFFECTIVE AsPECTS: — Syncretic-dis- 
crete. Syncretic signifies an indistinctness between 
the experiences of inner states and of objects; a lack 
of separation between self, other, and environment; 
a lack of segregation in personality regions; a lack 
of demarcation of reality levels. Cognitively, all this 
is a merging of otherwise separable experiences into 
a global meaning. So-called "pars pro toto" thinking 
in magic is an instance of syncretism. Syncretic fac- 
tors enter into such child notions as the following: 
the child asks his mother for fish, because eating it 
will make him a good swimmer; having had a dream, 
he believes that it happened in reality; having 
"wished" a wish, he believes he has communicated 
it. On the affective side, syncretism expresses itself in 
reacting to the “physiognomic” expressive features. 
Persons and things are threatening, inviting, sad, 
gay, angry, friendly. On the action side, syncretism 
manifests itself in responding to objects as "things 
of action." They are chewable, graspable, throwable, 
hideable, bounceable. At this stage the various sense 
modalities and the affective motor sphere are inti- 
mately interwoven. Such a “total participation” 
fosters syncretic behavior and cognition. This re- 
cedes with greater individuation and discreteness of 
mental contents and functions. 

(4) IDEATIONAL AsPECTs: Concrete and abstract. 
These thought forms are implied in all phases of 
differentiation, and Werner refers to them in describ- 
ing levels of conception such as space, time, causality, 
etc. In particular, he traces the genesis of abstraction 
from its primitive preverbal forms to the symbolic 
conceptual. For Werner, true generalization is pre- 
ceded by “analogous functions" so that at each gene- 
tic level similar achievements can be attained via 
different processes (1937). Relations are first realized 
through sensori-motor activities, ¢.g., in the transpo- 
sitional use of action-things; then through perceptual 
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groupings when sameness and difference are com 
y apprehended; later through quasiclass con- 
cepts in terms of likeness. The next step is 
representation through schemata which culminate 
in genuine concepts. The same principle is applied 
to the stages of naming: the word expresses the 
physiognomic aspects of the object; the word repre- 
sents the object schematically; and the word is a 
conceptual symbol of a class. The achievement of 
naming objects is mediated through different proc- 
esses on cach of these steps, even though there is the 
"same" effect of naming. This is also demonstrated 
in the realm of number development. What is ac- 
complished concretely on the lower level is attained 
abstractly on the higher one. It is Werner's general 
thesis that such analogous functions of a lower order 
must emerge before a function of the next higher 
order can develop. 


The above resumé of and Werner 
should speak for itself with regard to the co- 
gency of their contributions for cognitive theory. 
An extensive analysis of the c 
and differences in their systems would be beyond 
the scope of this chapter, Two main parallels 
should be stressed, however. The principle of 
differentiation and hierarchization matches the 
principle of decentering. The concept of syn- 
cretism corresponds largely to that of ego- 
centrism, and the subaspects of both cognitive 
modes are complementary in many respects. 
Finally, the entire developmental dimension 
from concrete to abstract is treated in essentially 
similar form. Werner speaks of analogous func- 
tions and Piaget of grouping schemata as the 
functional equivalent of concepts. 


SUMMARY 


We have outlined some of the current think- 
ing on cognition. The major trends were seen 
to converge on the problem of representation. 
Cognition was identified as a centrally mediated 
process of representing external and internal 
events. The functions of phenomenal and of 
silent organization in perceiving, thinking, and 
learning were examined in their significance 
for cognition. An analysis of cognitive processes 
was presented in terms of different levels of 
organization. The implications of the "levels 
concept" were sketched with regard to generali- 
zation, concept formation, and propositional 
language. This was supplemented by a con- 
sideration of pertinent developmental theories 
of cognition. The role of cognitive factors in 
motivation, attitude, and emotion was explored 


and related to relevant interpretations of ego- 
function and of the self concept. 

A special attempt was made to delineate some 
basic tenets of cognitive theory within the con- 
text of contemporary psychological issues. In 
this framework we contrasted two main posi- 
tions: Organizational versus Stimulus-Response 
orientation. We have emphasized the commu- 
nalities rather than the differences among the 
cognitive as well as among the SR theorists. 
This was done in order to clarify the critical 
issues through a problem-centered approach as 
against a descriptive comparison of individual 
systems. Therefore only selected problem areas 
were discussed to illustrate these issues. These 
were the areas of molar-molecular, conation and 
cognition, cognitive development and genetic 
reductionism. 
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Our major intent in this chapter is first to 
present a précis of psychoanalytic theory and 
then to survey some of the applications of this 
theory in social psychology and the other social 
sciences. 

We shall focus our attention upon orthodox, 
Freudian psychoanalysis, and mention only in 
passing the major revisionists. This decision to 
emphasize orthodox psychoanalysis does not 
imply a lack of appreciation for the many im- 
portant contributions of other psychoanalytic 
theorists. It stems initially from limitations of 
space and beyond this from the feeling that the 
revisionist positions can be understood best 
against the background of the theory from 
which they evolved. Further, we are convinced 
that orthodox psychoanalysis, in spite of its 
formal shortcomings, is a far better elaborated 
theory than any of its close relatives, and is 
accompanied by a much greater wealth of per- 
tinent empirical evidence than the revisionist 
positions. 

Since this is primarily a chapter on theory, 
we have not felt obliged to review and sum- 
marize the empirical findings which in recent 
years have been appearing at an ever-accelerat- 
ing rate from psychological laboratories and 
clinics. Nor shall we offer a critique of psycho- 
analysis, whether our own or others’. Finally, 
our account of Freudian personality theory is 
not a historical one. When we feel it necessary 
to discuss historical matters, e.g., Freud's early 
theory of anxiety, we shall do so in small type. 
What we shall attempt to do then is to present 
systematically the major constructs of Freud's 
general theory of personality in their final form 
as evolved by Freud and to discuss the relation- 
ship of these constructs to social psychology. 


* We wish to express our appreciation to George 
S. Klein and Ernst Kris for their valuable advice 
during the preparation of this chapter. 


In addition, we shall discuss the impact of 
psychoanalysis upon the social sciences. 


PSYCHOANALYSIS AND PSYCHOLOGY 


The relation between Freudian theory and 
contemporary psychology bears considerable re- 
semblance to the relation between Darwin's 
theory of evolution and general psychology of 
fifty years ago. Darwin and Freud captured the 
imagination of thoughtful men to a point where 
psychologists could scarcely consider behavior 
uninfluenced by their views. The very act of 
minimizing the contributions of Darwin and 
Freud served to orient the deprecator toward 
the theory. Academic psychologists appeared 
restive under the influence of both of these for- 
mulators. Part of this discomfort may be di- 
rectly attributed to a characteristic professional 
narcissism which makes it difficult to accept the 
contributions of persons who are not members 
of the profession. A further discomfiting factor 
was that both of these theories possessed a sweep 
and grandeur that evoked quick enthusiasm on 
the part of the interested public, while arousing 
dismay and aloofness in the laboratory bred in- 
vestigator, with his concern for specificity and 
control. 

In general, we might say that the reserve, or 
downright hostility, that academic psychologists 
have customarily shown to psychoanalytic the- 
ory has both a rational and an irrational com- 
ponent. The rational component focuses upon 
psychoanalysts’ slight concern for carefully 
controlled empirical evidence, the absence of 
adequate empirical definitions, and the frequent 
statement of relationships in such a manner 
that they are not subject to proof or disproof. 
The irrational component derives from a kind 
of professional jealousy that strongly resents 
contributions made by individuals not trained 
in the discipline of psychology, a persistence in 
common-sense or normative views of behavior 
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which are so strikingly violated by many psycho- 
analytic precepts, and stubborn adherence to a 
naive positivism that considers nothing worth 
note that does not meet relatively high stand- 
ards of rigor and control. 

Although psychology is firmly rooted in the 
laboratory, while psychoanalysis is equally 
firmly rooted in medical practice with a tradi- 
tion of observation in the therapist's chambers, 
they share many common antecedents. Some 
of these shared influences are quite general, 
such as the influence of Darwinism and the 
exposure to vigorous brands of scientific func- 
tionalism. Perhaps more important, and cer- 
tainly more readily specified, is the common 
debt to nineteenth-century experimental physi- 
ology. 

No account of the development of psychology 
is possible without acknowledging the great in- 
fluence of Hermann von Helmholtz (Boring, 
1950). Nor is any account of the development 
of psychoanalytic theory complete without dis- 
cussion of the influence of Ernst Briicke, in 
whose physiological laboratory Freud spent six 
years prior to entering private practice. The 
extent of Briicke’s influence upon Freud has also 
been extensively documented (Bernfeld, 1944; 
Kris, 1950; and Boring, 1950). The relation be- 
tween these two ancestral figures, Helmholtz 
and Briicke, was not merely that of two indi- 
viduals working together in the same discipline 
in the same era. They were close friends and 
had studied together under Johannes Miiller at 
the University of Berlin. They shared a com- 
mon animosity toward vitalism and an assurance 
that all human behavior could be accounted for 
in terms of physical and chemical forces. An- 
other common antecedent was Fechner, who 
not only contributed in important ways to the 
development of psychology but also was an 
important influence upon psychoanalysis, as 
Freud’s many references to Fechner testify. 

We may conclude that the “new psychology” 
of the late nineteenth century and the first stir- 
rings of psychoanalytic theory appeared, if not 
under the same auspices, at least heavily in- 
fluenced by similar lines of thought. Thus, it 
seems fair to view the present rapprochement 
between academic psychology and psychoanaly- 
sis as representing the belated union of two 
intellectual traditions that have developed very 
differently but nevertheless began under im- 
portantly similar circumstances. 

Yet, as we have said, one should not assume 
that these mutual sources of influence produced 
an early compatibility between Freud and the 
psychologists of his era. For Wundt, Titchener, 
Kiilpe, Cattell, and most other figures of this 
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period, Freud’s work was regarded as falling 
outside the province of psychology. Only icono- 
clasts such as G. Stanley Hall, James, and Me 
Dougall were able to muster any early interest 
in psychoanalysis and to recognize it as psycho- 
logically relevant. Even among these psycholo- 
gists the interest was somewhat tentative and 
skeptical, as the following statement by Mc 
Dougall suggests: 


In my opinion Freud has, quite unquestionably, 
done more for the advancement of our under- 
standing of human nature than any other man 
since Aristotle . . . This judgment implies, of 
course, that there is much that is of value and 
relatively true in Freud's teaching. And yet I 
hold that every bit of such truth is mixed almost 
inextricably with error, embedded in masses of 
obscure implications and highly questionable and 
misleading propositions. (McDougall, 1936, pp. 
17-18) 


During the formative years of Freud and psycho- 
analysis, academic psychology had practically 
nothing to say of pertinence to the events that 
Freud was initially concerned with, i.e., general 
adaptive functioning of the human organism 
and the problem of mental disorder. Much of 
the psychology of this era consisted of specu- 
lative theorizing by individuals who were as 
much or more philosophers than psychologists. 
Such laboratory investigations as there were 
concerned themselves with the content of the 
normal, conscious, human mind. Not only did 
the focus upon the normal and conscious elimi- 
nate Freud and his interest in the pathological 
and unconscious but the methods of study em- 
ployed by the laboratory investigators were 
equally far removed from the analysts tech- 
niques for investigation. While the psychologist 
attempted to study specified aspects of behavior 
under controlled circumstances, where the in- 
fluence of motivational and social factors could 
be minimized, the analyst was concerned with a 
very wide range of behavior (or its verbal 
representation) and tried to establish a set- 
ting in which personal-motivational determi- 
nants would operate at their maximum. 

In spite of Freud’s concern with mental dis- 
order and the treatment process it would be a 
serious mistake to consider that psychoanalysis 
claimed to be merely an abnormal psychology. 
Freud made the contrary assumption: 


Psychoanalysis falls under the head of psychol- 
ogy; not of medical psychology in the old sense, 
nor of the psychology of morbid processes, but 
simply of psychology. (Freud, 1927b, p. 207) 


This conviction is borne out in Freud's writ — 
ings, where we find him perennially wrestling 
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with the problems of normal psychological 
functioning. The last chapter of the /nterpreta- 
tion of Dreams, lor example, is "pure" psychol- 
ogy in the sense that it is altogether concerned 
with perception, memory, motivation and 
thought. 

With respect to social psychology, Freud did 
not see a sharp distinction. between it and 
the psychology of the individual. Individual 
psychology, he felt, rarely could disregard the 
relations of the one to the many; "so from the 
very first, Individual Psychology is at the same 
time Social Psychology as well" (1921, p. 2). 
Consistently he felt that social psychological 
phenomena could be accounted for in terms of 
the same principles and constructs that had 
proven useful in representing the individual. 

It is not easy to trace the path that led to 
eventual interpenetration of psychology and 
psychoanalysis. There was a mutual awareness 
of each other's existence quite early. Freud read 
widely in psychological journals and his inter- 
national fame grew so rapidly during the first 
two decades of this century that most psycholo- 
gists knew something of his theories. He was 
honored along with a few of his colleagues at the 
Clark Vigentennium in 1910 but this event was 
not followed by any marked increase in his 
academic respectability. With only rare excep- 
tions, universities shut their doors to psycho- 
analysis until the thirties. The change that took 
place during the twenty years from 1925 until 
1945 is well illustrated by examining Taylor's 
(1926) Readings in Abnormal Psychology and 
Mental Hygiene and Tomkin's (1944) Contem- 
porary Psychopathology. These two volumes 
served rather similar functions in their respec- 
tive eras and yet we find that the Taylor volume 
is virtually devoid of references to psychoanaly- 
sis, excepting those that are derisive in character, 
while the Tomkins volume consists predomi- 
nantly of papers that bear the imprint of 
psychoanalysis. 

One of the factors leading to increased com- 
patibility between these traditions was the ex- 
tent to which representatives of each began to 
deal with overlapping empirical problems. 
While psychoanalysts were to some extent con- 
cerning themselves with nonpathological events, 
the academic psychologist was moving out of 
the laboratory and tackling problems of sociali- 
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ENERGY 


The concept of energy is no less difficult to 
explicate in psychology than it is in the other 
sciences. It is doubly hard to get hold of Freud's 


zation, adjustment, and personality, A certain 
amount of mutuality in empirical interest 
stemmed directly from World War II, where in 
some instances there was forced association be- 
tween professional psychologists and psycho- 
analysts, A further outcome of the war was the 
development of a vigorous subdiscipline of clini- 
cal psychology which from the beginning 
showed a strong affinity for the concepts and 
some of the procedures of psychoanalysis. The 
growing importance of clinical psychology and 
its acceptance within most major universities 
led to a great increase in the communication 
between psychologists and psychoanalysts. 

Independent of psychoanalysis there ap- 
peared in psychology a growing interest in 
motivation and motivational concepts (Me 
Dougall, Woodworth, Tolman, Allport) and 
this trend led to a natural reappraisal of the 
value of Freud’s psychology of motivation. An 
important integrative role was played by a few 
individuals such as Kurt Lewin and Henry A. 
Murray, who from within the confines of aca- 
demic institutions made earnest efforts to con- 
duct controlled empirical research that was 
related to, and in part inspired by, psychoanaly- 
sis. From these efforts, psychoanalysis received 
a growing lustre of investigative respectability 
and this in turn led other, more conventional 
psychologists to conduct related research. Dur- 
ing the late thirties the efforts that were made 
to bring about a rapprochement between Hull's 
reinforcement theory and aspects of psycho- 
analysis (Dollard, Miller, Mowrer, Sears) 
brought many experimentalists into at least 
temporary contact with Freud: conceptions of 
behavior. Finally, the period of the thirties was 
a time of unrest. Radical ideas were welcomed, 
social change was a matter of interest and ap- 
preciation, axioms were being challenged. In 
this setting the controversial nature of Freudian 
theory was more closely attuned to the spirit of 
the times than before. 

For all these reasons, psychoanalysis has be- 
come an area of great interest for psychologists. 
Today it seems evident that Freud’s position 
as a major figure in the development of psychol- 
ogy is secure. The benefits of this influence may 
still be debated, but its existence is no longer 
open to question. 
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notion of psychical energy because in his early 

theories he adopted a position from which he 

appears to have receded in subsequent reformu- 
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lations. The concept of libido, its definition as 
sexual energy, and its protean character in men- 


tal life are well known to every student of ` 


psychoanalysis. Yet the assumption that Freud 
emphasized the sexual impulse to the relative 
exclusion of other impulses, which is justified 
by Freude pre-1920 writings, does not hold for 
his final theoretical scheme. Freud did not help 
to clarity the changed state of affairs because 
he tended to keep old concepts while developing 
new ones. Thus when he formulated the polar 
instincts, life versus death, he was reluctant to 
part with a single form of energy, libido, which 
had worked so well for the polarity of the sexual 
instincts vs. the ego instincts. One of the reasons 
Freud abandoned the latter polarity was that 
inasmuch as libido governed both groups of in- 
stincts, they were not really opponents but 
partners in the enterprise of life. As a result 
of grouping the sexual and ego instincts under 
the life instincts, libido seemed to lose its close 
tie with the purely sexual instincts, to become 
desexualized, and to function as a general 
psychical energy which could rum all of the 
psychological processes, and not just those that 
are in the service of reproduction. In any event, 
libido is identified by Freud as the motive force 
of eros or the life instincts. 

What then is the energy of the death instincts? 
With this question Freud wrestles again and 
again. In one such encounter, he tries to pre- 
serve a monistic libido theory by deriving the 
energy of the destructive instinct from libido. 
He hypothesizes that there is a displaceable en- 
ergy which can be used indifferently for attain- 
ing erotic or destructive aims, but that this 
energy is itself desexualized libido (Freud, 
19232). However, neither this solution nor any 
other one satisfied Freud, because in his fi- 
nal summing up he states "there is no term 
analogous to ‘libido’ for describing the energy 
of the destructive instinct” (Freud, 1940, p. 22) . 
This statement suggests that Freud came to 
believe that there are two forms of energy, onc 
which governs the life instincts and all the de- 
rivatives therefrom and one which governs the 
death instincts and all the derivatives therefrom. 
It is not clear from Freud's writings whether 
these two forms are different or the same in 
quality. He does say that "nearly all the energy 
with which the mental apparatus is charged 
comes from the inborn instincts" (Freud, 1920, 

. 5), but this statement does not help us to 
decide whether he believed in one kind of en 
ergy or as many kinds as there are instinctual 
sources. In one of his writings Freud does imply 
a belief in a general instinctive energy because 
he criticizes Jung for making libido synonymous 


with psychic energy in general (Freud 1905, p. 
6). In this connection, Freud says that the 
differentiation of the libido from the other 
forms of instinctive energy represents one of the 
great achievements of psychoanalysis. These 
statements suggest that Freud did postulate a 
monistic psychical energy. But we cannot be 
sure. We turn now to a short account of libido 
theory, which will be elaborated in later sec- 
tions of this chapter. 


LIBIDO 


Freud’s clearest statement of what he meant 
by libido is found in Three Contributions to the 
Theory of Sex (Freud, 1905). In this mono- 
graph he defines libido "as a force of variable 
quantity by which processes and transformations 
in the spheres of sexual excitement can be meas- 
ured” (p. 611) and distinguishes it from general 
psychic energy by virtue of its origin in the 
erogenous zones of the body. Since any part 
or organ of the body can be an erogenous zone, 
i.e., seat of sexual excitement, the sources of 
libido are multifarious. The erogenous zones, 
Freud hypothesizes, are sensitized by chemical 
substances (hormones) secreted by the gonads. 

Libido either remains stored up in the ego 
or it is invested in sexual objects. In its stored- 
up state it is called ego libido, and in its in- 
vested state it is called object libido. When the 
investment is withdrawn, the libido may take 
the ego as an object and it is then known. as 
narcissistic libido, There is a reciprocity be- 
tween ego libido and object libido, so that the 
more there is of one the less there is of the other. 


CATHEXIS 


In one of his writings, Freud characterizes 
psychoanalysis as “a dynamic conception which 
reduces mental life to the interplay of recipro- 
cally urging and checking forces” (Freud, 1910b, 
p. 107). These forces are cathexes and anti- 
cathexes. Because cathexis refers to the way in 
which energy is distributed throughout the per- 
sonality it is a central concept in psychoanalytic 
theory. Yet its meaning is not always clearly 
understood. 

This much is plain, cathexis refers to an in- 
vestment or charge of energy. But what is it 
that becomes invested or charged with energy? 
Freud uses the term in such a way as to lead the 
reader to believe that either external objects or 
the ego itself can become invested with energy. 
If it is the former it is called an object cathexis, 
if it is the latter, an ego cathexis or narcissism. 
Narcissism results from the process of identifi- 
cation (incorporation) by which the ego assimi- 
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lates the object and secures for itself thereby the 
cathexis which formerly resided in the object. 
` Actually this is a very loose and misleading 
description of the cathexis process, and should 
not be taken in any literal sense. One does not 
invest energy in an object nor does one normally 
incorporate an object, except for food, liquids, 
and the ingredients of air. Rather, the invest- 
ments are made in psychological processes and 
their products, When Freud speaks of cathexes 
and anticathexes he is referring to the distribu- 
tion of energy throughout the whole psychic 
apparatus. That is to say, he views the person 
as a complex energy field and he seeks to under- 
stand the person in terms of the way his energy 
is distributed and for what purposes and with 
what consequences. In the most general sense, 
psychic energy may be said to be used for the 
purposes of living and dying, but this tells us 
no more than that a person lives and dies. 

Let us begin at the beginning. At first there 
is only a primitive psychic apparatus which is 
operated by a small quantity of mobile energy 
generated by the instincts. This energy dissi- 
pates itself in smooth muscle and striped muscle 
activity before it has a chance to accumulate. 
At this stage, one cannot speak of cathexis, since 
the energy is not invested in anything. It is 
spent as soon as it is earned. Environmental 
circumstances in conjunction with maturational 
developments force this reflex-like apparatus to 
undergo modifications in its structure, with the 
result that energy is used to operate such psy- 
chological processes as perception, memory, and 
thought. At this stage one can speak of the 
formation of cathexes because energy is invested 
in psychological processes and is not spent 
immediately in purely muscular and visceral 
activities, Energy flows into the perceptual, 
mnemic, and ideational systems and the con- 
tents of these systems, percepts, memories, and 
ideas, respectively, become charged with en- 
ergy. A high charge of energy is called a hyper- 
cathexis. As long as this charge remains on a 
content of a psychological system it is said to be 
bound. When the gateway into the motoric is 
opened, the energy becomes mobile and dis- 
charges itself through activity (Freud, 1900) . 

From this account, it can be seen how mis- 
leading it is to speak of an object cathexis. 
Properly speaking, one can cathect a perception, 
or a memory, or a conception of an object but 
not the object itself. Similarly, one can cathect 
a perception or a memory or a conception of 
one's self but not the ego itself. Therefore, 
when Freud speaks of object cathexis and ego 
cathexis or object choice and narcissism he 
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means the activation of a perception, memory, 
or thought whose reference is an external object 
and the activation of a perception, memory or 
thought whose reference is the self. Identifica- 
tion is simply a transformation of a cathexis for 
the perception of an object into a cathexis for 
a perception of the self. Incorporation, by the 
same token, represents a shift in cathexis from 
a psychological content which has an outer ref- 
erent to a psychological content which has an 
inner referent. The objective is transformed 
into the subjective. 

An anticathexis is a checking force in mental 
life. Energy is used for the purpose of resisting 
or inhibiting the discharge of energy through 
the motoric system. Since the aim of the primi- 
tive personality is to expel energy as soon as it 
is generated, any postponement of discharge 
requires an investment of energy in blocking or 
delaying processes. An example of a blocking 
process is repression, of a delaying process is 
thinking. 

Freud distinguishes between two kinds of 
frustration, one external, the other internal 
(Freud, 1915a). In external frustration the goal 
object is not available for reasons beyond the 
person’s control. This type of frustration does 
not involve anticathexis and might better be 
called privation or deprivation. Internal frus- 
tration proceeds from within the personality 
structure and consists of inner inhibiting forces, 
ie. anticathexes. The attempts of the per- 
sonality to overcome or circumvent these in- 
ternal frustrations, constitute the major task 
of psychological functioning. Freud observes 
that internal frustration does not come into ex- 
istence until external frustration has prepared 
the ground for it, by which he means that the ` 
person has to experience external privations 
and deprivations before he can develop inner 
controls. This is the basis for the formation of 
the ego and superego. 

Frustration, whether of the internal or ex- 
ternal variety, places a strain upon the per- 
sonality because the damming up of energy 
causes an increase in tension. The frustrated 
person can retain his mental stability by taking 
direct action on the world and wresting satisfac- 
tion from it, or he can use the energy for ends 
which do not have anticathexes attached to 
them (Freud, 1912) . 

From this brief account of cathexis and anti- 
cathexis we can see that these concepts are the 
very essence of Freud's dynamic personality 
theory, and that the measurement of these forces 
constitutes the chief task of the psychoanalytic 
investigation. 
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Tue SUBDIVISIONS OF PERSONALITY 


In Freud's final formulation of psychoanalytic 
theory (Freud, 1940) , the primary provinces of 
personality are identified as the id, the ego, and 
the superego. The original distinction between 
conscious and unconscious which were the theo- 
retical pillars of psychoanalysis prior to the 
formulation of the theory of the id, ego, and 
superego by Freud, first made public at the 
Berlin Psychoanalytic Congress in 1922, no 
longer enjoy this distinction. In fact, Freud 
relegates consciousness and the two kinds of 
unconscious, preconscious and unconscious 
proper, to the status of qualities. Thus, it is 
necessary in reading Freud’s publications prior 
to 1922 to substitute “id” for “unconscious” and 
"ego" for "preconscious" and "conscious" in 
many places. In the present account of psycho- 
analytic theory we have made these substitutions 
whenever it seemed appropriate to do so. Al 
though the contents and processes of the id are 
exclusively unconscious, one is not warranted 
in treating id and unconscious as equivalents, in 
spite of the widespread convention of doing so. 
The contents and processes of the id have the 
quality of being unconscious but the id is not 
the unconscious, except in a historical sense. 
By identifying that agency of the mental ap- 
paratus which Freud once called the uncon- 
scious as the id, we believe that much of the 
confusion which occurs when one reads pre-1922 
Freud and. post-1922 Freud should disappear. 
Moreover, although the entire id has the quality 
of being unconscious, much of the ego and a 
greater part of the superego have the same 
quality. Thus it is to compound an error to 
use “unconscious” as a synonym for the id. 

Although Freud finally demoted the concept 
of the unconscious to a mere quality of mental 
life (Freud, 1933) , the importance of the con- 
cept in the history of psychoanalytic theory, 
quite apart from its later identification as the 
id, should not be minimized. Freud felt that in 
order to establish psychology as a natural sci- 
ence, he had to destroy the popular notion held 
by psychologists and philosophers of the nine- 
teenth century that mental life consisted solely 
of that which is conscious. A psychology of 
consciousness limited itself, so Freud thought, 
to a very thin slice of mental life. Consciousness 
is only one quality or attribute of that which 
is mental, and a highly unstable one at that: 
Although later Freud was to characterize un- 
consciousness and preconsciousness in the same 
terms, except that they were more stable quali- 
ties than that of consciousness, by focusing upon 
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the unconscious as the larger part of what is 
mental he aspired to establish psychology as a 
complete science of the mind (Freud, 1938) . 

Since Freud does not equate mental with con- 
sciousness, his use of the term "mental" is 
unorthodox. The mental consists of somatic 
processes, which are largely unconscious, and 
they are similar in kind to all the other natural 
processes of which science has obtained knowl- 
edge (Freud, 1940). When Freud writes that 
“our scientific work in psychology will consist 
in translating unconscious processes into con- 
scious ones, and thus filling the gaps in conscious 
perceptions” (Freud, 1938), he means that the 
goal of psychology is to discover the essential 
nature of unconscious processes and their con- 
tents and to make this knowledge available 
to people so that they can become aware of that 
which they cannot experience directly. It is 
hardly necessary, one might add, to have a sci- 
ence of direct experience, since the goal of any 
science, whether it be physics or psychology, is 
to make known that which cannot be known 
directly. 


THE ID 


The id is the obscure, inaccessible, and un- 
organized part of personality. What little is 
known about it has been learned from study of 
the dream work and from investigations of the 
formation of neurotic symptoms. It has no 
direct relations with the external world but is in 
direct contact with somatic processes. Freud 
calls the id the "true psychic reality" (Freud, 
1900) , because it represents only the inner world 
and is not contaminated by objective (external) 
reality. 

Origin of the id. The id is the oldest of the 
mental systems. It consists of everything that is 
inherited, that is fixed in the constitution (in- 
cluding the instincts) , and that is present at 
birth. Experiences that are repeated often and 
intensely in individuals of successive generations 
are deposited in the id and become a part of 
the inheritance of succeeding generations. This 
is what Freud means when he says that the 
vestiges of countless egos are stored in the id 
(Freud, 1923a). In other words, Freud sub- 
scribes to the doctrine of acquired characters 
as the basis for the formation of the id. 
Throughout Freud's writings one finds reter- 
ences to the signal role of phylogeny in laying 
the foundation of the human mind. 

Contents of the id. “Instinctual cathexes 
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seeking discharge — that in our view is all that 
the id contains" (Freud, 1933, p. 105). This 
terse statement requires amplification, For 
Freud, an instinct is a distinctly psychological 
concept and not a biological one. It is defined 
as the mental (ideational) representation of a 
somatic (visceral) process (Freud, 1905) . These 
visceral processes are called "instinctual im- 
pulses,” and it is a looseness of terminology, 
Freud believes, to confuse instinctual impulses 
with the instinctual ideas which arise from them 
(Freud, 1915a). The id contains ideas of im- 
mediate gratification, ie, wishes. All of the 
energy of the id is invested in wishes, so that 
the id can only do one thing, namely, to wish. 
Thus wishes can be substituted for instinctual 
cathexes seeking discharge and the statement 
would then read "the id contains only wishes." 
It is essential to bear in mind the distinction 
between impulse and wish. Impulse is the so- 
matic counterpart of a wish and pari passu wish 
is the psychological counterpart of an impulse. 

The id knows no reality other than a subjec- 
tive, autistic reality. It is without values, ethics, 
and morality. The contents of the id are im- 
mortal, for they do not alter with the passage 
of time. Nothing in the id is past or forgotten. 
The laws of logic and reason do not hold for 
the id, so that contradictory wishes can exist 
side by side without negation of one by the 
other. The energy of the id is in a more fluid 
state than is the energy in the other regions of 
the personality (Freud, 1933). Freud’s meta- 
phor for the id is “a cauldron of seething excite- 
ment.” 

Functions of the id. The sole purpose of the 
id is to provide for the free and untrammeled 
discharge of quantities of excitation which arise 
from somatic sources. In fulfilling this sovereign 
purpose the id is said to obey the pleasure prin- 
ciple and to make use of the primary process. 

The pleasure principle is a tendency whose 
function is to render the psychic apparatus as 
a whole free from any excitation (this is called 
a state of Nirvana), or in lieu of this ultimate 
achievement to keep the amount of excitation 
constant and as low as possible. The aim of 
the pleasure principle differs in no respect from 
the universal tendency of all living matter, 
which is to strive to return to the quiescence of 
the inorganic world (the death instinct). Al- 
though the pleasure principle, through its agent, 
the primary process, tries to counteract excita- 
tions whose sources lie in the external world 
by discharging them as fast as they arise, its 
main task is to deal with the inner increases in 
tension by keeping them at a minimum through 
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providing a direct path from excitation to dis- 
charge (Freud, 1920). 4 

While the id knows no other principle of 
functioning than that of pleasure, the other 
agencies of personality, viz, the ego and super- 
ego, also bend to the will of the pleasure prin- 
ciple. The ego and superego may modify the 
pleasure principle in various ways but they 
cannot nullify it. Freud considered it to be a 
question of the greatest theoretical importance 
how it is ever possible for the pleasure principle 
to be overcome. “The consideration that the 
pleasure principle requires a reduction, or per- 
haps ultimately the extinction, of the tension of 
the instinctual needs leads to problems that are 
still unexamined in the relations between the 
pleasure principle and the two primal forces, 
Eros and the death instinct" (Freud, 1940, p. 
109) . 

The psychic process employed by the id is 
called the primary process (Freud, 1900) . This 
process consists of discharging tension as soon 
as any tension reaches the id. The first and 
most primitive operation of the primary process 
is to expend energy immediately in motor activ- 
ity. Examples of this are the emptying of the 
bladder's contents when the pressure on the 
sphincter reaches a certain level, or the mass 
activity of a baby when it has colic. 

For several reasons, this reflex action phase of 
the primary process does not serve effectively 
the dictates of the pleasure principle. In the 
first place, many of the somatic sources of ten- 
sion do not have built-in tension-reducing mech- 
anisms, as do urination and defecation. Most 
of them are like hunger, the tension of which 
can be reduced only by a specific kind of com- 
merce (eating) with a specific kind of object 
(food). Moreover, a reflexly discharging organ- 
ism is bound to encounter many rebuffs from 
the external world, so that instead of finding 
the pleasure which it seeks it finds pain, ie., 
increased tension. 

Consequently, the primary process develops 
another method for discharging tension. This 
method consists of producing a hallucination of 
the desired object. Thus the hungry individual 
attempts to rid himself of tension by cathecting 
a hallucinatory perception of food. 

This function of the id presupposes that the 
organism has had previous experiences with 
particular goal objects for satisfying particular 
needs and that it is capable of perceiving and 
remembering. It must be able to perceive food 
and to remember that it has perceived food, so 
that when it becomes hungry again the tension 
of hunger can activate the memory of food and 


e 


150 


reinstate the associated perception (hallucina- 
tion) of food. This is what Freud means when 
he says the primary process strives for discharge 
of excitation by establishing an identity of per- 
ception. Identity of perception refers to the 
equivalence of a centrally aroused image of food 
with a perception of food aroused by an actual 
food stimulus. 

The process which begins from a state of 
tension (need) and proceeds to the cathexis 
of perception by way of memory is what Freud 
calls wish-fulfillment. A wish may be defined 
as a cathexis for the perception of an object that 
will satisfy the wish, i.e., discharge the tension. 
Accordingly, the sole psychological function 
with which the id is endowed is that of generat- 
ing wishes. 

Wishing is said to be regressive in character, 
since it reverses the usual course of the excita- 
tion from perception to memory. Instead, the 
excitation flows backward from mnemic system 
into the perceptual system. The resulting hal- 
lucination proves, in the long run, to be no 
more satisfying than was the earlier reflex dis- 
charge of tension through the motoric. 
Consequently, the primary process is super- 
seded by another one which Freud calls the 
secondary process. The development of the 
secondary process goes hand in hand with the 
formation of a new structure in the personality, 
the ego. The secondary process is the process 
by which the ego accomplishes its main objec- 
tive, to wit, to be ruled by the reality principle 
instead of the pleasure principle. We shall take 
up the discussion of this process in the next 
section. 


THE EGO 


The ego is the organized subdivision of per- 
sonality. It is the executive, embracing all of 
those functions which are thought of as being 
distinctly psychological, viz., perception, learn- 
ing, memory, and reasoning. It occupies a cen- 
tra] position in the personality, standing 
between the id and the external world, the 
superego and the external world, and the id and 
the superego. Like the administrative head of 
a democratic political system, it both governs 
the other agencies of personality and is governed 
by them. Moreover, it has the responsibility of 
conducting all transactions with the outer 
world. 

Origin of the ego. Under the influence of the 
external world acting through the perceptual 
system one part of the id undergoes a special 
development in order to act as an intermediary 
between the id and the external world (Freud, 
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1940). This is the ego. Heredity influences 
the ego by way of the id. It may be said that 
even before the ego comes into existence its 
subsequent lines of development, its predisposi- 
tions, and its reactions are already determined 
by heredity (Freud, 1937a) . Thus, the ego is 
not the product of experience alone but of 
experience acting upon a prepared organism. 

The boundary between the ego and the id 
consists of resistances or anticathexes. In a 
healthy person, the amount of energy which goes 
into maintaining the ego-id barrier is kept to a 
minimum, while in a neurotic person a great 
deal of energy is utilized for this purpose. 

Functions of the ego. The primary function 
of the ego is to protect the life of the individual 
against the dangers which arise in the external 
world. To fulfill this objective, a number of 
differentiated psychological processes and sys- 
tems are developed in the ego. It becomes aware 
of the external world by means of stimuli acting 
upon the perceptual system. It stores up €x- 
periences of stimuli in the mnemic system. It 
learns either to adapt to the external world or 
to bring about changes in the external world 
through activity. Freud believed that mastery 
of the environment constitutes the highest 
achievement of the ego (Freud, 19262) . Think- 
ing and judging, along with perception and 
memory, serve to inhibit the immediate dis- 
charge through motor pathways of excitations 
reaching the ego. The ego, therefore, is said to 
control the path of access to motility. 

The ego's aim of self-preservation requires 
that it gain control over the demands of the 
instincts in the id, since the immediate gratifi- 
cation of the instincts may endanger the life of 
the person. The ego decides whether an instinct 
shall be allowed to obtain satisfaction. If the 
decision is in the affirmative, the time and cir- 
cumstances for its satisfaction are determined 
by the ego. If the decision is in the negative, the 
ego has the task of suppressing the instinctual 
excitations or of diverting them into other chan- 
nels. 

The ego binds the mobile energy of the id by 
transforming it into a quiescent, tonic charge 
(Freud, 1920). This is accomplished by the 
development of a secondary process which takes 
the place of the primary process of the id. The 
secondary process arranges mental activities in 
sequential order and tests their correspondence 
with reality. This is called reality testing 


` (Freud, 1923a) . The secondary process may even 


cathect that which induces pain if by so doing 
it is in accordance with the reality principle by 
which the ego is governed. After the ego has 
completed its work of thought, it removes the ` 
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inhibition and permits the release of energy 
through motility. 

In the last analysis, the ego in a normally 
well-adjusted person is the managing director 
of the personality, It synthesizes the forces act- 
ing upon it from the id, from the superego, and 
from the external world into a harmonious 
unity under the wise and benevolent jurisdic- 
tion of the reality principle. 

Reality principle. As a consequence of the 
failure of the primary process to provide grati- 
fication, there is formed out of the id a new 
mental agency, the ego, which is governed by a 
new principle, the reality principle (Freud, 
1911). The aim of the reality principle is to 
inhibit the discharge of excitation until the ap- 
propriate object in or condition of the environ- 
ment for the satisfaction of the need is 
discovered. The substitution of the reality prin- 
ciple for the pleasure principle does not mean 
that pleasure is forsaken. The reality principle 
serves the pleasure principle by postponing an 
immediate gratification, which is likely to bring 
pain in its wake, for a later gratification in 
which the risk of suffering pain is considerably 
reduced. All of the complicated mental activity 
which is enjoined by the reality principle repre- 
sents a roundabout way to the ultimate goal of 
pleasurable gratification. 

The inauguration of the reality principle calls 
for a succession of adaptations in the mental 
machinery, which are required by the im- 
portant role that the external world plays in 
the satisfaction of needs. The perceptual system 
develops more precise and accurate ways of 
reflecting external reality. A special faculty, 
attention, is instituted whose function it is to 
search the outer world for items of knowledge 
which can be used in the interest of need satis- 
faction. There is introduced a system of nota- 
tion, language, whose task is to preserve the 
results of attending and perceiving in the mne- 
mic system. Judgments are made according to 
the criteria of truth and falsity instead of those 
of pleasure and pain. Motor discharge is re- 
strained by means of the process of thought, 
which is an experimental way of acting that 
involves smaller quantities of cathexis and less 
expenditure of them. There is even provided 
for within the ego a mode of thought activity 
free from the demands of reality testing which 
consists of creating fantasies and daydreams. 
This is a self-conscious and rather more con- 
trolled use of the primary process, which the 
ego recognizes for what it is, namely, a pleasur- 
able pastime, and does not confuse with reality. 
Finally, there is a change in the motoric system 
so that it becomes more capable of manipulating 
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and modifying reality through skillful and effi- 


cient action. 

Secondary process and reality testing. The 
primary process of the id, it will be recalled, con- 
sists of discharging energy according to the 
model of a reflex apparatus, Input and output 
of excitation are immediate reflex actions. If 
the output cannot be effected because the object 
which will end the inner tension is absent, then 
the primary process hallucinates the desired 
object. When this operation proves to be un- 
gratifying an ego process is developed. The 
excitation is diverted by this process into the 
activities of thinking and judging, which seek 
to find the desired object in the real world. 
The process of reasoning out a way of discover- 
ing the appropriate goal object is called the 
secondary process (Freud, 1900). When one 
thinks he has found a satisfactory solution to 
a problem, the verification of the solution by 
trying it out is called reality testing. The secon- 
dary process is mental problem solving or 
experimental action, as Freud labels it, whereas 
reality testing is the behavioral means for veri- 
fying the truth of a thought or perception, i.e., 
its existence in reality. 


Perception and memory. All psychic activities 
proceed from stimuli acting upon sense organs 
and terminate in innervations of motor organs. 
That is, the psychic apparatus has a sensory end 
and a motor end. At the sensory end lies the 
perceptual system which receives excitations 
from the external and internal sense organs. 
The function of the perceptual system is to dis- 
tinguish among sensory qualities. The sensory 
qualities of perceptions which originate in the 
external world are many and varied but those 
which come from the viscera are restricted to 
the pleasure-pain series, ie., feelings (Freud, 
1900) . 


Perception is not merely a passive process. 
Small amounts of energy are sent out periodi- 
cally from the reservoir in the ego into’ the 
perceptual system and sample the stimuli acting 
upon the sense organs (Freud, 1925a). The 
functioning of the perceptual system requires 
energy, as does every system of the personality. 
When the energy is withdrawn the perceptual 
system ceases to function. 


The relation of perception to consciousness and 
the question as to whether there are unconscious 
perceptions are not clearly formulated by Freud. In 
some of his writings he seems to equate perception 
with consciousness, so that everything which is per- 
ceived must have the quality of consciousness, while 
in other writings Freud implies that there are un- 
conscious perceptions. If one accepts the hypothesis 
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that the quality of consciousness is produced by a 
hypercathexis of psychic material, then it is quite 
conceivable that some perceptions might have in- 
sufficient cathexis to raise them to consciousness yet 
have enough to permit their existence unconsciously. 
Faced with the necessity of making a decision be- 
tween these two alternatives, we have decided in 
favor of the view which ascribes to all perceptions 
the quality of consciousness. Our decision rests upon 
the following statement made by Freud, ". . . as 
soon as the cathexis is withdrawn [from the per- 
ceptual system], consciousness is extinguished and 
the functioning of the perceptual system comes to a 
standstill” (Freud, 1925b, p. 180). 


One may perceive, ie, be conscious of, 
thoughts as well as sensory excitations. This 
is accomplished by the association of ideas via 
the function of speech with memory traces of 
the perceptions of written or spoken words. 
Since ideas can become conscious, it is necessary 
to have a way of distinguishing between a per- 
ception whose source is a thought and a per- 
ception whose source is a sensory stimulation. 
The former is what Freud terms subjective real- 
ity, the latter objective or material reality. In 
order not to confound the two, the ego resorts 
to reality testing. This process consists of per- 
forming some action upon the external world. 
If the perception is made to disappear by such 
action, then the origin of the perception is 
known to lie in the external world. If such 
activity makes no difference to the perception, 
then the perception originates within the person 
and it does not have objective reality (Freud, 
1916). 

The perceptual system retains nothing; it has 
no memory. Accordingly, another system, the 
mnemic system, which lies next to the percep- 
tual system in the sequence of psychic activities 
{rom sensory input to motor output, is postu- 
lated. This system consists of memory traces 
which are the permanent deposits of percep- 
tions. 

Memory traces may be associated with one 
another, Freud conceives of association as a 
lessening of resistance and a smoothing of the 
way from one trace or mem-element to another 
trace or mem-element. In terms of cathexis, 
association consists of an unobstructed flow of 
energy [rom one trace to another trace. Associa- 
tions are formed when the percepts which left 
the traces occur together (law of contiguity) or 
when there is a similarity between the percepts 
(law of resemblance) or when other laws of asso- 
ciation operate on the percepts. Freud thought 
that there might be many mnemic systems or 
mem systems, as he dubbed them, each of which 
would be founded upon a particular law of 
association. (Freud, 1900). 


Memories that are painful to the person ac- 
quire a charge of anticathexis which renders 
their entry into consciousness difficult or impos- 
sible. They are said to be repressed. A memory 
which is not painful in itself may be repressed 
because it is associated with another memory 
which is painful. Thus we forget neutral or 
innocuous experiences because they are tied 
by cathexis to disagreeable experiences. This 
is the essential feature of Freud's theory of for- 
getting (Freud, 1904) . 

In a brief article published in 1925 Freud 
draws a parallel between the operation of a 
child's toy, the "mystic writing pad," and the 
functioning of perceptual-mnemetic systems 
(Freud, 1925b). This toy consists of an outer 
celluloid cover on which the child writes with 
a stylus, a thin sheet of waxed paper on which 
the impression made by the stylus is recorded, 
and a slab of wax or resin which is soft enough 
to permit indentions to be made in it, so that 
an impresion may be made on the covering 
waxed paper. When the child wishes to erase 
what he has written he merely lifts the outer 
cover and the thin undersheet, and the writing 
disappears. Consequently, the pad can be used 
over and over again. 

The parallel between this toy and the per- 
ceptual and mnemetic systems is as follows. The 
outer cover represents the perceptual system's 
protective screen against stimuli, the waxed 
paper is like the perceptual system, which re- 
ceives perceptions but does not retain them, 
and the under thickness of wax is analogous 
to the mnemetic system, which preserves the 
permanent traces of the perceptions. By imagin- 
ing the toy to be written on and erased in rapid 
alternation, one has a picture of how perception 
operates. 

Freud believed that memory traces could be 
inherited, so that the mind preserves not only 
what happens to it in the course of an indi- 
vidual's life but also what happened to the 
ancestors of the individual. Freud felt that by 
the analysis of dreams one might obtain knowl- 
edge of the archaic memory of man (Freud, 
1900) . 

Thought and judgment. The major objec- 
tive of the ego is to inhibit the outflow of 
excitations in the interest of the reality prin- 
ciple. The postponement of discharge is ac 
complished in large measure by the ego's 
capacity for thought. Thought is to be regarded 
as an experimental action which involves only 
a small expenditure of energy (Freud, 1925a) - 
It makes use of experiences stored in memory. 
Thinking employs the secondary process for 
securing an identity of thought with reality. 
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When thoughts are congruent with reality, the 
ego opens the sluice gates of the motoric system 
and an appropriate response is made. 

Incorrect thinking, such as occurs in conden- 
sations, displacements, confusions, and errors, 
- results from an intrusion of id processes. The 
ideas of the id are concrete and thing-bound. 
Thus a person who confuses a hole with a vagina 
does so because the id reacts solely to the physi- 
cal similarity of the two objects (1915b). The 
symbolism of dreams is but one example of the 
id's influence on thought. Autistic thinking in 
general, except insofar as it stems from the spe- 
cial fantasy-making region of the ego, is id-domi- 
nated thinking (Freud, 1900) . 

Judgment has two functions. It may assert 
or deny that a thing has a particular property 
or it may affirm or negate that a particular 
image exists in reality. The original property 
of an object which comes under the scrutiny of 
judgment is whether the object is likely to cause 
pleasure or to cause pain. The prototype for 
judging pleasure and pain is that of affirmation 
through incorporation, e.g., swallowing, or nega- 
tion through expulsion, e.g., spitting out. When 
something is affirmed as being pleasurable the 
love instinct is in control, whereas when some- 
thing is expelled as being painful the destructive 
impulse is ascendant. With the development 
of the ego, qualities of objects other than the 
pleasure-pain ones assume importance for the 
proper functioning of the ego. 

The highest development of judgment con- 
sists of determining whether a psychic state is 
real or not. It accomplishes this aim by redis- 
covering in the world, through perception, the 
referent of the thought. If it cannot be found 
in the world the thought is a delusion and is 
discarded. It is precisely in this way that a 
scientist attempts to test his hypotheses, i.e., his 
thoughts about some aspect of reality (Freud, 
1925a) . 

The ego in relation to the id. The ego, as has 
already been noted, originates out of and is the 
organized portion of the id. Consequently, the 
relationship between them is a very intimate 
one. There is no natural antagonism between 
the ego and the id, and in the mature, well- 
integrated person there is really no division 
between them (Freud, 1926a). They work as 
a unit under the guidance of the reality prin- 
ciple, which seeks to gratify the wishes of the 
id in a manner that does not violate the dictates 
of reality. If a split does occur between the ego 
and the id, the weakness of the ego then becomes 
apparent. When this happens the ego is forced 
to substitute rationalization for logical thought 
to gloss over the conflicts between the id and 
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reality, and to engage in other pretenses (Freud, 
1933). But when the ego retains firm ties with 
the id and is indistinguishable from it, then the 
ego displays its great power (Freud, 1926b) . 

The ego has no energy of its own but must 
borrow it from the id. Sometimes it resorts to 
trickery to withdraw further amounts of energy 
from the id. One such trick is that of identifi- 
cation. Since all object cathexes proceed from 
the id, the ego, by identifying itself with an 
object, recommends itself to the id in place of 
the object and seeks thereby to attract the energy 
of the id onto itself. Consequently, the ego takes 
into itself a large number of precipitates of for- 
mer object cathexes (Freud, 1933). 

The ego's relations with the id are influenced 
by the superego, which is partly a reaction for- 
mation against the id. Consequently, the ego 
is forced into the awkward position of serving 
two masters who are at odds with each other. 


THE SUPEREGO 


The third major subdivision of personality, 
the superego, represents morality. It is the last 
of the provinces of personality to be formed and 
reflects the standards of the society in which the 
child is raised, The superego strives for perfec- 
tion. 

The superego has two aspects, one the con- 
science, the other the ego ideal. The ego ideal 
sets the standards of ethical conduct, while the 
conscience acts in the capacity of a judge who 
metes out punishments for infractions of the 
rules. 

Origin of the superego. The superego is 
formed out of the ego in childhood as a conse- 
quence of the dissolution of the oedipus com- 
plex (Freud, 1940). This dissolution is 
accomplished by substituting identification with 
the parents for feelings of love and hate toward 
the parents. The parents or, more precisely, 
their qualities as punishing and rewarding fig- 
ures in the early life of the child, are introjected 
into the ego and become a separate institution, 
the superego, which performs the same functions 
as the parents once did. The punishing qualities 
of the parents become the conscience and the 
rewarding qualities become the ego ideal. When 
the ego does not live up to the expectations of 
the ego ideal, conscience punishes the ego by 
making it feel guilty. When the ego does man- 
age to fulfill the ideals of the superego, it is 

rewarded by feelings of pride. The superego 
is modelled upon the earliest parental images, 
when the parents were thought to be perfect 
and omnipotent. These early images of the 
parents affect the child's later relationships with 
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other authority figures and with the laws and 
ideals of society (Freud, 1933). 

The superego is derived primarily from audi- 
tory impressions received from the parents in 
the form of commands, accusations, threats, and 
exhortations (Freud, 1923a). As a consequence 
of its origin in auditory perceptions, the super- 
ego remains accessible to consciousness, al- 
though it is for the most part unconscious. A 
person can remember, in substance at least, 
what his parents once said to him because it has 
been incorporated and is permanently retained 
by the superego. The voice of conscience is a 
transcription of the parents’ voices. However, 
the cathectic energy of these verbal remnants 
does not originate from the auditory perceptions 
as such but from sources in the id. That is, since 
the superego represents an introjection of the 
earliest object choices of the id, i.e., the parents, 
the cathectic energy in the superego is derived 
from the strength of the cathexis for the parents. 
In other words, the severity of the superego is 
determined not by the severity with which the 
parents treated the child but by the intensity 
of the cathexis for the parents and the con- 
sequent strength of the resistance against the 
dissolution of the oedipus complex. ‘The 
stronger the oedipus complex is, the stronger 
is the superego, since the energy of the former 
flows into the latter when the oedipus complex 
is abandoned (Freud, 1940). 

In general, parents follow their own super- 
egos in bringing up children, The result is 
that the child’s superego is really not a reflection 
of the parents’ behavior but is a reflection of 
the parents’ superego. In this manner, the tra- 
ditional values and ideals of a society are handed 
down from generation to generation (Freud, 
1933) . 

Functions of the superego. 'The functions of 
the superego are far simpler than those of the 
ego. Representing as it does parental authority, 
the superego behaves toward the ego as the 
parents once did toward the child. When the 
ego breaks a superego rule, the superego pun- 
ishes the ego by making the person feel guilty 
in the same way that a child is made to feel 
guilty by parental admonition. The superego 
withdraws its approval as the parents once with- 
drew their love, so that the reproaches of con- 
science correspond to the child's fear of losing 
his parents’ love. The superego commands, cor- 
rects, and judges the ego in a wholly parental 
fashion. In a similar way, the superego bestows 
its favor upon the ego when the ego defers to 
the superego. The ego is filled with pride when 
it measures up to the ego ideal, just as a child 
feels proud when his parents praise his virtuous 


conduct. The conscience has the specific task of 
keeping the ego under surveillance to see 
whether it lives up to the ego ideal (Freud, 
1940) . 

All in all, the chief function of the superego 
is to limit instinctual gratifications, not in ac- 
cordance with the reality principle which guides 
the ego's control over the impulses, but in ac- 
cordance with perfectionistic and unrealizable 
standards imposed by the parents or, more cor- 
rectly, by the superegos of the parents. This 
aim of limiting instinctual gratification is due 
to the circumstance that parental authority is 
exercised mainly in suppressing or regulating 
sexual and aggressive impulses. Parental rules 
of conduct are categorical imperatives and, 
therefore, it is the categorical imperative which 
rules in the superego. 

The superego in relation to the id. The id 
and the superego have one thing in common; 
they both represent influences of the past on the 
individual. In the case of the id the past is con- 
tained in the individual's biological inheritance, 
while in the case of the superego the past is 
represented by the social tradition in which the 
individual grows up. Since Freud believed that 
social tradition can become a part of one's bio- 
logical inheritance, the past of the id and the 
past of the superego are considered to have 
much in common. This belief moved him to 
observe that “what belongs to the lowest depths 
of the mind is changed through formation of the 
ego ideal (superego) into what we value as the 
highest in the human soul" (Freud, 1923a, p. 
48). 

Leaving to one side these biological specula- 
tions, the superego can be said to be the re- 
ceptacle for the earliest object choices of the id. 
Thus the id cathects the superego as it once did 
the introjected objects of the superego and uses 
it as an ally against the ego. The superego can 
act as the representative of the id in relation to 
the ego (Freud, 19232) . 

However, the superego contains more than 
the object choices of the id, It also contains 
reaction formations against these choices. The 
superego not only says in effect “you ought to be 
like your father," but it also says "you may not 
do all that he does," e.g., have sexual relations 
with the mother (Freud, 1923a) . 

The relations of the ego and superego. Al- 
though the superego develops out of the ego as a 
consequence of the resolution of the oedipus 
complex, its relation to the ego is a burdensome 
one for the ego. In the first place, the super- 
ego develops during a stage when the ego is still 
so weak that it is unable to cope realistically 
and effectively with the demands of this new 
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province upon it. In the second place, since 
the superego contains the oldest object choices 
of the id it is a fellow conspirator of the id 
against the ego. And thirdly, introjected ma- 
terials of the superego were once real persons in 
the environment of the individual, so that the 
superego may be said to be a representative of 
the outer world. In every sense, the superego 
stands in opposition to the ego. When the ego 
is strong it can perform its synthetic function 
and effect a fusion of the two forces, the id and 
the world, which the superego represents, but 
when it is weak, as it is when the superego is 
formed, it is more likely to be ruled by than to 
rule the superego. By establishing the superego 
the ego masters the oedipus complex, but in so 
doing it places itself again in subjection to the 
id and to a small extent to the external world 
as well. Because the ego is the representative 
of the internal subjective world, conflicts be- 
tween the ego and the superego arise over the 
distinction between what is real and what is 
mental (Freud, 1924) . 

The expression of tension between the ego 
and the superego manifests itself in feelings of 
guilt, whereas the expression of harmony be- 
tween the two is felt as pride. The ego is 
constantly measuring itself against the perfec- 
tionistic standards of the superego and striving 
to attain them. In a sense, the displacement of 
libido to the ideals of the superego and the 
attainment of these ideals represents a recovery 
of the primary narcissism which was lost when 
the ego was formed (Freud, 1914). The ego 
can now love itself by living up to its ideals, a 
kind of bootstrapping which is not at all un- 
common in the relations between the various 
agencies of personality. On the other hand, the 


ego can gain the condemnation of the superego 
by identifying with objects and persons who 
have been rejected as being unworthy by the 
superego, Thus self-love and self-hate both 
have their sources in the kinds of identifications 
made by the ego. 

As long as all goes well in the life of a person, 
the superego is lenient and lets the ego do 
pretty much as it pleases, just as a parent, when 
he is in good spirits, takes an indulgent attitude 
toward the child’s conduct. However, should 
a calamity occur, the superego goes into action, 
holds an inquisition, discovers the sin, heightens 
the standards, imposes abstinences, and demands 
penance from the ego (Freud, 1930). This sud- 
den transformation of the superego reminds one 
of a similar transformation in a parent who 
discovers that his child has broken a favorite 
possession. The superego pursues its own ends 
and is independent of the ego as regards the 
energy at its disposal. 

In concluding this section on the subdivisions 
of personality it is well to bear in mind what 
Freud said about the dividing up of personality 
into separate provinces. He felt that there are 
no sharp boundaries between the three major 
provinces; rather they are like areas of color 
which shade off into one another. After separat- 
ing them “we must allow what has been sepa- 
rated to merge again” (Freud, 1933, p. 110). 
Freud conceived of the total personality of the 
normal person as working as a unit and not as 
three separate systems in opposition to one 
another. The latter state of affairs characterizes 
the neurotic individual who is out of joint with 
himself. For such a person, psychotherapy con- 
sists of effecting a psychosynthesis. 


Tue MENTAL QUALITIES 


A content of the mind may have one of three 
qualities. It may be conscious, preconscious, or 
unconscious. Only those contents which are the 
result of ego processes can be conscious. Neither 
the id nor the superego can make anything 
conscious, On the other hand, not all that the 
ego produces is conscious. Most of it, in fact, is 
preconscious. 

The contents of the ego which may become 
conscious are perceptions, feelings, and thoughts 
(ideas). A perception is a response to a par- 
ticular quality of an object in the external 
world. When a person says that he sees a red 
table, i.e., is conscious of, aware of, or has an 
image of a red table, he is perceiving a stimulus 
property red. Feelings are qualities of the 
pleasure-pain series which have their sources 
in the processes of the id. 


Freud was not certain as to just what processes in 
the id were responsible for feclings of pleasure and 
pain. At one time he felt that an increase of in- 
stinctual tension was painful, a decrease pleasurable, 
but he soon saw that this could not be true, since 
there are some tension increases, notably those which 
occur in sexual excitement, which are clearly pleas- 
urable. Subsequently he decided that pleasure and 
pain depend upon a qualitative factor in the tension, 
e.g., the rhythm of tension changes, rather than upon 
a simple quantitative lowering or raising of tension 
(Freud, 1924) . 


A feeling can never be preconscious or uncon- 
scious, although the processes which give rise 
to it are preconscious or unconscious (Freud, 
19233). Thoughts which are originally precon- 
scious become conscious by being connected 
with the memory traces of speech symbols. 


LJ 
D 


156 HANDBOOK OF SOCIAL PSYCHOLOGY 


These memory traces have been etched in the 
mind by the perception of verbal and written 
material. In order for a thought to become 
conscious it must have a connecting link with 
language (Freud, 1939). Infantile memories 
cannot become conscious because they have no 
associations with language symbols. 1 

The quality of consciousness is the only one 
of the three qualities known to us directly. 
What it means for a mental content or process to 
be preconscious or unconscious is a “shrouded 
secret" (Freud, 1940). Freud believed, how- 
ever, that it has primarily to do with the charge 
of energy (cathexis) on the percept or idea. 
If the cathexis is weak the idea is preconscious, 
and if the cathexis is still weaker the idea is 
unconscious. However, one must take into con- 
sideration not only the cathexis of the process 
itself but also the possibility of there being 
another process whose cathexis acts in opposi- 
tion to the first one. The net cathexis that 
remains after the second one (the anticathexis) 
has been subtracted from the first one will de- 
termine with what quality, whether conscious, 
preconscious, or unconscious, the process is 
endowed. 

In Freud's earlier theorizing, when uncon- 
scious, preconscious, and conscious were con- 
ceptualized as regions of the mind, he pictured 
the anticathexes or resistances as functioning 
at the boundary between the conscious and pre- 
conscious, and the boundary between the 


preconscious and unconscious (Freud, 1900). 
These resistances constitute the censorship 
operating upon material striving to cross the 
border in the direction of becoming conscious. 
The terminology used for describing this earlier 
model of the mind perseverates in Freud’s later 
writings and causes some confusion. However, 
it seems quite clear to us that when Freud 
changed his model from a conscious-precon- 
scious-unconscious one to an id-ego-superego 
one, he intended to do away with the conception 
of boundaries or barriers between consciousness 
and unconsciousness, and to replace a geographi- 
cal plan with a dynamic one. The conscious or 
unconscious quality of a percept or idea is 
determined by the sum of positive and negative 
charges of energy. Consequently, one may speak 
of an idea as being nearer or farther away from 
consciousness, depending upon its charge and 
opposing charges. An idea that has a strong 
unconscious quality, i.e., “is buried deep in the 
unconscious” is either one which has a weak 
charge on it or one which has a strong anti- 
cathexis, or both. The qualitative categories of 
unconscious and preconscious are merely con- 
venient classes for grouping material which 
differs quantitatively but not qualitatively. In 
other words, there is a continuum from the 
weakest possible unconscious idea to the strong- 
est possible conscious one. Where one draws 
the lines between the three qualities is purely 
arbitrary. 


‘THE INSTINCTS 


In this section we consider Freud's motiva- 
tional theory which, like his theory of the struc- 
ture of personality and his theory of anxiety, 
underwent such drastic transformation in the 
early twenties that few of Freud's associates were 
willing to accept the reformulation (Freud, 
1920). Psychoanalysts had become used to the 
older libido theory, so that the subordination 
of sex to the hegemony of the life force and 
the introduction of a powerful death force 
upset the theoretical apple cart. Freud was 
not able to put the apples back in the 
cart again to his own satisfaction let alone that 
of his colleagues. To this day, the relationship 
of the new motivational theory to other psycho- 
analytic theories has not been satisfactorily 
worked out. The death instinct, in particular, is 
an embolism in the psychoanalytic body which 
has yet to be absorbed. It may be fairly said 
that the resistance to the acceptance of a death 
instinct outweighs the resistance to any other 
feature of psychoanalysis, even that of infantile 
sexuality, which aroused so much animosity in 
the early years of psychoanalysis. 


DEFINITION AND ORIGIN OF INSTINCT 


An instinct is the mental representative in 
the id of an inner somatic source of stimulation 
(Freud, 1905). A stimulus arising from an €x- 
citing process in an organ or bodily tissue is 
called a need. 


Freud tends to confuse the terms need and instinct 
in his writings, although it is clear to us that he 
intended to distinguish them as separate concepts. 
Need is a biological concept, instinct a psychological 
one. Need is to instinct as stimulus is to perception. 


The study of needs or the sources of instincts 
lies outside the province of psychology. Instinct 
is “a borderland concept between the mental 
and the physical, being both the mental repre- 
sentative of the stimuli emanating from within 
the organism and penetrating to the mind, and 
at the same time a measure of the demand made 
upon the energy of the latter in consequence 
of its connection with the body" (Freud, 1915¢ 
p. 64). 

A need differs from an external stimulus in 
several ways. The source of the two is different, 
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since a need is located within the body while 
an external stimulus is located in the outer 
world. A need operates as a more or less con- 
stant force, whereas a stimulus usually has a 
momentary impact. The most important dif- 
ference is that one can escape from an external 
stimulus by flight, whereas one cannot escape 
from a need by this means. External stimuli 
are more easily mastered than are the instinctual 
stimuli and the latter make greater demands 
upon the person because they require more 
complicated activities in order to be satisfied. 
Freud believed that needs and not external 
stimuli are the more important cause of the 
development of the nervous system. However, 
he asserts that there is nothing to prevent one 
from assuming that the needs themselves are 
hylogenetic residues of external stimulation 
(Freud, 1915c). 


CHARACTERISTICS OF INSTINCTS 


In addition to their sources in bodily need 
states, instincts have an impetus, aim and an 
object. The impetus of an instinct is its motor 
element, i.e., the quality of impulsion or activity 
which characterizes all instincts. The aim of 
an instinct is to abolish the tension created by 
the need. Thus an instinct is said to be con- 
servative in nature, since it seeks to re-establish 
a basal tension level which existed before the 
need arose, i.e., a return to an earlier state. In 
accomplishing this aim the individual experi- 
ences pleasure. The object of an instinct is, as 
a rule, a particular kind of commerce with an 
object in the external world. The aim and the 
object of an instinct can be altered or exchanged 
for other aims and objects. However, the object 
is the most variable thing about an instinct and 
substitutions of objects are more easily made 
than substitution of aims. In general, then, an 
instinct can be thought of as a sum of energy 
which imparts direction to psychological proc- 
esses (Freud, 1933) . 

Freud says that instincts are conservative by 
nature, By this he means that they seek to re- 
peat that which has gone before (the repetition 
compulsion) or if development has taken place 
they strive to return to an earlier stage of de- 


velopment (regression) . 


NUMBER OF INSTINCTS 


It is possible to distinguish an indeterminate 
number of instincts but they all belong to or are 
derived from two basic groups, the life instincts 
and the death instincts. The instincts within 
each group are distinguished from one another 
by virtue of the fact that each has a distinctive 


somatic source, e.g., hunger vs. sex, and a dis- 
tinctive special object, e.g., eating vs. copulation. 


The dichotomy of ego instincts and sexual instincts 
was Freud's earliest instinct theory. The ego in- 
stincts had as their aim the self-preservation of the 
individual, while the sexual instincts aimed at pres- 
ervation of the species through reproduction. As 
early as 1914 Freud expressed dissatisfaction with 
this theory but he was not inclined then to give it 
up entirely (Freud, 1914). Instead, he elaborated 
the concepts of ego libido and object libido, which 
in a sense constituted a new instinct theory and 
paved the way for the final formulation of life in- 
stincts and death instincts (Freud, 1923b). Freud 
recognized that the ego was the main reservoir of 
libido, having taken this function away from the id, 
and that it invested its energy in object cathexes. 
However, a goodly portion of it always remained 
in the ego and was said to be used narcissistically 
because the ego cathected itself. Moreover, there 
was a constant transformation of ego libido into 
object libido and of object libido back into ego 
libido. This fact made Freud suspicious of the basic 
nature of the distinction between the two kinds of 
cathexes, object versus ego, with the consequence 
that he banished his original instinct theory, and 
created a new one. The ego instincts were fused with 
the sexual instincts to give the life instincts, and a 
new group of instincts, the death instincts, which 
bear no relation to any early features of the theory, 
were postulated (Freud, 1920). 


EROS OR THE LIFE INSTINCTS 


The life instincts have as their goals the 
preservation of life and the preservation of the 
species. They accomplish these aims by estab- 
lishing ever greater unities both in connection 
with the synthetic function of the ego and in 
the formation of libidinal relations with ob- 
jects. The person becomes more highly inte- 
grated within himself and with others (Freud, 
1940) . 

Since the principal feature of an instinct is 
that it seeks to return to an earlier state, one 
wonders what the earlier state is which Eros 
tries to reinstate. It would have to be a state of 
unity which subsequently had been split up and 
to which the individual wished to return. Freud 
was not satisfied with this analysis because he 
felt that there is nothing in the actual history of 
living substances to support such a hypothesis, 
other than the supposition that the birth of life 
out of inorganic matter was accomplished by 
the shattering of an inorganic unity. If this 

ewere so, the life instincts and the death instincts 
would have the same aim, viz., to return to the 
inorganic, and there would be no need to dif- 
ferentiate between them (Freud, 1940). 

The best known and most minutely analyzed 
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group of life instincts are the sexual instincts. 
The psychoanalytic literature is replete with 
studies of the sexual instincts, so much so that 
for many people the distinguishing and ubiqui- 
tous feature of psychoanalysis is its pansexual- 
ism. Prior to 1920 this may have been true; after 
that date pansexualism became a misnomer. 

The general characteristics of the sexual in- 
stincts are as follows (Freud, 1915c). They are 
numerous, arising from many bodily sources. At 
first each sexual instinct acts more or less inde- 
pendently of the other sexual instincts but as 
the individual matures they achieve a more or 
less complete synthesis in the service of propa- 
gation. In the early stages of development they 
serve the aims of the ego instincts (self-preserva- 
tion) from which they only gradually detach 
themselves and acquire their own aims. In their 
choice of objects, also, they at first adopt the 
objects of the ego instincts. 

The sexual instincts are very plastic. They 
can change their aims and their objects and sub- 
stitute for one another, so that a particular 
sexual instinct can be vicariously satisfied by 
another sexual instinct (Freud, 1933) . 

The sexual instincts do not appear suddenly 
at puberty but go through a course of develop- 
ment beginning in earliest infancy (Freud, 
1905). The erogenous zones of the body, which 
come into prominence one at a time, are the 
key to this development. An erogenous zone 
is a portion of the skin or mucous membrane 
from which unpleasant feelings of sensitiveness 
arise, The removal of these feelings by manipu- 
lation of the zone, e.g., sucking or massage, 
evokes pleasure. 

The first such zone is the mouth, from which 
pleasure is derived independently of nourish- 
ment, Oral pleasure is obtained in two ways, 
by stimulation of the lips and oral cavity and 
by using the teeth for achieving aggressive aims. 
Oral erotic pleasure develops before oral ag. 
gressive pleasure, because the latter has to await 
the appearance of teeth before it can be fully 
satisfying, although even before the teeth ap- 
pear, aggression finds an outlet through spitting- 
out behavior. The two oral stages are often 
referred to as oral incorporation and oral 
sadism. 

The second erogenous zone to come into 
prominence is the anus, which does so by virtue 
of its relation to the excretory function. The 
anus has two modes of functioning, retention 


and expulsion, either of which yields pleasure. > 


To a far greater extent than is true of the oral 
period, pleasure derived from the anal modes 
depends upon the discharge of destructive en- 
ergy. Toilet training procedures, parental dis- 


cipline in the form of reprobations and 
approbations, and external controls over bodily 
pleasures enhance the aggressiveness of the child 
during this stage. 

The third region of the body from which the 
child obtains pleasure through tactual stimula- 
tion, ie. masturbation, is the genital zone. 
Since the male genitals are of primary im- 
portance in this stage, this is called the phallic 
stage of libido development. 

These three stages, oral, anal, and phallic, 
which are not to be thought of as appearing and 
disappearing in a one-two-three order but which 
overlap and occur simultaneously, constitute the 
pregenital development of the child during the 
first five years of life. At the end of this period, 
the child goes into latency, from which he 
emerges at puberty and pursues a final libidinal 
development. This final development consists 
of the organization of all of the sexual functions 
in the service of reproduction. Some of the 
earlier cathexes are restrained or repressed, some 
are integrated into the sexual act as forepleas- 
ure, while others are transformed into character 
traits or displace their aims through the process 
of sublimation. The principal distinction be- 
tween the pregenital and genital sexual func- 
tions is that the former are autoerotic and 
narcissistic while the latter are alloerotic and 
object oriented. 

Freud points out that much of our highly 
valued cultural heritage has been acquired at 
the cost of inhibiting and repressing the sexual 
instincts. "The energy which is used for sexual 
gratification is compelled to find new channels. 
Love is one such channel. All manifestations of 
love, whether they be in the form of self-love, 
love between parents and children, friendship 
and love for humanity, or devotion to objects 
and abstract ideas, are expressions of the same 
basic instinct, sexual love (Freud, 1921). These 
manifestations of love are usually diverted or 
prevented from attaining the culmination of 
sexual union, and in fact their very strength and 
tenacity are dependent upon the inhibition of 
this aim, since the attainment of sexual union 
dissipates libido and weakens emotional ties. 


THE DEATH INSTINCTS 


“The goal of all life is death" (Freud, 1920, 
p. 47). By this, Freud means that there is resi- 
dent in living organisms a compulsion to return 
to the inorganic state out of which living matter 
was formed. The aim of the death instincts is 
just the opposite of eros. Whereas eros forms 
ever larger unities, the death instincts work to 
destroy unity. 


PSYCHOANALYTIC THEORY AND ITS APPLICATIONS IN THE SOCIAL SCIENCES 159 


The death instincts do not put themselves on 
display in behavior as do the life instincts. For 
the most part they continue their inner task of 
bringing about the death of a living thing in- 
conspicuously (Freud, 1923a). However, ag- 
gression turned outward or aggression turned 
inward, which are derivatives of the death in- 
stincts, are far from being inconspicuous in be- 
havior. The derivative instinct of destruction 
diverts the energy of the death instincts away 
from the person's own dissolution and projects 
it out on the world. When the superego takes 
form, some of the aggression is used by the 
superego against the ego in the form of self- 
destruction. 


INTERACTION BETWEEN LIFE INSTINCTS 
AND DEATH INSTINCTS 


Rarely is any act the product of a single in- 
stinctual impulse. Rather, the rule is that the 
two basic instincts of life and death, and their 
innumerable derivatives, either work together 
in some sort of synthesis or they work against 
each other and reveal a conflict of interests. 
Any action is, therefore, the product of a num- 
ber of motives (Freud, 1932). 

The death instincts are rendered relatively 
harmless by entering a coalescence with eros. 
The libido of eros neutralizes the self-destruc- 
tive energy by diverting it outward in the form 
of mastery, domination, aggression, and the will 
to power. Love may use mastery in gaining 
possession of an object. On the other hand, 
love may acquire a sadistic quality by being 
combined with the destructive instinct. There 
are innumerable ways in which instincts can and 
do combine and the unravelling of the motives 
which produce a given action is often a difficult 
and always a complicated business. But this 
unravelling, if it is pursued to completion, al- 
ways brings the investigator to the basic life and 
death instincts (Freud, 1924). 

When the life and death instincts are pur- 
suing their respective aims more or less inde- 
pendently, that is, one at a time, then the 
gratification of the one makes it possible for 
the other to put in an appearance. When eros 
has been satisfied, the death instincts are likely 
to come to the fore, and vice versa. This ac- 
counts for the observation that love is often 
succeeded by hate, and hate by love. 


THE EGO IN RELATION TO THE INSTINCTS 


Although Freud never clearly formulated the 
relations of the instincts to the ego and the 
superego, one gains the impression from a care- 
ful reading of his various thoughts on this sub- 


ject that the ego is the agent of the life instincts 
and their derivatives, and the superego is the 
agent of the death instincts and their deriva- 
tives. This distinction is by no means a strict 
one, and it is much clearer in the case of the 
superego than it is in the case of the ego. 

The primary aim of the ego, it will be re- 
called, is that of maintaining life, and all of its 
vast and complicated elaborations of psychic 
processes is directed toward this superordinate 
goal. When Freud says that life is a roundabout 
way to death, he must mean that the round- 
aboutness is provided by the development of 
an ego. For an id without an ego would soon 
destroy itself. 

The ego's relation to the sex instincts is one 
of trying to find satisfaction for them in line 
with the reality principle. This involves finding 
appropriate objects in the external world, of 
incorporating these objects by the mechanism 
of identification, of sublimating (desexualizing) 
the sexual instincts, and of utilizing numerous 
other techniques for dealing with the sexual 
instincts. There is always a danger that the 
control may go too far in opposing the purpose 
of eros, with the consequence that the death 
instincts may seize the opportunity to assert 
themselves when the sexual instincts are in- 
hibited. If this is the case, then the ego becomes 
indirectly the agent of death (Freud, 1920) . 

However, the ego also possesses the means of 
dealing with the primitive death wish of the id. 
It can project it outward in the form of aggres- 
sion or sublimate it in the form of mastery over 
the world or diminish its force by turning it 
into masochism, which is a form of suffering 
without dying (Freud, 1924) . 


THE SUPEREGO IN RELATION TO THE INSTINCTS 


The superego, based as it is upon external 
authority, has closer ties with the death instincts 
than with the life instincts. Perhaps the most 
characteristic feature of the superego is the 
aggression which it displays against the ego. 
Although the model for this aggression may lie 
in the punitive parent, the real source of it 
lies in the death instinct of the id. From this 
source to its ultimate appearance in the super- 
ego, the death wish of the id undergoes con- 
siderable modification, Through the action of 
the ego on it, by the mechanism of projection, 
the wish for one’s own death is transformed 
nto aggression toward others. The external 
authority figure, by imposing prohibitions on 
the manifestations of the life force, especially 
that of sex, becomes a particularly fitting target 
for aggression. This defiance elicits a counterat- 
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tack from the parent, e.g., castration threats and 
loss of love, so that the child in the interest of his 
own safety inhibits his aggression and develops 
a superego which assumes the role of the puni- 
tive parent toward the ego. The superego may 
seek to destroy the ego through harsh and cruel 
treatment, so that the death wish, by this ex- 
tensive detour, may finally achieve its purpose. 
Depression, melancholy, and suicide are mani- 
festations of the death wish which has found a 
new base for its operations in the superego. The 


ego becomes masochistic under the influence of 
a sadistic superego (Freud, 1924). 

We can now understand why it is said that 
the superego is corruptible. It is corruptible in 
the sense that a primitive id wish for death ob- 
tains gratification under the guise of the super- 
ego's morality. Moreover, the superego’s sadistic 
morality can be projected outward in the form 
of cruel treatment of others, which is rational- 
ized as being justified by a high moral purpose. 


ANXIETY 


The three basic emotional states which Freud 
discusses are love, anger, and fear or anxiety. 
Love and anger each have their own energy 
sources in the life and death instincts, respec- 
tively. Anxiety, on the other hand, is derived 
from the same circumstances of life which insti- 
gate the formation of the ego and superego. 
It is a by-product, albeit an extremely important 
one, of psychological development and, in turn, 
anxiety causes further transformations in per- 
sonality. The avoidance of anxiety becomes a 
very important motive in the shaping of charac- 
ter and in the institution of defensive measures. 


DEFINITION AND FUNCTIONS OF ANXIETY 


Anxiety is a painful affective state which is 
differentiated from other states of unpleasure, 
e.g. tension, pain, and melancholy, by a specific 
quality which is contributed by automatically 
innervated responses of the heart, lungs, and 
other internal organs through the perceptual 
system to consciousness. As so defined, anxiety 
and fear are interchangeable terms. Freud seems 
to have accepted the James-Lange theory of 
emotions, although nowhere does he say so. 

The sole function of anxiety consists of its 
being a danger signal to the ego, so that the 
ego may institute measures to terminate the 
danger (Freud, 1926b) . 


TYPES OF ANXIETY 


There are three types of anxiety, reality or 
objective anxiety, neurotic anxiety, and moral 
anxiety. These types do not differ from one 
another in any qualitative way; they are all ex- 
perienced by the ego as a painful emotional 
state. They differ in respect to their sources. 
Reality anxiety results from the ego's relation 
to the external world, neurotic anxiety from the 
ego's relations to the id, and moral anxiety from 
the ego's relation to the superego (Freud, 1926b, 


1933) . 1 o ^ Z 
Objective or reality anxiety. Objective anxi- 


ety is an emotional reaction to a perception of 
danger in the external world. A danger is 
defined as an external condition which one re- 
members as having had a traumatic effect in the 
past. By traumatic is meant something which 
paralyzes the person so that he is not able to 
make a coping response to the source of the 
intense, excessive excitation. 

When the anxiety reaction occurs it can take 
one of two directions. Either the person avoids 
the danger by fleeing from it or by mastering 
it, or he does nothing and the anxiety mounts 
until it overwhelms the ego and the person re- 
gresses to a state of infantile helplessness. 

Objective anxiety appears first in develop- 
ment and arises out of the condition of help- 
lessness of the baby. Any condition which 
portends a return of this helplessness, eg. 
threat of castration or loss of love, will initiate 
the anxiety reaction. The external danger must 
be perceived as one which is related to a situa- 
tion of helplessness that has been experienced 
earlier in life (Freud, 1926b, 1933) . 

Neurotic anxiety, Neurotic anxiety is an af- 
fective reaction to a perception of danger in 
the internal world. Specifically, it is a fear of 
the potential harm that may be done by the 
unleashing of the instincts, Neurotic anxiety 
is a derivative of reality anxiety, since fear of 
the id is basically a fear of what the id may 
cause the person to do. 

Neurotic anxiety can be observed in three 
forms. There is a free-floating, general appre- 
hensiveness which is ready to attach itself to 
any more or less suitable environmental circum- 
stance. Neurotic anxiety also manifests itself 
in phobic reactions. A phobia is characterized 
by an intensity of anxiety which is not war- 
ranted by the objective situation. A person 
who becomes paralyzed with fright when he is 
on a high place, even though he is fully pro 
tected from falling, is obviously reacting to 
something more than the hazard of an objective 
situation. He is reacting to an unconscious 
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desire to jump, the effect of which is to throw 
him into a state of horror at the thought that 
the wish might be realized. Fear of high places, 
or any other phobia for that matter, may also 
be an expression of moral anxiety. A person is 
afraid that his superego will take advantage 
of an external situation in order to punish the 
ego. Finally, there are anxiety states which seem 
to have no justification whatsoever in any ex- 
ternal danger. Inexplicable panic reactions are 
of this nature. 

Neurotic anxiety is often projected outward 
as objective anxiety because it is easier to pro- 
tect oneself from an external danger than from 
an internal danger. Obviously, this projection 
offers no permanent solution, since the person 
continues to carry the danger around with him 
(Freud, 1933) . 

Moral anxiety. Moral anxiety is an emotional 
reaction to a perception of danger from the 
superego. The ego considers itself to be in dan- 
ger when the superego is angry with it or 
threatens to punish it or ceases to love it. Moral 
anxiety expresses itself as feelings of shame and 
guilt. 

Since the ties between superego development 
and the oedipus and castration complexes are so 
intimate it is not surprising that moral anxiety 
also has close ties with neurotic and reality 
anxiety. The superego, on the one hand, abhors 
the instinctual impulses in spite of the fact that 
these same impulses may exploit the superego 
for their own ulterior purposes, and on the other 
hand it is the introjected voice of parental au- 
thority which, at one time, actually produced 
objective anxiety by threats and punishments. 
Thus moral anxiety gathers force by being a 
synthesis of neurotic and objective anxiety 
(Freud, 1926b) . 


ORIGIN OF ANXIETY 


Anxiety is produced primarily by an accumu- 
lation of excitation resulting from external 
stimulation which is so excessive that the ego 
is not able to bring it under control, Le, to bind 
it. The event which is the prototype for this 
source of anxiety is birth, when the young or- 
ganism is likely to be overwhelmed by an ex- 
cessive amount of excitation. 

Anxiety may also be produced by perception 
of an impending traumatic situation. The anxi- 
ety, in this case, acts as a signal to the individual 
that he is in danger. The origin of the signalling 
function of anxiety is one that lies in the in- 
dividual's past experience with traumatic situa- 
tions. He learns to anticipate trauma and to 


do something about it by paying attention to 
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the incipient anxiety which the perception of 
danger initiates. By developing apprehension, 
the ego is in a position to ward off the danger 
and avoid the accumulation of intolerable 
amounts of excitation. 

Objective or reality anxiety appears develop- 
mentally before either neurotic or moral anxi- 
ety. Freud observes that in each stage of 
development there is a particular condition for 
fear which is peculiar to that stage. For ex- 
ample, in the oral dependent phase, the great 
fears are the loss of the love object or the loss of 
love, in the phallic phase it is fear of castration 
for boys, and during latency the child fears his 
conscience. Freud believes that fear of castra- 
tion plays the greatest role in the anxiety of: 
boys, and that fear of loss of love is the decisive 
factor in the anxiety of girls. 

Neurotic anxiety develops in one of two ways. 
As a result of inhibition, excessive tension may 
accumulate in the id until it exerts such a strong 
pressure on the ego that the ego is reduced to a 
state of helplessness and reacts to its helpless 
condition with anxiety. Or the ego may ex- 
perience neurotic anxiety when an instinctual 
demand entails a real external danger. The 
person is afraid of an impulse in the sense that 
he is afraid of the trouble the impulse might 
get him into should it be expresed. An in- 
stinctual demand of moderate strength is not 
dangerous in itself; it becomes so when it is 
likely to lead to an objective danger. 

Moral anxiety arises whenever the ego per- 
ceives that it has failed to live up to the stand- 
ards set by the superego. Since the superego 
is the representative within the personality of 
external authority figures, the ultimate source 
of moral anxiety is an external danger (Freud, 
1926b, 1933). In fact, the ultimate source of 
all anxiety is the perception of an external 
danger. 


Until 1926 Freud believed that anxiety was a 
consequence of the damming up of energy by repres- 
sion. In his essay, Inhibitions, Symptoms, and Anxi- 
ety (1926b), he reversed himself and announced that 
anxiety was the cause of repression rather than its 
effect, In a later lecture he wrote “To our surprise 
we discovered that it is not the repression that creates 
the anxiety but the anxiety is there first and creates 
the repression” (Freud, 1933, p. 120). Freud said 
that the division of the personality into id, ego, and 
superego forced him to adopt this new position with 
regard to the problem of anxiety. 

‘After reversing his earlier theory, Freud continued 
to assert that repression made available energy which 
might be converted into anxiety, although he felt 
that it was of no importance which portion of the 
general energy was employed for this purpose (Freud, 
1926b) . 
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THE EGO IN RELATION TO ANXIETY 


The ego is both the seat of anxiety and that 
part of the personality which attempts to abate 
the anxiety. The way in which the ego deals 
with anxiety is as follows. When an external 
stimulus acts upon the primitive mental ap- 
paratus and is perceived as being painful, rest- 
less motor activity takes place until the painful 
perception disappears. When the percept re- 
appears, the same thing happens, so that a trace 
of the experience is laid down in the mnemic 
system. Eventually, the beginning of a painful 
excitation arouses apprehension or anxiety, 
which is a signal to the ego that more and 
stronger pain is in the offing unless steps are 
taken to avoid the painful situation. If the ego 
feels helpless, the anxiety mounts until the ego 
either rises to the occasion in some effective 
way or surrenders to the anxiety, ie. has an 
anxiety attack or becomes completely immo- 
bilized (Freud, 1900) . 

Through the processes of learning and reten- 
tion, the amount of anxiety which is necessary 
to act as a signal is reduced to a minimum. The 
efficiency of the ego consists, in part, of antici- 
pating dangers on the basis of reduced cues. 
Anxiety may also be dealt with by defensive 
measures instituted by the ego. 


DEFENSES OF THE Eco 


In its normal functioning the ego makes use 


of its various subsystems to avert danger when 
the anxiety alarm is sounded. It attends to 
and tries to apprehend the nature of the danger; 
it thinks over the various possible courses of 
action; it draws upon memory to recall former 
ways of handling the danger; it considers, 
judges, and evaluates; finally it takes some ap- 
propriate action. These normal functions oper- 
ate more or less successfully in dealing with 
external threats, but they are not as well suited 
for protecting the ego against internal dangers, 
i.e., id and superego demands. Accordingly, the 
ego initiates additional measures whose purpose 
is to deal with conflicts between the id and the 
ego, and the superego and the ego. These 
measures are called the defense mechanisms of 
the ego. Freud characterized them as being 
“pathological,” since they accomplish their mis- 
sion by falsifying and distorting reality and by 
using energy which could be employed to better 
advantage by the rational systems of the ego 
(Freud, 1937b) . 

Freud first used the term “defense” in 1894 (Freud, 


1894) but soon abandoned it in favor of the term 
“repression.” In 1926 he revived the earlier concept. 
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THE SUPEREGO IN RELATION TO ANXIETY i 


The superego is responsible for the develop- 
ment of moral anxiety in the ego. When the 
ego is conscious of the sources of this type of 
anxiety it is experienced as guilt feelings. Con- 
sciousness of guilt arises before there is a 
conscience and consists of fear of external au- 
thority. When the superego originates, the 
fear of external authority is converted into fear 
of the superego's authority and criticizing func- 
tion. The ego feels guilty when it perceives that 
it has failed to act in accordance with the wishes - 
of the superego (Freud, 1930) . 

A mere intention to do something that trans- 
gresses the superego's code is sufficient to pro- 
duce feelings of guilt. Feeling guilty before any 
misdeed has occurred is a signal to the ego to 
obliterate the intention in order to avoid more 
intense condemnation from the conscience. The 
stronger the superego is in relation to the ego, 
the more the person is predisposed to experience 
guilt. This follows from the fact that a strong 
superego demands more instinctual renuncia- 
tion and every time the ego feels any pressure 
for gratification from the id, the ego becomes 
flooded with guilt. It follows that the person 
with the most fully developed conscience will 
experience the greatest amount of moral anxi- 
ety. 


At that time he defined defense as a general name 
for all of the techniques, other than the normal 
ones, which the ego uses in putting down conflicts. 
The term "repression" was reserved for a special 
method of defense. "Thus, in reading the writings 
of Freud which appeared between The Aetiology of 
Hysteria (1896) and Inhibitions, Symptoms, and 
Anxiety (1926b) it should be borne in mind that ` 
the term "repression" is sometimes used in a re- 
stricted sense, ie, as merely one of many defense 
mechanisms, and is sometimes used in a general 
sense, ie, as a designation for all of the defense 
mechanisms. 


REPRESSION 


Perhaps the earliest defense initiated by the 
ego against the id is that of repression, which 
frustrates the discharge of instinctual energy 
by setting up a block, i.e., an anticathexis, so 
that the instinct cannot become conscious OF 
directly expressed in overt behavior. "That is, 
repression acts directly upon the aim of the 
instinct. No other defense operates upon the 
aim, so that repression is more sharply dif- 
ferentiated from the other defenses than they 
are from one another. f- 

Freud differentiates two phases of repression. 
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The first phase, which he calls primal repres- 
sion, is a denial of entry into consciousness of 
the ideational presentation of the instinct. That 
which has always been in the id stays there. 
However, the id has the capacity for transform- 
ing its content into forms which are capable of 
being smuggled past the resistances into the ego. 
When the ego unmasks the disguised material 
and discovers an instinct presentation, it auto- 
matically institutes the mechanism of repression. 
This repression of the derivatives of material 
which is subject to primal repression is called 
repression proper. Repression can also be used 
against material which, although not harmful 
in itself, has come into association with material 
that is judged to be harmful. Freud points out 
that a storehouse of primal repressions in the 
id, which at no time have been in the ego, is a 
prerequisite for repression proper (F reud, 
1915d) . 

Repression is established and maintained by 
anticathexes, generated by the ego and used to 
resist the driving forces of the id. Anticathexis 
is the sole mechanism used in primal repression. 
In the case of repression proper, there is, in 
addition to resisting forces, a withdrawal of 
cathexis from the instinctual idea which changes 
a preconscious quality into an unconscious one. 
Freud suggests that the cathexis which is with- 
drawn from the idea is the very one which is 
used in generating an anticathexis (Freud, 
1915b) . 

The mechanism of repression is carried out 
by the ego under the influence of anxiety. The 
ego becomes aware of an incipient instinctual 
demand which, if permitted to discharge itself, 
would endanger the safety of the individual. 
The cathexis of this demand must be removed. 
If the ego is strong, the cathexis can be removed 
by assimilating it into the organization of the 
ego. If the ego is weak, painful feelings are 
generated and repression is automatically 
brought into play. By repressing the instinctual 
demand, the painful feelings are eliminated. 
The earliest and most basic repressions arise 
out of traumatic situations, when the ego is 
faced with excessive pressures from the id 
(Freud, 1938) . 

What happens in the id during repression? 
According to an earlier hypothesis, Freud 
thought that the energy of the repressed impulse 
was converted directly into anxiety. Upon his 
repudiation of this theory of anxiety, Freud 
re-examined the problem of what happens in 
the id when repression occurs and concluded 
that the effects might be varied. The repressed 
impulse might retain its libidinal cathexis and 


continue to exist unchanged in the id. Or the 
libidinal cathexis might be entirely removed 
from the impulse and diverted into other chan- 
nels. The impulse could then be said to have 
been destroyed in the id. This is what happens 
normally to the oedipus complex. Or there 
might be a regression of the libidinal organiza- 
tion to an earlier stage of development. In any 
event, by outlawing material from the ego, it 
becomes subject solely to the laws which govern 
the id, especially the principle of the repetition 
compulsion, so that the ego very seldom suc- 
ceeds in undoing its repressions (Freud, 1933) . 

The ego, although it often institutes repres- 
sion upon orders from the superego, may turn 
repression against the superego itself and secure 
oblivion for a sense of guilt. 

Represed material returns to the ego and 
becomes conscious or expresses itself in be- 
havior, e.g., symptoms or errors, when (1) the 
strength of the anticathexis is diminished or 
(2) the strength of the id cathexis is increased 
(Freud, 1939). There are many conditions, 
such as illness and sleep, which diminish the 
power of the resistances, and there are many 
conditions, such as energy changes at puberty 
and the climacteric or the addition of energy 
by some circumstance of life which makes con- 
tact with the repressed material, that will in- 
crease the strength of the cathexis. Also, 
because the id is capable of working upon its 
material by the processes of distortion, displace- 
ment, condensation, and symbolization, it can 
transform it so that the ego is not aware of its 
source. When the ego does become aware of the 
danger inherent in the material, the ego breaks 
out with anxiety and repression is again insti- 
tuted. It is possible, of course, for the repressed 
material to express itself in the form of more 
or less permanent symptoms and character traits 
which may never be recognized for what they 
are by the ego. A sort of secondary anxiety may 
arise from the presence of a disabling symptom 
or trait, but this secondary anxiety is always 
more welcome than the anxiety which would re- 
sult were the individual to discover the real 
origin of the symptom or trait. A trait or symp- 
tom gains satisfaction for a repressed impulse 
without incurring anxiety. That is why they 
are so hard to dislodge once they have taken 
hold in the person (A. Freud, 1936) . 

Another way that the subject matter of a 
repressed idea can enter consciousness is for 

«the ego to deny its acceptance of the idea, When- 
ever a person uses a negative, e.g., “I don’t want 
to hurt you,” he has actually had to assert the 
positive before it could be interdicted. The 
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example should be read in two parts. “I want 
to hurt you" represents the repressed idea, and 
“No, I don't" is the denial (Freud, 1925a) . 


REGRESSION 


In The Interpretation of Dreams Freud dis- 
tinguishes three kinds of regression. One he 
calls topical, by which is meant the substitution 
of the primary process for the secondary process. 
This is the kind of regression that is responsible 
for dreaming. Another kind of regression is 
called temporal, which refers to the appearance 
of older psychic formations in place of more 
recent ones, and a third is denoted as being 
formal because it describes the substitution of 
primitive modes of expression and representa- 
tion for the customary and normal ones, Having 
made these distinctions, Freud then proceeds 
to say that they are all basically one, “for that 
which is older in point of time is at the same 
time formally primitive and, in the psychic 
topography, nearer to the perception-end” 
(Freud, 1900, p. 497) . 

Regression qualifies as a method of defense 
because it is employed by the ego to satisfy 
the demands of the id in such a way, i.e., autisti- 
cally, that the ego is not really endangered. 
There is a substitution of fantasy for reality or, 
putting it in another way, a symbol is created 
for an anxiety-producing referent. 


REACTION FORMATION AND PROJECTION 


These two favorite mechanisms are discussed 
together because one of them, reaction forma- 
tion, involves a reversal of a conscious affective 
state while the object of the instinct remains the 
same, and the other, projection, involves a re- 
versal of subject and object while the affective 
state remains the same. In reaction formation, 
the anxiety-producing feeling, “I hate you,” is 
changed into the anxiety-reducing feeling, “I 
love you.” Subject (I) and object (you) do 
not alter their form, but the conscious feeling 
of love is now just the opposite of hate. Of 
course, the hate still exists unconsciously, i.e., 
it is repressed, but its place in consciousness has 
been taken by a completely fraudulent feeling. 
In projection, “I hate you” becomes “You hate 
me.” In this case the true feeling of the in- 
stinct is expressed openly, but the change in 
object, ie, you into me, constitutes the de- 
fensive distortion. 

It is easy to see why projection is such 2 
prevalent defense when one remembers that, on 
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the whole, the ego is better prepared by ex- 
perience and practice to deal with external 
dangers. By fabricating a "real" enemy, the 
person feels more adequate to take command 
of the situation. He can, for example, destroy 
or attempt to destroy his enemy, which brings 
satisfaction for his aggressive impulse without 
incurring a feeling of shame. For this reason, 


projection is probably the most effective of the < 


defense mechanisms. By contrast with it, re- 
action formation is much less satisfying, since 
the person has to pretend a feeling he does not 
have and the expression of which gives him no 
gratification for the real instinct. Still it does 
act as a bulwark against anxiety. 


SUBLIMATION 


Whether or not sublimation should be classed 
as an ego defense is a perennial question. 


Anna Freud lists sublimation as an ego defense, 
but she says it “pertains rather to the study of the 
normal than to that of neurosis" (A. Freud, 1936, 
p- 47). S. Freud does not make a definite pronounce- 
ment as to the status of sublimation, although he 
does observe in Civilization and Its Discontents 
(1930) that sublimation is one method available to 
man for guarding against pain. 


By sublimation, Freud meant that sexual en- 
ergy (libido) is diverted from its sexual objec- 
tive to higher and finer cultural goals. Culture 
is, in fact, acquired at the cost of the sexual 
instincts. Law and order, for example, consti- 
tute sublimations of anal eroticism. Freud says 
that many people have the capacity to direct 
part of their sexual powers into their work. 
"The sexual impulse is particularly suited to 
yield such contributions because, it is endowed 
with the capacity of sublimation, i.e., it has the 
power to exchange its nearest aim for others of 
higher value which are not sexual" (Freud, 
1910a, p. 46). Freud admits that he does not 
know what it means to say that an aim is higher 
or finer, and that he is using these terms only 
in a metaphorical sense (Freud, 1930) . 


OTHER DEFENSES 


In addition to the ones already discussed, . 
Anna Freud (1936) discusses five other de- 
fenses. They are isolation, undoing, introjec- 
tion, turning against the self, and reversal. For 
a discussion of these defenses the reader is re- | 
ferred to Anna Freud (1936) and Fenichel 
(1945) . 
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CHARACTER FORMATION 


Freud's well-known formula for the forma- 
tion of character states that the permanent 
character traits are “either unchanged perpetua- 
tions of the original impulses, sublimations of 
them, or reaction formations against them” 
(Freud, 1908a, p. 50). To this he later added 
the notion that character is also created by in- 
corporation of early parental functions, and 
identifications with parents at a later date and 
with other persons in authority. Thus a boy 
will assume masculine character traits by identi- 
fying with his father, while a girl will acquire 
feminine character traits by identifying with 
her mother. A child models himself after the 
same-sex parent in an attempt to achieve the 
same kind of love relationship with the opposite- 
sex parent that the same-sex parent has. He 
also strives for identification in order to allay 
his anxiety arising from open aggression toward 
"and fear of the same-sex parent. This is known 
as identification with the aggressor and plays a 
major role in character formation. Identifica- 
tions are also made with people with whom one 
has had object relationships which have been 
abandoned (Freud, 1933) . 

The prototypes of what later become charac- 
ter traits occur during the pregenital phases of 


development. A person whose character is heav- 
ily saturated with perpetuations, sublimations, 
reaction formations and identifications of the 
oral phase is said to have an oral character. In 
like manner, if his character is formed around 
a nucleus of anal prototypes, he is said to have 
an anal character, However, the character of 
most people will include precipitates from all 
of the early phases of libidinal development. 

Freud makes a sharp distinction between 
character and neurotic symptoms, although not 
between character and defenses. In character 
formation repression either does not operate at 
all or it easily reaches its objective, which is to 
replace the repressed impulses with reaction for- 
mations and sublimations. There is no return 
of the repressed, as there is in neurotic symp- 
toms, Defense mechanisms can, however, be- 
come character traits. 

Although Freud is not explicit on this point, 
it may be surmised that he believed that charac- 
ter traits could represent either defenses against 
anxiety or rational solutions by the ego in deal- 
ing with its environment and with the id and 
superego. Again we are faced with the problem 
of how Freud would distinguish between the 
pathological and the normal in personality. 


SoctaL PSYCHOLOGY 


Freud's contributions in the general area of 
social psychology may be summarized under five 
major headings: (1) socialization of the in- 
dividual, (2) family structure and dynamics, 
(3) group psychology, (4) origin of society, and 
(5) the nature of human culture. 


SOCIALIZATION OF THE INDIVIDUAL 


. Most of Freud's views on the socialization of 
the individual have already been discussed in 
the section on the superego, and not a great deal 
more can be added here. The child early in life 
becomes assimilated into the social tradition 
of his particular culture by the process of identi- 
fication with his parents, and this assimilation 
is strengthened later in life by identification 
with other representatives of the culture (Freud, 
1921). The agents of social tradition use re- 
wards and punishments to coerce the child into 
acceptance of the cultural rules and customs 
` (Freud, 1930). An intrapsychic system, the 
superego, takes over the role of external author- 
ity figures and exacts conformity to society. The 
person is then said to be socialized. 

The person may rebel as well as conform to a 


social tradition. Rebellion may be produced by 
a number of factors, but two in particular are 
of major importance. These are (1) the severity 
of the hardships imposed by society on the in- 
dividual and (2) the weakness of the ego and 
superego. If the prohibitions of society relative 
to its rewards are excessive, or if the person does 
not have the ability to find satisfactions within 
the framework of society (ego weakness), or 
if the social tradition has not been sufficiently 
incorporated due to a failure of identification 
(superego weakness) , then the tendency to rebel 
against society is increased (Freud, 1927a). 
Crime is the most common expression of this 
revolt. 


FAMILY STRUCTURE AND DYNAMICS 


Freud has been criticized repeatedly for over- 
emphasizing the role of biological determinants 
and underemphasizing the role of sociocultural 
ihfluences in shaping the course of personality 
development. It is true that Freud does not give 
much consideration to the ways in which spe- 
cific economic, political, religious, and social 
institutions may affect personality or how dif- 
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ferent types of social organization may produce 
different types of character, although in his 
book Civilization and Its Discontents (1930) he 
discusses at length how society imposes prohibi- 
tions upon the person and how the person may 
react to these prohibitions. One may, of course, 
apply these general principles in making an 
analysis of the influence of a particular institu- 
tion or a particular form of society upon the 
people living in that society, as Kardiner (1939, 
1945) has done. Freud himself has applied 
psychoanalytic concepts in the analysis of one 
important social institution, namely, religion. 
In The Future of an Illusion (1927a) and later 
in Moses and Monotheism (1939) he addresses 
himself to three major questions: (1) how re- 
ligion originated, (2) how it has evolved, and 
(3) what effects it has upon the person. Yet the 
general impression which these books leave up- 
on the reader is that in them Freud is writing 
from the standpoint of social criticism and not 
from the standpoint of social science. Freud, of 
course, was not a social scientist, and when he 
turned his attention to social phenomena it 
seemed to be with the intention of indicting 
society for its deleterious influences upon per- 
sonality. The view of society that one gets from 
neurotic patients is not likely to be a favorable 
one, and this fact probably made it difficult for 
Freud to maintain a dispassionate attitude to- 
ward society, Moreover "society" did not treat 
Freud well in the early years of psychoanalysis, 
nor later when the Nazis came to power. Freud's 
general pessimism regarding society and its in- 
stitutions has been remarked on more than once. 

There is yet another social institution to 
which Freud has devoted an enormous amount 
of attention. This is the family. Freud's analy- 
sis of the structure and dynamics of the family 
as a primary group constitutes one of his greatest 
scientific achievements, and his most notable 
contribution to social psychology. 

Let us first consider a three-member family, 
mother, father, and child. A relationship be- 
tween the mother and father exists before the 
child is born, and the nature of this relation- 
ship will have a bearing upon the child's place 
in the family structure. The child's first libidi- 
nal object choice is the mother, while the father 
is seen as an interloper and rival. This triangu- 
lar relationship involves the child in his first 
love and hate experiences. Since by the very 
nature of the case, he cannot maintain this pure 
dichotomy of feeling for the parents, modifica- 
tions in the social triad occur. The mother dis- 
appoints the child by being an imperfect lover. 
She scolds and rejects him. She makes him do 
things he does not want to do and forbids him 
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to do some of those things he would like to do. 
Consequently, hostile feelings are engendered 
which, existing side by side with love, produce 
ambivalence of feeling. Ambivalence appears 
to be the normal state of affairs in all close social 
relationships, the prototype of which is the 
mother-child relation. Likewise, hostility for 
the father does not continue to persist in an 
unchanged form. The boy makes an identifi- 
cation with the parent of the same sex, while the 
girl begins to transfer some of her love for the 
mother to the father. The reason for this trans- 
fer, in addition to possible biological ones, is 
that the mother is a source of greater disap- 
pointment to a daughter than to a son. The 
daughter, discovering that the mother does not 
have a penis, places the blame upon the mother 
for her own lack of a penis. She turns to the 
father, who possesses that which she wants, and 
at the same time her identification with the 
mother is strengthened. Later the girl's wish for 
a penis will become transformed into a wish 
for a baby (Freud, 1940). Thus, psychological 
processes play into the hands of biological func- 
tions. 

The boy's disappointments with the mother 
are not nearly so great, because he does have a 
penis; yet this very fact constitutes a danger. 
He may be deprived of it by his father as a 
punishment for loving his mother too ardently. 
Fear of castration brings the oedipus complex 
to an end in the boy, whereas the female coun- 
terpart of the castration complex, penis envy, 
creates the oedipus complex in a girl (Freud, 
1933) . The girl's relationships with members of 
either sex always remain more complicated than 
do those of the boy. 

Of course, the dynamics in any particular 
family may vary considerably from the fore- 
going model. Object choices and identifications, 
love, hostility, and ambivalence of feelings, the 
course of the oedipus complex and the intensity 
of the castration complex depend upon the per- 
sonalities of the mother, father, and child and 
the ways in which their personalities interact 
with one another. Freud also places great em- 
phasis upon the bisexual constitution of the 
child, and in one of his writings he wonders 
whether ambivalence toward the parents should 
not be attributed entirely to bisexuality (1 923a). 

The dynamics of a multiple-child family are 
similar to those of a single-child family. There 
is, however, one additional factor, namely, com- 
petition among the siblings for the exclusive 
possession of the loved parent and cooperation 
among them in expressing hostility against the 
resented parent. Since love and resentment are 
not divided between the two parents respec 
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tively, but instead there is always ambivalence 
of feeling toward both parents, the situation in 
regard to sibling competition and sibling co- 
operation is a complicated one. 

The dynamics of family organization is re- 
flected in the dynamics of all organized groups. 
For Freud, the family is the prototype for every 
association among people, whether the associa- 
tion be political, economic, social, recreational, 
or religious in character. The family is the 
bridge between individual psychology and social 


psychology. 


GROUP PSYCHOLOGY 


"The essence of a group and that which holds 
it together are libidinal ties among the mem- 
bers. These ties are derived from aim-inhibited 
sexual impulses and from identifications which 
develop when object choices have to be re- 
nounced. Identification, therefore, is the key 
concept in Freud's theory of group dynamics. 
A primary group, according to Freud, is a 
number of individuals who have taken the same 
person, i.e., the leader, for their ideal and who, 
by virtue of having a common ideal, identify 
themselves with one another (Freud, 1921). 
Thus, emotional bonds are established among 
the members of a group and between each mem- 
ber and the leader. The bond with the leader 
is of more importance to group stability than 
are the ties between members, since it is the 
former which confers significance upon the lat- 
ter. When the ties of the members with the 
leader are broken, the group dissolves unless 
another person acquires the status of an ideal 
figure. 

The effects of the group upon the individual 
are to intensify his emotions and inhibit his 
intellectual functioning. These effects result 
from the libidinal relationships within the 
group, so that it may be said that in a group a 
person is governed more by his id than by his 
ego. He yields some of his ego autonomy for 
the sake of instinctual gratification (Freud, 
1921). 

The archaic prototype of a primary group is 
a father and his sons. The father forces sexual 
abstinence upon his sons, and the libido, denied 
discharge, is used to produce identifications 
between the sons and the father and among the 
sons. Freud says, “His (the father’s) sexual jeal- 
ousy and intolerance became in the last resort 
the causes of group psychology” (1921, p. 94). 
Group formation is a revival of a primal horde, 
in which the primal father is the group ideal 
which governs the ego in place of the ego ideal. 
Freud observes that the direct expression of 


sexual impulses is unfavorable to the formation 
of groups, although he observes that it is a re- 
markable fact, for which he does not know the 
explanation, that homosexuality even when it 
is overtly expressed is more compatible with the 
maintenance of group ties than is heterosexual- 
ity (1921). 

Each individual is a member of numerous 
groups, so that his ties of identification take 
many directions and his ego ideal is a reflection 
of various leaders (Freud, 1921). (For a much 
fuller account of Freud's group psychology the 
reader is referred to Scheidlinger, 1952.) 


ORIGIN OF SOCIETY 


Freud's theory of the origin of society is set 
forth in detail in Totem and Taboo (1913) and 
is summarized in Moses and Monotheism (1939). 
In the dawn of human society, man lived in 
small hordes under the domination of a strong 
male. This strong male was the father of the 
whole horde. He had unlimited power which 
he used sadistically against his sons. Since all 
of the females of the horde were his property, 
the father could and did force the sons into 
sexual abstinence. If they excited his jealousy, 
they were killed or castrated or driven out of 
the tribe. Consequently, the sons were forced 
to steal their wives from other tribes, and this 
led to intertribal warfare, which further 
strengthened group ties within the horde. One 
of the sons, often the favored younger son, was 
chosen by the father to be his successor. 

The sons who were driven out of the tribe 
organized and overcame the father. After kill- 
ing the father, the sons ate him in order to 
produce identification with the father by in- 
corporating a part of him. 

Following this patricide, the sons began to 
quarrel with one another for power. Upon 
discovering that fraternal aggression was dan- 
gerous they formed the first social contract 
which was based upon a renunciation of in- 
stinctual gratification. To ensure group solidar- 
ity they established the taboo against incest and 
the law of exogamy. Thus the first form of 
society developed out of the family. 

Brotherly love was instituted to control ag- 
gression among members of the same society, 
and aggression was drained off to some extent 
by declaring war on alien societies. War serves 
the purpose of keeping the in-group solidified. 


. 
NATURE OF CULTURE 
Freud defines culture or civilization, terms 
used interchangeably in translating the German 
word Kultur, as "the sum of achievements and 
" 
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institutions which differentiate our lives from 
those of our animal forebears" (1930, p. 49). 
Culture serves two functions, it protects people 
against the dangers of the natural world, and it 
regulates the relations of human beings among 
themselves so that they do not destroy one an- 
other. Human existence in communities only 
becomes possible when a number of men unite 
together so that their strength is superior to 
that of any single individual. The substitution 
of the power of a united group of individuals 
for the exclusive domination of a single man 
is, in Freud's opinion, the decisive step toward 
civilization (Freud, 1930). 

However, since a group can become united 
only by renouncing the freedom of individual 
instinctual gratification, civilization is neces- 
sarily frustrating. One gives up pleasure for 
security. However, the masses have no love for 
instinctual renunciation, so they must be co- 
erced into working by an educated and intelli- 
gent minority, Freud wonders whether a change 
in society and universal education can convert 
the hostility of the masses toward culture into 
acceptance of it. Freud doubts whether they 
can, although he considers it a “noble project.” 
Freud's pessimism grows out of his belief that 
people in general are not capable of extensive 
sublimation of aggressive and sexual impulses 
(Freud, 1930) . 

Freud is scornful of the popular notion that 
freedom is one of the major benefits of civili- 
zation, for, as he observes, civilized society 
deprives man of freedom instead of conferring 
it upon him. Society consists largely of impos- 
ing prohibitions upon a man's freedom of 
action, The ideal of society is not liberty but 
equality of treatment for all members of the so- 
ciety — in a word, justice, An unjust society, 
one in which the privileged few are accorded 
special favors while the many are held to a 
different standard of conduct, is bound to be an 
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unstable society, since the masses will tend to 
rise up against the tyranny of injustice (Freud, 
1930) . 

There is a similarity between cultural de- 
velopment and libidinal development. In both, 
the instincts are modified by the emergence of 
the reality principle as the dominant force in the 
individual and the group, by the formation of a 
superego, which is represented in society by laws 
and ideals, and by the institution of various 
defenses on a group as well as on an individual 
basis. Aggression and not sexuality is the most 
powerful obstacle which stands in the way of 
cultural progress. The aim-inhibited traits of 
tenderness and friendliness work in the interest 
of cementing cultural ties among people, while 
the aggressive impulse can give rise to no such 
positive contributions to cultural progress. Ag- 
gression can be drained off in intercultural 
hostility, but this is hardly a satisfactory solu- 
tion, since the aim of warfare is always to 
destroy culture. Freud views the evolution of 
culture as a struggle between Eros and death 
(Freud, 1930). 

It is paradoxical that culture, which was in- 
stituted to reduce human misery, should itself 
produce so much suffering. People are likely to 
overlook the benefits which culture bestows and 
to build up a tremendous amount of resent- 
ment and hostility against the self-abnegations 
imposed by culture. This hostility against cul- 
ture expresses itself in envying primitive people 
who appear to be happier than civilized man 
(but who really are not, says Freud — they are 
probably worse off) and in a desire to regress 
to their level (Freud, 1930) . Resentment against 
civilization also manifests itself in the other- 
worldliness of religion, as well as in many other 
forms, notably crime and antisocial actions. 
Freud also recognizes the role that civilization 
plays in exacerbating mental disorders (Freud, 
1908b) . 


RECENT TRENDS IN PSYCHOANALYTIC THEORIZING 


EGO THEORY 


The prevailing interest among psychoanalytic 
theorists today is the elaboration and reformu- 
lation of Freud's theory of the ego, which he 
enunciated in Group Psychology and the Analy- 
sis of the Ego (1921), The Ego and the Id 
(1923a), and later writings. As Anna Freud 
observed, in reference to these publications of 
her father, "the odium of analytical unorthe- 
doxy no longer attached to the study of the 
ego and interest was definitely focused on the 
ego-institutions” (1936, p. 4). Theorists have 
made the most of this explicit sanction to study 


the ego, and the literature on this topic is an 
expanding one. The direction is away from 
studies of the defensive functions of the ego 
to studies of the synthetic (Nunberg, 1931), 
integrative (French, 1952), organizing (Hart 
mann, 1950) , mastering (Hendrick, 1943) , and 
multiple (Waelder, 1936) functions of the ego. 
Rapaport (1950, 1951a, 1951b, 1951c) has made 
a number of important contributions to €go 
theory, especially in the domain of thought 
processes. Hartmann, Kris, and Loewenstein 
(1947) have insisted that the ego should be de- 
fined in terms of its functions, and their empha- 
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sis has been more upon the healthy, adjustive, 
rational processes of the ego than upon its 
pathological formations. For these writers, how- 
ever, ego defenses are not necessarily pathologi- 
cal or negative in character but may serve a 
healthy purpose in the formation of personality. 
“What developed as an outcome of defense 
against an instinctual drive may grow into a 
more or less independent and more or less 
structured function. It may come to serve dif- 
ferent functions, like adjustment, organization, 
and so on” (Hartmann, 1950). These theorists 
have also revised Freud’s basic hypothesis that 
the ego develops out of the id. They believe that 
both the ego and the id develop out of an un- 
differentiated phase which consists of a “matrix 
of animal instinct” and that consequently the 
ego is a partially independent unit of the per- 
sonality, using neutralized libidinal and aggres- 
sive energy (Hartmann, 1952) . 

This trend of treating the ego as an autono- 
mous system whose origin is parallel with the 
id and which is endowed with autonomous 
functions and energy sources has not gone 
unchallenged by other psychoanalysts. Nacht 
(1952), for example, deplores this new "ego 
psychology,” which he considers sterile and 
regressive. He points out the dangers of 
"byzantinism" which threaten current psycho- 
analytic thinking and asserts that "Psychoanaly- 
sis has nothing to gain from these exegetic 
elaborations in which the letter always takes 
precedence of the spirit" (p. 50). 

The development of psychoanalytic ego psy- 
chology is seen by one writer (French, 1952) as 
offering an opportunity to effect a rapproche- 
ment between psychoanalysis and psychology. 
French himself has attempted to integrate the 
views of Tolman, Hull, Pavlov, and Lewin with 
those of psychoanalytic theorists. 

For social psychology Erikson's formulations 
regarding the ego are probably the most rele- 
vant. He characterizes his book, Childhood and 
Society (Erikson, 1950) , as being in a sense "a 
psychoanalytic book on the relations of the ego 
to society" (1950, p. 12). In this book he tries 
to show by case studies of people living in dif- 
ferent circumstances and in different cultures 
how the ego's development is inevitably bound 
up with the nature of social organization. He 
has also demonstrated how the ego's develop- 
ment is related to changing historical reality 
(Erikson, 1947). Erikson's principal concepts 
are those of ego identity and group identity. 
In its subjective meaning ego identity is an 
awareness of the fact that there is continuity in 
the ego's synthesizing methods and the feeling 
that these methods are effective in safeguarding 
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the sameness and continuity of one's meaning 
for others. Group identity is the group's basic 
ways of organizing experience for the members 
of the group. In general, Erikson has had more 
to say about society's effect upon the person 
than about the person's effect upon society. 


INSTINCT THEORY 


Virtually all theoretical psychoanalysts reject 
or ignore Freud's death instinct. Hartmann, 
Kris, and Loewenstein (1949), for example, as- 
sume an innate aggressive or destructive drive 
which is independent of a death instinct. For 
these writers, sex and aggression are the two 
basic instincts or “instinctual drives,” as they 
prefer to call them, since the term "drive" con- 
notes greater plasticity than the term “instinct” 
does. (See also Hartmann, 1948.) 

Simmel (1944), on the other hand, believes 
that aggressive and destructive behavior is a 
manifestation of self-preservation or, more spe- 
cifically, of eating. Libido is the common 
energy for both sex and aggression. 

More and more, instinct theory is losing 
popularity among contemporary theorists and 
is being handed over to the biologists, since it is 
felt that instinct is a biological concept. This 
trend is but a part of the larger movement to 
"debiologize" psychoanalysis by minimizing the 
role of heredity and maximizing the role of so- 
ciety in the formation of personality. Karen 
Horney (1939) represents an extreme propo- 
nent of this movement. 

Another feature of contemporary psychoana- 
lytic theorizing is the substitution of ontogenetic 
propositions for phylogenetic propositions. 
Hartmann and Kris (1945) attribute Freud's 
predilection for "acquired characters" to the 
biological Zeitgeist of the nineteenth century in 
which he was reared. They do not believe that 
there is sufficient empirical support for the 
Lamarkian explanation of evolution. For Hart- 
mann and Kris, fear of castration is “in the air” 
rather than in the genes. Curiously enough, it 
seems not to have occurred to psychoanalysts 
that it is possible to substitute a phylogenesis 
based upon mutation and natural selection for 
one based upon acquired characters. 


RESEARCH METHODS 


For validating their genetic hypotheses, psy- 
oanalytic investigators have consistently fa- 
vored the method of reconstruction. The past 
of the person, who is usually in the role of a 
patient, is reconstructed from his free associa- 
tions and dreams, yielding causal connections 
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between childhood events and adult formations. 
Within recent years another method, that of 
discovering the origins of personality by ob- 
serving infants and children, has become popu- 
lar. The results of many of these investigations 
have been appearing in the annual volumes of 
The Psychoanalytic Study of the Child. 
Psychoanalysts have also been urged to make 
more use of experimental methods (Kubie, 
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1952) and comparative studies with animals 
(Levy, 1952), but as yet these promptings have 
not borne much fruit. The experimental vali- 
dation of psychoanalytic propositions has been 
pursued primarily by nonpsychoanalysts (Hil- 
gard, 1952). Research of special relevance to 
psychoanalytic theory has been surveyed by 
Sears (1943, 1944) and more recently by Hil- 
gard (1952), Kubie (1952), and Blum (1953) . 


APPLICATION OF PSYCHOANALYTIC THEORY IN THE SOCIAL SCIENCES 


In the judgment of many thoughtful people, 
Freudian thought is one of the main currents 
in the intellectual stream of modern life. There 
is scarcely any aspect of the contemporary scene 
that has not felt the impact of Freud's thought. 
"The social sciences are no exception. 

It would be a tremendous and unrewarding 
task to search out and document every applica- 
tion of Freudian concepts in the social sciences, 
since so many of the applications have been 
made without noting or perhaps without know- 
ing their source. These hidden influences we 
shall not take into account. Instead we shall 
limit our analysis to those self-conscious at- 
tempts to apply Freudian theory in the social 
sciences. It is well to bear in mind that some 
of the contributions have been made by pro- 
fessional social scientists, while others have 
been made by professional psychoanalysts who 
have become interested in a particular social 
science, There are drawbacks on both sides. The 
social scientist is not likely to have intimate 
and personal knowledge of the clinical context 
from which psychoanalytic theory has sprung, 
and he is less likely to view psychoanalysis in 
the full scope of its development than is the 
practitioner of this discipline. The psycho- 
analyst works under the handicap of being an 
amateur in the field to which he is trying to 
apply his concepts, and amateurs are prone to 
make errors and also are likely to be viewed 
with suspicion by professionals. 


ANTHROPOLOGY 


In relative amount the impact of psychoanaly- 
sis upon cultural anthropology has probably 
been greater than upon any other social science 
discipline. Kluckhohn, in discussing the rela- 
tion between psychiatry and anthropology, 
suggests that: “. . . American anthropology, for 
good or for ill, has seemed to find only in psy- 
choanalysis the bases for a workable social 
psychology" (1944b, p. 590). He notes further 
that, with the exception of relatively recent 
contact with Horney,, Kardiner, and Fromm, 


almost all of the influence has stemmed directly 
from orthodox Freudian psychology. 

The first evidence of anthropological interest 
in psychoanalysis did not materially precede 
similar signs in sociology and psychology, but 
the relative importance of this interaction 
eventually became very different. Kluckhohn 
(1944b) suggests that interest in psychoanalytic 
theory on the part of an American anthropolo- 
gist (ten years after similar recognition on the 
part of the English anthropologists, Rivers, 
1926, and Seligman, 1932) was first evident in 
a note published by E. Parsons (1915) in a psy- 
choanalytic journal, It was not until 1920 in a 
paper by Boas that any important notice was 
taken of psychoanalysis, and even here Boas’ 
suggestion that there was merit for the anthro- | 
pologist was surrounded by much more caution - 
than enthusiasm. At about the same time, Kroe- 
ber (1920) reviewed Freud's Totem and Taboo 
(1913) in a highly critical vein, but nevertheless 
indicated there was much of value in the volume 
for the professional anthropologist. Some 
twenty years later Kroeber suggested that he 
still felt his criticisms were warranted but went 
on to say, in connection with the original re- 
view: 


... I found myself somewhat conscience: 
stricken when, perhaps a decade later, I listened 
to a student in Sapir's seminar in Chicago making 
his report on Totem and Taboo, who, like myself, 
first spread out its gossamer texture and then 
laboriously tore it to shreds. It is a procedure too — 
suggestive of breaking a butterfly on the wheel. ` 
An irridescent fantasy deserves a more delicate 
touch even in the act of demonstration of its un- — 
reality. (1939, p. 446) 


In spite of these initial signs of awakening in- 
terest in Freud there seems little question but 
that the early attitude of anthropologists toward 
psychoanalysis was one of firm hostility. This 
initial view was sufficiently marked so that both 
Sapir (1934) and Kluckhohn (1944b), follow- - 
ing him, suggested the possible influence 

personal factors in the professional anthropolo- 
gist, which had led him into his profession in 
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the effort to escape from the | and 
familiar world and which also served to produce 
considerable resistance to the highly personal 
and psychological theories of psychoanalysis. 
The determinants that Sapir points to are, of 
course, double-edged, the same factors that lead 
a person to seek depersonalized experience may 
cause him to become violently interested in in- 
timate and highly personal experience. This 
reversibility may account in part for the extent 
to which psychoanalysis later became a focus 
of interest for many social anthropologists. 

Among the first of the American anthropolo- 
gists to show the influence of Freud were Sapir, 
Malinowski, and Mead. All three of these fig- 
ures were highly skeptical of the general merit 
of psychoanalysis as a tool for the ethnologist. 
In spite of their unwillingness to embrace psy- 
choanalytic theory, however, they were heavily 
influenced by psychoanalytic assumptions and 
hypotheses in their field work and publications. 
Prior to the thirties there was little attempt on 
the part of the ethnologist to publish the de- 
tails of individual life histories or the develop- 
mental process — material of crucial significance 
to the social psychologist and psychoanalyst 
(Kluckhohn, 1944b, 1945). There seems little 
doubt that the changes in anthropological 
modes of field work and publication practices 
were closely related to an increasing interest 
in psychological problems, in part stimulated by 
Freud and his theory. The increased interest in 
this kind of data that was manifested by both 
psychoanalysts and anthropologists during the 
thirties led directly to the formation of a small 
but vocal group of “personality and culture" 
students. Individuals interested in this border- 
line area remain a potent force within contem- 
porary anthropology, and the effectiveness of 
their efforts is demonstrated by many of the 
studies discussed in Chapters 25 and 26. 

We might roughly characterize the influence 
of psychoanalysis upon anthropology in the fol- 
lowing way. First, we find anthropologists work- 
ing vigorously to demonstrate the fallibility of 
formulations involving universal psychological 
processes. Thus, Malinowski (1927) consistently 
and Mead (1928, 1932, 1935) , at least early in 
her career, appeared motivated to provide data 
in order to explode the myth of psychoanalysis. 
Second, there were anthropologists who showed 
an interest in psychoanalysis as a means of 
accounting for puzzling aspects of the data they 
observed and who also tended to shift their 
field-work tactics somewhat to produce data that 
were more pertinent to psychoanalytic proposi- 
tions (Kluckhohn, 1942, 1944a, 1947, 1951; 
Opler, 1935, 1936, 1938; Beaglehole, 1937, 1938). 
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Third, we have a number of examples of the 
fruitful collaboration of anthropologists and 
psychoanalysts in investigation, or at least in the 
analysis of data. Easily the outstanding illus- 
tration of this trend is the work of Linton and 
Kardiner (Linton, 1945; Kardiner, 1939, 1945), 
although there are numerous less well-known 
examples, many of them discussed by Kluck- 
hohn (194b). The Columbia University Re- 
search in Contemporary Culture, described by 
Mead (1951), is perhaps the most ambitious 
presentday attempt at the interdisciplinary 
application of psychoanalytic theory to cross- 
cultural data. Fourth, and overlapping with 
the previous category, are the anthropologists 
who have made serious attempts to equip them- 
selves to collect and interpret data that are 
especially relevant to psychoanalytic theory (Du- 
Bois, 1944; Henry, 1940, 1941; Henry and 
Henry, 1953; Hallowell, 1936, 1941, 1942, 1951) . 
Fifth, there are the small group of psycho- 
analysts (Roheim, 1934, 1943, 1947, 1950a, 
1950b, 1951; Erikson, 1939, 1950; see also Wilbur 
and Muensterberger (Eds.), 1951) who have 
actually conducted field-work investigations of 
their own and interpreted their findings within 
a psychoanalytic framework. 


SOCIOLOGY 


The impact of psychoanalysis upon sociology 
has been considerably less than upon anthropol- 
ogy. Devereux (1952) suggests that this state 
of affairs has arisen in spite of the greater rele- 
vance of psychoanalysis for the problems of the 
sociologist than for those of the cultural anthro- 
pologist. This is a somewhat moot point, as 
the conceptual differences between sociology 
and anthropology are difficult to specify and a 
matter for argument even among experts. How- 
ever, it seems safe to conclude that psycho- 
analysis is as relevant for the problems of the 
sociologist as for those of the anthropologist. 
If this is the case, why is it that the two fields 
have been influenced to such different degrees? 
In a general discussion of the influence of 
psychoanalysis upon sociology, Burgess (1939) 
has considered some of the factors that may have 
dampened the enthusiasm of sociologists for 
psychoanalysis. Among other factors, he sug- 
gests the following as responsible for the so- 
ciologist’s rejection of psychoanalysis: objection 
to representing all behavior as motivated by 
sexual needs, a preference for accounting for 
behavior in terms of cultural determinants, the 
failure of psychoanalysis to integrate with pre- 
vious instinct theories and in addition a general 
trend away from instinct concepts on the part 
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of sociologists, the appearance of deviant psycho- 
analytic groups, and the preoccupation of 
sociologists with their own theories and prob- 
lems. While some of these factors undoubtedly 
played a role in dulling the interest of the 
sociologist in psychoanalysis, few of them seem 
equal to accounting for the difference in the 
reaction of the sociologist and the anthropolo- 
gist, for most are equally applicable to both 
groups, Only in the case of the move away from 
instinct theories does there appear to be an 
important difference between the two dis- 
ciplines. And even here this is apparent pri- 
marily because the leaders in the anti-instinct 
movement were sociologists (Bernard, 1921; 
Faris, 1921). 

If Burgess’ formulation does not differentiate 
between the anthropologist and the sociologist, 
what does? First of all it is clear that historical 
accident must have played some role here, as in 
most complex phenomena. The fact that Sapir, 
Malinowski, and Mead were influenced by psy- 
choanalysis is not alone tied to the fact that 
they were anthropologists. There must also 
have been some accidental factors linked to the 
particular experiences they were exposed to and 
the kinds of people they were. More important, 
the anthropologist, by virtue of his field-work 
experiences, was in much more intimate contact 
with the details of human existence than the 
sociologist. This means that he was exposed to 
data relevant to psychoanalytic formulation and 
also that he was more inclined toward accep- 
tance of the methods of the psychoanalyst, as 
they resembled his own. "Third, by the second 
decade of this century there was a vigorous posi- 
tivistic movement in sociology, with its accom- 
panying stress upon quantitative data and 
careful empirical controls. Certainly, for the 
demographer the subjective and poorly specified 
data of the analyst must have appeared a long 
step backward in the attempt to develop a sound 
discipline concerned with social behavior. 
Among the anthropologists there was no equiva- 
lent affection for quantification and "rigor." 
Fourth, American anthropology under the in- 
fluence of Franz Boas was much more theory-less 
than American sociology and there was conse- 
quently less likelihood that psychoanalysis 
would be rejected because of conflicting or com- 
peting theoretical convictions. Fifth, sociology 
in the early decades of this century is generally 
considered to have been tinged with the goals 
and values of the Protestant ministry. From, 
such a vantage point Freud's pessimistic and 
sensate conception of man was certain to be 
greeted with hostility. Sixth, psychoanalytic for- 
mulations were directly concerned with many 
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classical problems of anthropology (ritual, in- 
cest taboo, kinship patterns, marriage practices), 
while these same topics were of only secondary 
interest to the sociologist. And finally, it is 
evident that Freud's early excursions into areas 
specifically labeled anthropological, and sub- 
sequent applications by Jones (1923, 1924, 
1925) and Reik (1928), made it more difficult 
for the anthropologist to ignore psychoanalysis 
than the sociologist. 

Granted its slowness to respond to psycho- 
analytic theory, what is the current state of 
sociology in this regard? What are the traces 
of psychoanalytic influence? The impact has 
been most continuously evident in the area 
of the family where, understandably, psycho- 
analytic formulations have had a consid- 
erable vogue. From the mid-twenties, writers 
in this area have dealt more and more heavily 
with conceptions tinged with Freud's view of the 
family and the process of socialization, The 
operation of this influence during the thirties is 
well illustrated by the volumes of Folsom (1934) 
and Waller (1938). More recently, the influ- 
ence is clearly shown in the text written by 
Burges and Locke (1945) and culminated in 
the volume by Winch (1952). In the latter we 
find both formulations and actual investigations 
carried out by the author heavily indebted to 
psychoanalysis. Perhaps more impressive is the 
extent to which psychoanalytic formulations 
appear to have influenced sociological theory. 
Easily the best example in this area is the recent 
work of Talcott Parsons (1947, 1950, 1951, 
1952a, 1952b, 1953) who, in his persistent at- 
tempts to develop a general theory of social 
action, has leaned heavily upon psychoanalytic 
concepts in the effort to represent the motiva- 
tional process, The collaboration of Parsons, 
Shils, Tolman et al in the attempt to develop 
an interdisciplinary theory of behavior (Parsons 
and Shils, 1951) drew heavily from psychoana- 
lytic sources. Interesting papers by Green 
(1945, 1946) , Davis (1936), Shils and Janowitz 
(1948), Devereux (1939a, 1939b, 1942), and 
Bales (1945, 1946), as well as papers in the 
volume honoring Roheim (Wilbur and Muen- 
sterberger, 1951), provide further evidence of 
growing interest on the part of sociologists in 
conceptions influenced by psychoanalysis. A 
final important example of the influence of 
Freud upon sociological thought is found in the 
works of David Reisman (1950a, 1950b, 1950c) . 


SOCIAL PSYCHOLOGY 


In 1932 Fay Karpf (1932) surveyed the his- 
tory and current status of social psychology in 
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the United States, and the European back- 
ground from which it developed. Her book does 
not indicate in any way whatsoever that Freud 
has had the slightest effect upon the develop- 
ment of social-psychological thinking. Nearly 
twenty years later, Krech, in his survey of psy- 
chological theory and social psychology (1951) , 
writes 


. it is obvious that much of the current vi- 
tality of social psychological theory is due to the 
influence of psychoanalysis and of those social 
psychologists trained in, and familiar with, the 
concepts of psychoanalysis. (p. 680) 


Certainly Krech's point of view is fashionable 
among contemporary social psychologists. Evi- 
dence of this is provided by the Lewin Memorial 
Address of 1948 in which both speakers (All- 
port and Tolman) alluded to Freud and Lewin 
as the most important sources of contemporary 
influence in this area of psychology. If it is true 
that some time during the past two decades psy- 
choanalysis made its impact upon social psy- 
chology, we should expect to find evidence of it 
in textbooks and reported investigations within 
social psychology. 

A sampling of textbooks written during the 
thirties reveals that psychoanalytic theory, with 
one exception, finds little application or sys- 
tematic attention relative to the problems of 
social psychology. Freud is usually mentioned, 
and sometimes aspects of his general theory, e.g., 
wit, sublimation, taboo, conflict, are presented, 
but often his theories are misrepresented, and 
always the integrity of psychoanalytic theory is 
damaged by referring to concepts out of con- 
text, The one exception to the rule is J. F. 
Brown's Psychology and the Social Order (1936). 
It is obvious that Brown is well grounded in 
psychoanalytic theory as well as in field theory, 
and his serious effort to synthesize these two 
major viewpoints is without parallel in social 
psychology. He is one of the few social psy- 
chologists who has paid much attention to 
Freud's social psychology as set forth in Group 
Psychology and the Analysis of the Ego (1921), 
while his chapter on the structure and genesis 
of personality is, for its day, a succinct and fairly 
accurate resumé of psychoanalytic theory. Like 
most social psychologists, Brown believes that 
Freud overemphasized the role of biological 
determinants in personality formation and so- 
cial interactions, and he is critical of Freud's 
methodology. These latter criticisms are pre- 
sented in more detail in Brown's article on 
Freud and the Scientific Method (1934) . During 
the thirties, then, it may be said that only one 
out of many social psychology textbook writers 
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reacted to Freud in any significant way. Nor 
does a perusal of the first handbook of social 
psychology (Murchison, 1935) change the pic- 
ture, With the exception of two chapters, Wil- 
loughbys on magic and Wells on adaptive 
regression, Freud and Freudian concepts are 
rarely mentioned. 

The social psychology textbooks of the forties 
do not change the picture. Freud is still men- 
tioned, and some of his concepts and special 
theories are presented briefly, but there is little 
evidence that Freud has made much of a con- 
tribution to current social psychological 
thought. In view of Krech's opinion quoted 
above, one might expect to find much psycho- 
analytic theory incorporated in Krech and 
Crutchfield's influential book, Theories and 
Problems of Social Psychology (1948) , but such 
is not the case. There is some talk about un- 
conscious motivation and other psychoanalytic 
concepts, but there is no mention of Freud's 
group psychology. The vitality of this book, as 
so many other recent volumes in social psychol- 
ogy, stems from the theories of gestalt psy- 
chology, Lewin, and Tolman, not from 
psychoanalysis. 

_ Three recent textbooks have been surveyed. 
Queener’s Introduction to Social Psychology 
(1951) observes that there is a great deal of 
irrational behavior in social life, the under- 
standing of which is provided for only by psy- 
choanalytic theory. Yet there is little evidence 
in this book that psychoanalytic concepts have 
been used for understanding anything. Asch's 
Social Psychology (1952) is written entirely 
from a gestalt viewpoint and is critical of 
Freud's negative and pessimistic attitudes to- 
ward man and society. Asch asserts that Freud 
ignored the productive forces of man, that he 
overemphasizes social friction, and that he gives 
a onesided picture of the oppressiveness of 
society on the individual. Nevertheless, Asch 
is of the opinion that the guiding ideas of social 
psychology are derived from behaviorism and 
psychoanalysis. Newcomb's Social Psychology 
(1950) contains more references to Freud than 
to any other single person, and his presentation 
of psychoanalytic theory is more complete than 
that of any other textbook in social psychology 
since J. F. Brown's, although oddly enough he 
does not refer to Freud's Group Psychology and 
the Analysis of the Ego. Yet when one examines 
the substance of what Newcomb has to say about 

*psychoanalysts two features stand out. The re- 
visionists, Fromm, Horney, and Sullivan, con- 
tribute much more to Newcomb's theory of 
social interaction than does Freud, a fact that 
Newcomb frankly acknowledges. The second 
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feature is that Freud has had much less influ- 
ence upon Newcomb than Newcomb has had 
upon Freud. What Newcomb does repeatedly is 
to translate psychoanalytic theory into New- 
comb's version of role theory. As Newcomb 
says, "Many of Freud's contributions — like 
those of other really great men — stand up 
under the necessity of being translated into the 
language of a later day" (p. 376). 

"This survey of textbooks of social psychology 
fails to support Krech's opinion regarding the 
vital influence of psychoanalysis upon social 
psychology. Revisionists like Fromm and Hor- 
ney, who have placed the main accent on cul- 
tural] determinants of behavior, have had 
considerably more influence than Freud prob- 
ably because their views are more in accord with 
the dominant stream of social psychological 
thought in this country. Perhaps one should not 
say that they have had an influence, since they 
have chosen to swim with the current rather 
than to make new ones. They have been more 
influenced than influential. 

When we turn from a survey of textbooks to 
a survey of current research trends in social 
psychology the situation is somewhat different. 
Much of the research in social psychology is, of 
course, largely empirical and atheoretical in 
character. That which is theory oriented de- 
rives most of its testable hypotheses from gestalt 
psychology. Lewin, role theory, perception-cog- 
nition theory, and neobehaviorism. This im- 
pression is fortified by examination of the recent 
edition of Readings in Social Psychology (Swan- 
son, Newcomb, and Hartley, 1952). This 
volume intends to present a representative pic- 
ture of the research in social psychology, and 
a large and diverse committee aided in the 
selection of the studies to be included. Freud's 
name is mentioned only four times in the index 
and in two of these cases the reference is to a 
critical statement. More significant is the inci- 
dence of studies that can be seen as growing out 
of psychoanalytic theory. Even using very gen- 
erous standards not more than a half dozen 
papers seem in any direct way related to psycho- 
analytic theory. This is in striking contrast to 
Lewin's field theory. 

Psychoanalytic theory, however, has made 
some significant contributions to social psycho- 
logical research. One example of this is found 
in the area of prejudice. The authors of the 
widely known volume, The Authoritarian Per- 
sonality (Adorno, et al. 1950) acknowledge: 
their debt to Freudian theory in formulating 
their research hypotheses. Other writers (Acker- 
man and Jahoda, 1950; Loewenstein, 1951; 
Lowenthal and Guterman, 1949; Simmel (Ed.) 
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1946) have used the insights of psychoanalysis 
to illuminate the roots of prejudice. The well- 
known scapegoat theory of prejudice represents 
a relatively straightforward application of psy- 
choanalytic theory (Freud, 1939) and has led to 
several investigations (Miller and Bugelski, 
1948; Lindzey, 1950) . Psychoanalytic theory has 
also been applied in the area of delinquency 
(Aichorn, 1935; Eissler (Ed.) 1949; Friedlander, 
1947). Research in social psychology being con- 
ducted at the Tavistock Institute of Human 
Relations in London is heavily influenced by 
psychoanalytic theory. Especially noteworthy 
are the studies of W. R. Bion on experiences in 
a group (Bion, 1948a, 1948b, 1949a, 1949b, 
1950a, 1950b, 1951). Waelder’s (1939) discus- 
sion of the conditions leading to war is a notable 
application of orthodox psychoanalytic theory 
to a complex social phenomenon. Numerous 
studies concerned with the socialization process 
and child development reflect psychoanalytic 
thinking, as the literature summarized in Chap- 
ter 18 clearly indicates. Bowlby’s (1951) survey 
of studies of the effect of maternal care upon 
mental health provides evidence of the vitality 
of psychoanalytically influenced studies in this 
area. The recent increase in studies concerned 
with social-psychological problems in a cross- 
cultural frame of reference also shows some 
attention to Freudian formulations (McClel- 
land and Friedman, 1952; Chapters 14 and 18, 
this work). The avalanche of investigation that 
followed publication of Frustration and Ag- 
gression (Dollard, Doob, et al, 1939) can be 
considered heavily indebted to Freud's ideas, 
as the authors of this monograph indicate. The 
impact of psychoanalysis has been felt to some 
extent in the study of propaganda or mass 
media, as studies by Kris and Speier (1944) and 
Kecskemeti and Leites (1947, 1948a, 1948b, 
1948c) illustrate. In the investigation of small 
group behavior the work of Bales (1950) and 
Miller and Stine (1951) represent attempts to 
derive insights from psychoanalysis, Finally, 
psychoanalysis has had a great deal of indirect 
influence upon research in social psychology 
through the medium of Lewin and his students. 
A number of Freudian concepts, prominent 
among them displacement and conflict, have 
been adapted and incorporated in Lewin's own 
conceptions and in this form have had heavy 
impact upon contemporary research and con- 
ceptions of social psychologists. 

Are there events that have served to lessen 
the interplay between psychoanalysis and social 
psychology? Certainly one important factor is 
that psychoanalysis is regarded as an “instinct” 
psychology, and social psychologists have been 
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opposed to instincts since the twenties. Another 
clearly significant factor is that Freud's pessi- 
mism regarding society is incompatible with 
the environmentalism and meliorism among 
social psychologists, and their conviction that 
man can find happiness in society as soon as 
certain attainable changes are made in society. 
A third factor is that psychoanalysis has been 
closely associated with psychopathology and for 
the most part psychoanalytic theory has not 
been presented in such a way that the social 
psychologist could easily see the relevance of 
it for his field. Probably a fourth consideration 
is that psychoanalytic theory has not been ade- 
quately taught in American universities. One 
cannot apply something of which one is ig- 
norant. The inadequacy of instruction is asso- 
ciated with the open hostility of many 
psychologists toward psychoanalysis, as well as 
the strong in-group tendencies on the part of 
most psychoanalytic groups. Marjorie Brierley 
(1951), an English psychoanalyst, believes that 
the potential contribution of psychoanalysis to 
social psychology is very great but it is not being 
realized because “disabling tendencies to fission 
and isolation" (p.153) exist in psychoanalysis. 


OTHER SOCIAL SCIENCES 


Aside from occasional instances, psychoana- 
lytic theory has not penetrated extensively into 
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the domains of history, economics, political 
science, and law, although it has had consider- 
able influence upon the humanities. The early 
and important writing of Harold Lasswell 
(1927, 1930, 1935, 1948) and the book of read- 
ings edited by Stanton and Perry (1951) show 
the influence of psychoanalysis on political sci- 
ence. In law, the volume by Jerome Frank 
(1930) depends almost exclusively on Freudian 
theory. In economics there is practically noth- 
ing except Schneider's (1948) effort to compare 
Freudian psychology with Veblen's social theory, 
while in history the influence of Freud is diffi- 
cult to evaluate, although biographers have 
been using psychoanalytic concepts in recent 
years to explain the conduct of their subjects. 
De Saussure, a psychoanalyst, is amazed "that 
psychoanalysis has not been systematically ap- 
plied by historians, because in itself the method 
is historical" (1950, p. 7). Psychoanalysts have 
shown no reluctance in applying their concepts 
and methods in the analysis of historical figures 
and movements. Freud did just this in Leo- 
nardo da Vinci (19102) and in Moses and 
Monotheism (1939). Loewenfeld has remarked 
that "the role of the great man in history is a 
special field for analytic investigation” (1950, 
p. 289), and Fromm (1930, 1941) has applied 
psychoanalytic concepts freely in historical 
analysis. 


SUMMARY 


We have attempted to present a detailed sum- 
mary of Freud’s theory of personality and its 
application to social phenomena. In addition, 
we have tried to summarize briefly some of the 
major features of the influence psychoanalysis 
has exerted upon the social sciences. Our em- 
phasis upon the structure of Freud's theory was 
determined largely by the fact that this theory, 
especially as it pertains to the psychology of 
personality and to social psychology, has not 
always been clearly understood by psychologists. 
There are many reasons for this confusion, 
among them the sheer quantity of Freud's writ- 
ings and the much greater popularity of Freud's 
theories of abnormal behavior. The complexity 
of this theory and limitations of space have 


made it impossible for us to deal with many 
issues of great importance in a full considera- 
tion of this topic. The contributions of Adler, 
Jung, and the neo-Freudians have had to be 
overlooked in spite of their favor among many 
social psychologists. Further, the many difficul- 
ties involved in the use of psychoanalytic theory 
as a guide to thought and research in this area, 
as well as the structural shortcomings of the 
theory itself, have not been mentioned. Our 
sole intention, then, has been to present to the 
reader the core of psychoanalytic theory and to 
convey some of our conviction that it is a poten- 
tial tool of great power in the hands of in- 
terested psychologists. 
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In this chapter, we shall attempt to describe 
in some detail the theoretical and research con- 
tributions of Kurt Lewin and his associates. 
Since the phrase "field theory" has become 
closely identified with Lewin's work in social 
psychology, we shall in a brief introductory 
section indicate some of the characteristics of 
field theory in physics as a means of pointing 
out the rather tenuous connections between 


“field theory" in physics and in social psychol- 
ogy. In a second section, we shall characterize 
the main features of Lewin's “way of thinking" 
about theory in psychology. The third section 
will deal with the basic concepts of field theory 
as developed in individual psychology. In a 
final section we shall discuss "action research" 
and “group dynamics." 


INTRODUCTION 


During the latter part of the nineteenth cen- 
tury new and bold ideas were introduced into 
physics which profoundly altered scientific con- 
ceptions of physical reality. The impact of these 
ideas was not limited to physics. They crumbled 
the foundations of philosophical viewpoints and 
served as a model for new conceptions in the 
biological and social sciences. "These ideas — the 
work of such men as Faraday, Maxwell, and 
Hertz — are commonly referred to by the term 
"field theory." 

Clerk Maxwell in the Preface to his post- 
humous book, Matter and Motion, characterized 
the change which field theory brought about in 
physical conceptions in these words: 

Physical science, which up to the end of the 
eighteenth century had been fully occupied in 
forming a conception of natural phenomena as 
the result of forces acting between one body and 
another has now fairly entered in the next stage 
of progress — that in which the energy of a ma- 
terial system is conceived as determined by the 
configuration, motion of that system, and in which 
the ideas of configuration, motion, and force are 
generalized to the utmost extent warranted by 
their physical definitions. 

To put it in other words, Newtonian mechan- 
ics had asserted that it was possible to explain 
physical phenomena by assuming that simple 
forces were acting between unalterable particles. 
Hence, if one knew the mass, the velocity, the 
direction, the location, etc., of material particles 
one could accurately predict what would occur 
when a number of them interacted. Every 
change of motion, according to Newton, was due 
to a force which arises either through impact — 


as in the case of a baseball being struck by a bat 
— or by an attraction exercised mutually by the 
bodies upon each other, according to his law of 
gravitation. The gravitational force between 
bodies (e.g., between the sun and the earth) 
was assumed to act instantaneously — that is, 
without time — producing action at a distance. 
Space and time were treated as the absolute, 
fixed framework within which the movement of 
a particle would occur — that is, they were re- 
moved from the process itself. 

A series of brilliant experiments by Oersted 
and Faraday on electromagnetic phenomena 
challenged the notion of simple forces acting 
instantaneously at a distance on particles in an 
empty space. These experiments demonstrated 
that it was useful to think of regions or fields 
through which electromagnetic forces were 
spread and to assume that these forces existed 
even when there were no material particles pres- 
ent to which the forces were applied. It was 
found that a knowledge of the properties of the 
electromagnetic field was sufficient to explain 
electromagnetic phenomena and that, given this 
information, knowledge about the source of the 
electromagnetic field was unnecessary. In other 
words, the electromagnetic field had a definite 
reality which was not in any way the end result 


* 1 wish to express my indebtedness to a number 
ofecolleagues for their critical reading of a first draft: 
Drs. Kurt Back, David Emery, Harold Gerard, John 
Harding, Philip Lichtenberg, and Marie Jahoda. In 
addition, this chapter is indebted to an unpublished 
manuscript which was written with the collaboration 
of Dr. Emery. e 
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of distinct, individual particles or sources. As 
Einstein and Infeld (1938, p. 259) have pointed 
out: “It needed great scientific imagination to 
realize that it was not the charges nor the par- 
ticles but the field in the space between the 
charges and particles which is essential for the 
description of physical phenomena.” 

Clerk Maxwell systematized and gave mathe- 
matical form to these experimental findings in 
his field equations. These equations, unlike the 
equations of Newtonian physics, are structural 
equations which rule out action at a distance. 
Einstein and Infeld (pp. 152-153) have de- 
scribed the difference between mechanical laws 
and field laws in these terms: 


All space is the scene of these laws and not, as 
for mechanical laws, only points in which matter 
or charges are present . . . Maxwell's equations 
enable us to follow the history of the field, just 
as the mechanical equations enabled us to follow 
the history of material particles . . . However, 
they do not, as in Newton’s laws, connect two 
widely separated events; they do not connect the 
happenings here with the conditions there. The 
field here and now depends on the field in the 
immediate neighborhood at a time just past .. . 
They allow us to increase our knowledge of the 
field by small steps. In Newton’s theory, on the 
contrary, only big steps connecting distant events 
are permissible. 


Einstein's theory of gravitation, formulated at 
the beginning of the twentieth century, routed 
the concept of action at a distance from its last 
remaining stronghold in the physical sciences. 
The notion of a space filled with material ether 
vanished from physics and was replaced by a 
conception of space as a definitely distributed 
system of gravitational and electromagnetic 
forces and stresses. In the field conception the 
distribution of forces in a given environment 
determines what an object with certain proper- 
ties will do in that environment. Thus, when 
we know the object and observe what it does in 
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a specified environment we can deduce the 
properties of the field in that environment. 

The field concept in physics has served as a 
stimulating analogy for scientific workers in 
other areas. Essentially, “field theorists" in the 
nonphysical sciences have attempted to consider 
the phenomena they investigate as occurring in 
a “field” — i.e., as part of a totality of coexisting 
facts which are conceived of as mutually inter- 
dependent. In other words, "field theory" as 
it is employed in the social and biological sci- 
ences does not refer to theories about physical 
(e.g, gravitational or electromagnetic) phe- 
nomena; rather it refers to a “method of analyz- 
ing causal relations and of building scientific 
constructs" (Lewin, 1951b, p. 45, or 1943a). 
This method, in a manner analogous to that of 
field theory in physics, assumes that the proper- 
ties of any event are determined by its relations 
to the system of events of which it is a compo- 
nent and that "changes here and now depend 
on changes in the immediate neighborhood at 
a time just past." 

'The term "field theory" in psychology has 
been applied primarily to the work of the 
Gestalt psychologists, and it has been employed 
to characterize especially the work of Kurt 
Lewin and his students. However, in view of 
our previous discussion of the meaning of ''field 
theory," it should be evident that there are 
many possible approaches to theorizing in the 
social sciences which could be “field theoretical" 
in spirit. In the domain of social psychology, 
the theoretical writings of Asch, Krech and 
Crutchfield, Festinger, Sullivan, Gardner Mur- 
phy, Newcomb, Coutu, Brown, G. H. Mead, 
Parsons, Cottrell, Barnard, Homans, Simon, 
Heider, Barker, Tolman, Wright, the staffs of 
the Research Center for Group Dynamics and of 
The Tavistock Institute, and many other 
equally prominent theorists are more or less 
"field-theoretical" in their approach. 


THE METATHEORY 


Kurt Lewin and his students have made many 
important substantive contributions to the field 
of social psychology. It may well be, as some 
critics have asserted, that Lewin's brilliant ex- 
perimental studies are not necessary implica- 
tions of his theoretical statements or of the field 
theoretical approach. Nevertheless, anyone who 
has had the opportunity to watch Lewin at work 
or who has attended the annual meetings of the 
“Topological Psychologists” (a group of people 
sympathetic to Lewin's approach) has very 
likely sensed a characteristic "way of thinking." 
It can be sensed in the method of formulating 


questions for psychological research, in the em- 
phases and neglects, in the manner in which 
ideas are linked together, and in the insights 
achieved. 

We shall try to describe some of the main 
features of the way of thinking which charac- 
terizes the "field-theoretical" approach to psy- 
chology. Many of the features of this way of 
thinking are not unique to Lewin's approach. 
The wide impact that Lewin and Gestalt psy- 
chologists have already had on theorizing in 
psychology as well as an increasing interest in 
the nature of theory construction among all 
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schools of psychology have led to a considerable 
overlap in their basic viewpoints. 

A good proportion of Lewin’s writings are de- 
voted to the metatheory of psychology. We shall 
not attempt to quote his writings in any detail 
but rather shall attempt to present the essen- 
tials of the way of thinking which characterized 
his approach. The reader, however, may find it 
useful to consult his relevant writings in Field 
Theory in Social Science, A Dynamic Theory of 
Personality, Principles of Topological Psychol- 
ogy, and The Conceptual Representation and 
the Measurement of Psychological Forces. 

We shall discuss the characteristics of Lewin’s 
approach to theory building under the follow- 
ing headlines: A. The Psychological Approach, 
B. An Emphasis on the Total Situation, C. Sys- 
tematic Rather than Historical Causation, D. 
The Constructive Rather than Classificatory 
Approach, E. The Dynamic Approach, F. 
Mathematics and Field Theory. 


THE PSYCHOLOGICAL APPROACH 


Despite Lewin's use of terms which have the 
flavor of physics — e.g., tension, force, fluidity 
— his theoretical approach is consistently psy- 
chological. His use of concepts like “tension” 
and “force” is not an attempt to derive psycho- 
logical from physical concepts but rather reflects 
his view that what he termed the “logic of 
dynamics” might be similar in the various sci- 
ences. He takes the stand that psychological 
phenomena must be explained in psychological 
terms just as physical phenomena must be ex- 
plained in physical terms. 

Let us briefly attempt to characterize what is 
meant by an explanation of psychological phe- 
nomena in psychological terms. [Our discussion 
is much indebted to Heider (1927), Wright and 
Barker (1950), and Tolman (1932) and relates 
to the molar-molecular distinction which Litt- 
man and Rosen (1950) have thoroughly re- 
viewed.] First of all, this approach assumes that 
it is scientifically respectable to describe the 
phenomena in which psychologists are interested 
in terms that are not reducible or equivalent to 
the terms of physics or physiology. Secondly, it 
postulates that psychology is concerned with 
the individual's behavioral transactions with his 
internal and external environment. Behavior is 
a transaction of an individual, not merely of 
some part of the body. As Wright and Barker 
(1950, p. 79) point out, the individual: 


... does not sweat or salivate, nor does he 
often bend his knees in walking, manipulate his 
tongue in talking, move his eyeballs in reading, 
or bend at the waist in sitting down. He walks, 
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talks, reads, or sits down, leaving his glandular 
and motor apparatus to take care of the sweating, 
salivating, bending, manipulating and all such 
molecular units of behavior which, as molecular, 
are lost to the person in what he actually does. 


The behavioral transactions of an individual 
with his environment are mediated by physical 
and physiological processes; in other words, 
these processes are the mechanisms or tools for 
behavior, not behavior itself. It would make no 
more sense to describe the latter as behavior 
than to describe the movement of a steam shovel 
as behavior when a construction worker is em- 
ploying it to dig a foundation for a building. 
This is not to assert that mediating tool proces- 
ses may not at one time or another be the object 
of behavior — this is particularly likely to be 
the case when the individual is learning how 
to use the given tool; e.g., as the baby learns to 
walk, his behavior is directed at movements of 
his legs. 

The psychological explanation of behavior 
assumes that all behaviors have directional char- 
acteristics. Hence, it is concerned with the 
purposes which underlie behavior and the goals 
toward or away from which behavior is directed. 
There is little direct interest in tools or mediat- 
inig processes per se apart from the interest in 
how they are learned or acquired and in how 
desirable and effective they are considered to be 
by the individual. 

An emphasis on the psychological explanation 
of psychological events in contrast to an ex- 
planation in terms of physical or neurophysio- 
logical processes has several important conse- 
quences. The scientist who works with the 
assumption that psychological phenomena are 
realities, even if they cannot be expressed in 
physical terms or located in physical space, is 
more apt to investigate them. Recognizing that 
people in everyday affairs use such concepts as 
"hope," “desire,” "action," “ability,” “ought,” 
etc., to characterize others as well as themselves 
(and acknowledging that these phenomena have 
reality in their own right), he is more likely to 
feel that the concepts of this everyday psychol- 
ogy are legitimate and important objects for 
investigation by a scientific psychology (Heider, 
1953). That is, he will want to know under 
what conditions a person with given charac- 
teristics will say he is “hopeful” and under what 
conditions this person will say that another 
person is “hopeful.” In other words, he is more 
apt to concern himself with the psychological 
phenomena which occur in everyday life, in- 
cluding the concepts of everyday psychology, 
than is the scientist who feels that he is not being 
scientific unless he can either say “hope” is really 
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a specified pattern of neural excitation in a 
certain region of the brain or that “hope” is 
really a certain pattern of bodily movements 
which can be expressed in physical terms. This 
is not to assert that the psychologist who em- 
ploys a psychological approach in theorizing 
will be content to stop at the level of description 
of the phenomenal properties of psychological 
events. He will want to know how “hope” can 
be characterized in terms of the constructs of a 
psychological theory. Lewin, in his writings, 
continuously stresses that the understanding of 
causal relationships requires one to proceed 
from the more accessible, phenomenal proper- 
ties of psychological events to their underlying 
dynamic or genotypical properties. 
The emphasis on a psychological explanation 
of psychological events leads to an avoidance 
of "achievement concepts" (i.e, of concepts 
which define behavior in terms of what it ac- 
complishes) since, as Lewin (in Rapaport, 1950, 
p. 90) writes, it is impossible "to subsume, un- 
der unitary psychological laws, processes defined 
by achievements." More generally, Lewin ob- 
jects to the use of simple external criteria to 
identify or define a psychological state or proc- 
ess. In effect, he asserts that it is poor psycho- 
logical theorizing to conclude that a person 
who is exposed to a given external event (e.g., 
of being "rewarded") will necessarily interpret 
or experience the event as it was intended, Simi- 
larly, it is poor theorizing to assume that all 
of the effects of a person's behavior (e.g., result- 
ing in another person "feeling insulted") are 
psychologically determined. Lewin's point is 
that to develop psychological laws we must deal 
with psychological processes — the perception 
of the external event, not the event by itself; 
behavior, and not its effects by themselves. Thus, 
we ought to be concerned with the psychological 
impact of a "reward," e.g., whether it is per- 
ceived as a “bribe” or as a "sign of accomplish- 
ment," rather than simply with the external 
situation of reward. In essence, one has to deal 
with what exists psychologically, what is real 
for the person being studied. If a stimulus 
object does not exist for an individual, con- 
sciously or unconsciously, the object cannot be 
thought of as having any psychological effects; 
similarly, if the behavior effect was not con- 
sciously or unconsciously motivated, it cannot be 
thought of as being psychologically determined. 
Emphasis on the study of the person and the 
psychological environment as it exists for the 
person has led critics (e.g., Brunswik, 1943) to 
characterize Lewin's approach às "postpercep- 
tual and prebehavioral" Lewin's lack of de- 
tailed discussion of perception and of action 
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provide a justification for this criticism. Never- 
theless, Lewin denied this, asserting that per- 
ception and action are, in his view, legitimate 
problems of psychology. He wrote (Lewin, 
1951b, p. 57, or 1943a) : 


This view is a necessary consequence of the 
field-theoretical approach according to which the 
boundary conditions of a field are essential charac- 
teristics of the field. For instance, processes of 
perception which should be related to the boun- 
dary zone depend partly on the state of the inner 
part of the psychological field, ie, upon the 
character of the person, his motivation, his cog- 
nitive structure, his way of perceiving, etc, and 
partly on the "stimulus distribution" on the 
retina or other receptors as enforced by physical 
processes outside the organism. 


Lewin was fully aware that the scientist could 
not depend upon intuition in order to appre- 
hend what was psychologically real for another 
person. Rather, he had to infer or construct 
this reality from observable data, such as the 
course and consequences of the person's actions 
and the situations in which action was occur- 
ring. Further, he was aware that “the investi- 
gation and representation of each single case is 
an infinite task in itself and one whose solution 
would presuppose a full knowledge of the laws 
of psychology" (Lewin, 1936a, p. 17). Never- 
theless, his interests and his views as to a fruitful 
strategy for psychology led him to a relative 
neglect of problems of perception and action in 
favor of an emphasis on motivational problems. 
As a consequence, he used a rather “make-do” 
methodology for identifying the specific proper- 
ties of a given individual’s psychological field 
or life space. The tool most commonly em- 
ployed in experimental studies was simply that 
of verbal report: the subject described in his 
own terms what a situation meant to him or 
what he intended. Despite Lewin's awareness 
of the liabilities in employing verbal report, he 
was not blinded to its assets when studying 
capable human subjects. 

Lewin's use of verbal report as a "make-do"' 
methodology and his emphasis on describing 
the psychological environment in terms of "what 
exists for the person" rather than in "objective 
physicalistic" terms has led some critics (e.g. 
Spence, 1944) to mistakenly characterize Lew- 
in's theoretical approach as "phenomenologi- 
cal" Lewin, first of all, never equated "what 
exists for the person" with "awareness," "con- 
sciousness,” or "ability of the person to describe 
verbally"; he realized that many important psy- 
chological influences operate unconsciously. 
"Thus, in the sense of being only concerned with 
what exists in awareness, Lewin was not a phe- 
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nomenologist. Secondly, Lewin realized, as do 
all sophisticated scientists, that the concepts and 
methods of psychology had to be intersubjective 
and, in this sense, psychological processes in 
another person could only be apprehended from 
externally observable data. Hence, in the sense 
that Lewin never assumed that verbal report 
or phenomenological descriptions were simple 
portrayals of the “raw truth,” he was also not 
a phenomenologist. On the other hand, he was 
interested in what people are aware of and he 
was sensitive to the insights into psychological 
processes which could be obtained from sub- 
jective reports. 

Although Lewin believed that the employ- 
ment of achievement concepts would prevent 
the development of unitary, psychological laws, 
he recognized that any long-range or detailed 
prediction of individual behavior required a 
knowledge of nonpsychological facts — a knowl- 
edge of facts which, though not affecting the 
individual at a given time, will come to affect 
him at a later time. For example, if one wishes 
to predict what a particular individual is going 
to do during a Sunday drive into the country 
one will have to be able to predict whether 
or not his automobile will get a punctured tire, 
whether or not there will be a traffic jam, etc., 
as well as what he seeks in his excursion. The 
specific prediction of what a person will do or 
what he will become at some future time re- 
quires a knowledge of the future situations to 
which he will be exposed. The task of discover- 
ing what part of the physical or social world 
will determine during a given period the 
"boundary zone" of the life space has been 
called “psychological ecology" by Lewin. He 
and many of his students have devoted much 
of their research to studies of the psychological 
ecology of different social settings. We shall 
discuss psychological ecology in greater detail 
in a subsequent section of this chapter. 


AN EMPHASIS ON THE TOTAL SITUATION 


The most fundamental construct for Lewin is 
that of the psychological "field" or life space. 
All psychological events (thinking, acting, 
dreaming, hoping, etc.) are conceived to be a 
function of the life space which consists of the 
person and the environment viewed as one 
constellation of interdependent factors. "That 
is, all psychological events are conceived to be 
determined, not by isolated properties of the 
person or his environment, but by the mutual 
relations among the totality of coexisting facts 
which comprise the life space, facts which derive 
from the momentary condition of the individual 
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and the structure of his environment. Indi- 
vidual psychological processes are, in Lewin's 
words, “always to be derived from the relation 
of the concrete individual to the concrete situa- 
tion, and, so far as internal forces are concerned, 
from the mutual relations of the various func- 
tional systems that make up the individual” 
(Lewin, 1935b, p. 41). 

The emphasis on the interrelatedness of the 
person and the environment was one of Lewin's 
major contributions to psychological theorizing. 
Until recently, much of psychology has been 
dominated by what he has termed an "Aris- 
totelian" mode of thinking (Lewin, 1935b, 
Chap. 1). In such thinking, psychological events 
are determined by the characteristics of the 
individual — "instincts," “heredity,” “intelli- 
gence,” “needs,” “habits,” etc. — relatively 
independent of the situation. Since Lewin it 
has become increasingly evident that it is mean- 
ingless to speak of behavior without reference 
to both the person and his environment. State- 
ments such as the following, which ignore the 
situation, are viewed as unacceptable by an ever- 
increasing number of psychologists: “the reason 
he is psychotic is because of his heredity”; “the 
rigidity of his problem-solving behavior is due 
to his ethnocentrism”; “he became the leader of 
his group because of his personality"; "her emo- 
tional outburst was due to her hysteria"; 
"friends work better together than do stran- 
gers"; "a highly cohesive group will be more 
productive than a less cohesive group." 

Lewin's emphasis on the "relation of the con- 
crete individual to the concrete situation" leads 
the scientific investigator to a more explicit 
realization. that understanding behavior re- 
quires not only a knowledge of the person (e.g., 
his past experiences, his present attitudes, and 
his capabilities) but also a knowledge of his 
immediate situation. This is true for under- 
standing everyday behavior and also for under- 
standing the behavior of subjects in carefully 
controlled experiments or diagnostic testing 
situations. In these latter instances, interpreta- 
tion of the responses of a subject requires a 
knowledge of the psychological significance of 
the total experimental or testing situation. To 
cite a simple example, the responses of subjects 
in a locked room to smoke pouring through the 
cracks in the floor and to the "shouts" of fire 
from outside will be determined largely by 
whether they view the smoke to be part of an 

o experimental hoax or to be part of a real fire 
(French, 1944) . 

The emphasis on the total situation has con- 
sequences for research methodology. Instead of 
picking out one or another isolated element 
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within a situation, field theory finds it advan- 
tageous, as a rule, to start with a characterization 
of the situation as a whole. After this first ap- 
proximation, the various aspects of the situation 
undergo a more and more detailed analysis. 
The procedure: 


. is not to add disconnected items but to 
make the original structure more specific and 
differentiated. This method therefore proceeds 
by steps from the more general to the particular 
and thereby avoids the danger of a “wrong simpli- 
fication” by abstraction. (Lewin, 1936a, p. 17) 


In essence, the emphasis in research guided by 
field theory is to study psychological events in 
terms of the units of which they are a part — to 
study them in their relations of interdependence 
rather than as isolated abstractions torn from 
their relational significance. This emphasis 
leads to a methodological interest in studying 
"pure" cases (that is, cases where accidental or 
irrelevant variables are eliminated as far as pos- 
sible) in sufficient detail to understand psycho- 
logical events in terms of the totality of their 
mutual interrelations. Contrasted with this ap- 
proach would be the study of the relationships 
between variables — e.g., intelligence and school 
achievement — by taking a large number of 
cases, none of which are studied in any detail 
from the point of view of their concrete rela- 
tions to their specific situations. The assump- 
tion sometimes underlying this latter approach 
is that the relationship between variables such 
as intelligence and school achievement is inde- 
pendent of the other properties of the situation 
or of the person. Field theorists, on the other 
hand, would assert that the relationship between 
any two variables of this sort is almost inevitably 
influenced by the total situation, and thus they 
would conclude that their relationship should 
be established only after the properties of the 
total situation have been sufficiently charac- 
terized. 


SYSTEMATIC VERSUS HISTORICAL 
CONCEPTS OF CAUSATION 


In the introductory section of this chapter, 
it was pointed out that field theory in physics 
destroyed the notion of “action at a distance" 
and replaced it by structural equations in which 
“the field here and now depends on the field in 
the immediate neighborhood at a time just 
past." Lewin, in a similar vein, has emphasized 
that psychological events (i.e., changes in the 
life space) have to be explained in terms of the 
properties of the field which exist at the time 
when the events occur. He points out that past 
events can only have a position in the historical 
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causal chains whose interweaving creates the 
present situation; they cannot directly influence 
present events. 

This is not to deny the significance of the 
past in indirectly affecting behavior. However, 
even though a past event can create a certain 
condition which carries over into the present, 
it is, nevertheless, the present condition that 
is influential in the present. Strictly considered, 
linking behavior with a past event is an ex- 
tremely difficult undertaking; it presupposes 
that one knows sufficiently how the past event 
affected the psychological field at that time, and 
whether or not in the meantime other events 
have again modified the field. While the notion 
that only the present is influential in the present 
is an obvious concept, one still can find in ab- 
normal psychology textbooks accounts of how 
an event in childhood “caused” a neurosis. A 
more adequate explanation would have to de- 
scribe the "neurotic trends" resulting from the 
childhood event, the vicissitudes of these trends 
during the period between the event and the 
neurosis, and the interactive nature of the pres- 
ent situation and neurotic trends which "result" 
in the neurotic symptoms. 

While Lewin emphasized the principle of 
contemporaneity as one of the basic charac- 
teristics of field theory, he realized that the 
determination of the meaning of any psycho- 
logical event requires the taking into account 
of an appropriate time period. His use of the 
term "contemporaneous" or the phrase "situa- 
tion at a given time" actually does not refer to 
an instant without time extension, but to a cer- 
tain time period. Time differentials in 
psychology — i.e., what is viewed as "contem- 
poraneous" — must be conceived in terms of 
the duration of meaningful psychological units; 
there is no reason to make the a priori assump- 
tion that psychological space-time is infinitely 
or arbitrarily divisible. That is, psychological 
units have to be thought of as having an exten- 
sion in regard to both field and time dimensions, 
and unless the duration of the time-period is 
appropriate to the unit meaningful psychologi- 
cal units will not be observable. 

Lewin's emphasis on employing units of suf- 
ficient time extension has methodological impli- 
cations. He writes that "the first prerequisite 
of a successful observation in any science is a 
definite understanding about what size of unit 
one is going to observe at a given time" (Lewin, 
1951b, p. 157, or 1943b) . He points out that in 
social psychology we have often misinterpreted 
the scientific requirements of analysis and have 
tried to observe as small units as possible and, 
in so doing, we have frequently torn the ob- 
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served units from their context. The fallacy 
in such an approach is that there is frequently 
no way to distinguish among different possibili- 
ties of classifying an action if the observation 
lasts only a few seconds. Thus, if two persons 
A and B are running one behind the other, it 
may mean that either A is leading B or that B 
is chasing A. Only by observation which is suf- 
ficiently comprehensive to include both 4 and B 
and to extend through a period sufficiently long 
does the meaning of the activity become clear. 
(For a further discussion of systematic versus 
historical causation see Chein, 1947.) 


THE 'CONSTRUCTIVE" RATHER THAN 
"CLASSIFICATORY" APPROACH 


Lewin's orientation to science was much in- 
fluenced by the work of Ernst Cassirer, a Ger- 
man philosopher who was concerned with the 
nature of concepts in mathematics and the 
sciences. In his book, Substance and Function 
and Einstein’s Theory of Relativity, Cassirer 
(1923) emphasized the opposition between two 
modes of theorizing which are distinguished by 
the different value which each places upon thing 
concepts and relation concepts. The classifica- 
tory approach, which is characterized by thing 
concepts, assumes that concepts are derived by 
abstractions from particular objects to an ab- 
stract object which embraces the essence of the 
particular object to which it refers. Relations 
or connections, as a rule, are considered among 
the nonessential properties of a concept, and 
thus as capable of being left out of its definition 
without fallacy. In the classificatory approach, 
as we go to "higher" and "higher" concepts 
(e.g., from "cow" to "animal" to "living ob- 
ject,” etc), the conceptual pyramid which we 
form in this way reaches its summation in the 
abstract representation of "something" — the 
all-inclusive "entity" to which every possible 
content is relevant, but which at the same time 
is totally devoid of specific meaning. 

The constructive approach to theorizing 
stresses relational concepts; it points out that 
even the process of abstraction presupposes a 
relational principle by which objects are identi- 
fied as having a common characteristic. In this 
approach, the meaning of any concept is deter- 
mined by its relation to other concepts in the 
system of concepts of which it is a part. The 
reality of specific phenomena are derivable or 
constructible from the relevant constructive 
elements or constructs. 

The ideal of a scientific concept held by the 
constructive approach is not an abstraction 
which disregards the peculiarities and particu- 
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larities of the specific objects to which it is 
relevant but rather a concept which seeks to 
show the necessity of the occurrence and con- 
nection of just these particularities. In other 
words, according to the constructive approach, 
with adequate scientific concepts, there is no 
antithesis between general laws and the indi- 
vidual case. By introducing parameters which 
are appropriate to the individual case we can 
derive individual behavior from a formula 
which is applicable to all cases. 

Lewin characterizes his approach to the de- 
velopment of psychological theory as the con- 
structive method, and he attacks the attempt to 
develop concepts by a process of abstracting 
from phenotypical or historic-geographic data 
Oe, data in which the similarities are defined 
in terms of external rather than psychological 
characteristics). He writes: 


If one “abstracts from individual differences,” 
there is no logical way back from these generali- 
zations to the individual case. Such a generaliza- 
tion leads from individual children to children of 
a certain age or certain economic level and from 
there to children of all ages and of all economic 
levels; it leads from the psychopathic individual 

. to the general category, “abnormal person.” 
However, there is no logical way back from the 
concept “child” or “abnormal person” to the in- 
dividual case. What is the value of general con- 
cepts if they do not permit prediction for the 
individual case? (Lewin, 1951b, p. 61, or 1942) 


Phenotypical data must be transformed into 
the language of constructs ("genotypes," to use 
Lewin's term) before one can hope to develop 
general psychological laws which will be ap- 
plicable to individual cases. The development 
of general psychological laws, however, not only 
requires the coordination of data to constructs 
by definite rules of correspondence and explicit 
operational procedures, but it also presupposes 
that the linkages or relations among constructs 
are guided by an awareness of the conceptual 
dimensions of the constructs. Lewin (1951b, p. 
37, or 1944) points out that “(1) Only those 
entities which have the same conceptual dimen- 
sion can be compared as to their magnitude. 
(2) Everything which has the same conceptual 
dimensions can be compared quantitatively; its 
magnitude can be measured, in principle, with 
the same units of measurement." 

While Lewin does not discuss "conceptual 
dimensions" at any length, he points out several 
major distinctions, which we shall discuss more 
fully as we discuss his specific concepts. Briefly, 
he distinguishes between structural concepts 
(e.g. position, cognitive structure) and dynamic 
concepts (e.g. force, tension) ` between concepts 
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which refer to different regions of the time di- 
mension; between concepts which refer to dif- 
ferent levels of reality; between concepts 
referring to "potential" and concepts referring 
to "actual"; etc. For example, “position” is a 
spatial relation of regions, while “locomotion” 
refers to a "relation of positions at different 
times." "Hope" refers to a relation between the 
structure of the reality level and of the wish 
level of the psychological future. “Guilt” refers 
to the relation between the structure of the 
reality and the wish level of the psychological 
past. 

Lewin's stress on clarification of the concep- 
tual dimensions of psychological constructs 
reflected his feeling that the emphasis on “op- 
erational definitions" of scientific concepts was 
being misconstrued in psychology. He felt that 
some psychologists naively assumed that the 
meaning of a concept is exhausted by its opera- 
tional definition — e.g., "intelligence is what 
is measured by intelligence tests." Lewin, on 
the other hand, emphasized that completely 
adequate scientific concepts not only required 
explicit operational definitions which would 
provide linkages to observable facts but also 
conceptual definitions which established the 
linkage between any given concept and other 
concepts in the system of concepts (or theory) . 
One might go a little further and state that the 
possibility of an unequivocal and consistently 
repeatable operational definition of any psycho- 
logical concept is dependent upon the state of 
psychological theory. "That is, one cannot give 
a precise statement of what is being done (i.e., 
a statement of all the relevant variables which 
interact with one's measurement procedures and 
the variable being measured) as one measures 
"intelligence," for example, unless one has a 
theory which enables specification of the vari- 
ables which may influence either “intelligence” 
in the situation of measurement or the testing 
procedure by which “intelligence” is measured. 


THE DYNAMIC APPROACH 


Lewin frequently uses the phrase "dynamic 
concepts" as an equivalent for such terms as 
"constructs," "genotypes," "conditional-genetic 
concepts," and "intervening concepts." In such 
a usage, any approach to psychology which em- 
ploys constructs to systematize and interpret 
observable data may be called "dynamic." One 
could say that practically all major approaches 
to psychology are, in this sense, dynamic. How- 
ever, a more limited use of the term "dynamic" 
is common. In this more limited usage, the 
common denominator is the conception of liv- 


HANDBOOK OF SOCIAL PSYCHOLOGY 


ing things as systems which tend to maintain a 
dynamic equilibrium in relation to their en- 
vironments. That is, the individual is conceived 
of as a system whose components maintain a 
relative flexibility of interrelations so that varia- 
tion in one aspect of the system may induce 
compensating changes in other aspects, thus 
maintaining the integrity of the system. The 
notion of dynamic or organismic equilibrium 
stresses the fact that the "whole" remains the 
same only because of a patterning or organiza- 
tion which persists in the midst of change. 
Contrasted with the model of dynamic equilib- 
rium is that of mechanism. The model of mech- 
anism 


. implied the notion of a whole which was 
completely equal to the sum of its parts; which 
could be run in reverse; and which would behave 
in exactly identical fashion no matter how often 
those parts were disassembled and put together 
again, and irrespective of the sequence in which 
the disassembling or reassembling would take 
place. It implied consequently that the parts were 
never significantly modified by each other, nor by 
their own past, and that each part once placed 
into its appropriate position with its appropriate 
momentum, would stay exactly there and continue 
to fulfill its completely and uniquely determined 
function. (K. Deutsch, 1951, p. 234) 


Arising out of the conception of the individ- 
ual as an “equilibrium-maintaining system” is 
an interest in the processes by which equilib- 
rium is restored when it is disturbed. In 
psychology, this interest is reflected in an em- 
phasis on motivational processes — e.g., the 
arousal of need-tensions, the setting of goals, 
goal-directed action, the release of tension. 
Much of the research of Lewin and of his co- 
workers was directed toward an understanding 
of motivational processes. Lewin's theoretical 
approach is, in the sense of the preceding sen- 
tence, legitimately grouped with the approaches 
of McDougall and Freud as a motivational or 
"dynamic" psychology. 

AII theories in psychology which use the con- 
cept of equilibrium include more or less con- 
spicuously the thesis that behavior or any other 
kind of psychological change is caused by 
directed entities. Hull's concept of “excitatory 
tendency," McDougall's "propensities," Freud's 
“libido,” Tolman's "theory of purposive be- 
haviorism," etc., all imply directed entities. In 
Lewin's system these directed entities are called 
"psychological forces." In field theory, the vec- 
torial nature of "psychological forces" is ex- 
plicitly recognized and an attempt is made to 
develop a mathematics which would adequately 
handle for psychology the concepts of magni- 
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tude and direction implicit in the concept of 
vector. 


MATHEMATICS AND FIELD THEORY 


Lewin stressed the need for clear understand- 
ing of the formal properties of scientific con- 
structs, and he insisted that the determinants of 
behavior would have to be represented in 
mathematical terms if psychology were to be- 
come a rigorous discipline. He devoted great 
effort in attempts to apply and to formulate 
geometrical concepts which would be of value 
in psychology. His most elaborate effort in this 
connection was his formulation of a geometry 
which he termed “hodological space" (Lewin, 
1938) . 

Although it cannot be said that “hodological 
space" was well developed from the mathemati- 
cal point of view, it represents an extremely 
ingenious attempt to develop a geometry whose 
basic spatial concepts could be integrated with 
dynamic concepts. In hodological space, the 
distinguished path which is used in determining 
the direction between any two points is deter- 
mined by the attractiveness rather than the 
shortness of the path. Direction is influenced 
by such factors as the degree of differentiation 
of the space into subregions, the relative promi- 
nence of wholes versus parts, the properties of 
the field at large. Using the concepts of hodo- 
logical space to define psychological direction 
seems particularly helpful in giving insight into 
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behavior in conflict situations, in situations 
which are cognitively unclear, in situations in- 
volving roundabout routes, in understanding 
the difference between escape (or withdrawal) 
and goalseeking (or approach) behavior, etc. 
It permits an answer to the puzzling necessity 
to ascribe different psychological directions to 
behaviors in the same physical direction if the 
goals of these behaviors are different. 

Although Lewin was constantly directing his 
energies to the development of a more adequate 
formalistic system, he felt that an enthusiasm 
for formal systems might lead to a substitution 
of mere verbalisms for empirically descriptive 
theories. Formalization and mathematization, 
if prematurely done, may lead us to the building 
of a logical superhighway which turns out to be 
a "dead end leading nowhere." A young science, 
if it is wise, 

. . . follows the same procedure used in the 
exploration of the resources of a new land: small 
paths are pushed out through the unknown; with 
simple and primitive instruments, measurements 
are made; much is left to assumption and to lucky 
intuition. Slowly certain paths are widened; guess 
and luck are gradually replaced by experience and 
systematic exploration with more elaborate in- 
struments. Finally highways are built over which 
the streamlined vehicles of a highly mechanized 
logic, fast and efficient, can reach every important 
point on fixed tracks. (Lewin, 1951b, p. 3, or 
1940) 


Basic CONCEPTS or FIELD THEORY As DEVELOPED IN INDIVIDUAL PsyCHOLOGY 


Although much of Lewin's writing is con- 
cerned with the metatheory of psychology, he 
also has made a major effort to develop a spe- 
cific psychological theory. In this section, we 
shall outline some of the major concepts em- 
ployed in his theory — particularly the concepts 
which were developed in his study of individual 
psychology — and shall attempt to indicate 
how these concepts were employed in research. 
It is well to realize at the outset that Lewin's 
theoretical system was never fully developed 
nor was it formalized into a hypothetico-deduc- 
tive system. As a consequence, there is a vague- 
ness and lack of precision about many of the 
concepts; a vagueness which reflects Lewin's 
feeling that theory had to be constructed by a 
method of approximation. 

We shall divide our discussion into four main 
parts: (A) The life space, (B) Structural con- 
cepts, (C) Dynamic concepts, (D) Concepts 
dealing with change in the psychological en- 
vironment. ; 


THE LIFE SPACE 


As we have pointed out in an earlier section, 
Lewin employs the term “life space" (or “psy- 
chological field" or "total situation") to refer 
to the manifold of coexisting facts which deter- 
mine the behavior of an individual at a certain 
moment, Behavior, in other words, is a function 
of the life space: B — f (LS). The life space 
is in turn a product of the interaction between 
the person (P) and his environment (E). In 
symbolic expression, B — f (LS) — f (P, E). 
Unfortunately, Lewin used each of his crucial 
terms — “environment,” “person,” and “be- 
havior” in several different ways. 

Environment. The term "environment" is 
often used to refer to the objective environment 
or stimulus situation (ie. the objective situa- 
tjon which is presented to the individual at a 
given moment: the situation which acts upon 
the individual's perceptual apparatus and upon 
which his motor apparatus acts) . It is this mean- 
ing of the term "environment" which is ap- 
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(b) te) 


Fic. l. THE REPRESENTATION OF THE PERSON. Relations among various strata of the person in different 
circumstances. (a) The person in an easy situation: the peripheral parts P of the inner personal region J 
are easily accessible from outside, E; the more central parts C are less accessible; the inner personal region 
I influences the motor region M relatively freely. (b) The person under stress, in state of self-control: the 
peripheral parts P of the inner personal region J are less accessible than in (a) ; peripheral and central parts 
(C and P) are more closely connected; communication between J and M is less free. (c) The person under 
very high tension: unification (primitivation, "regression") of the inner personal region I. 

Symbols: M, motor-perceptual region. I, inner personal region. P, peripheral parts of J. C, central parts 
of I. E, environment. Bo, dynamic wall between C and P. Bp, dynamic wall between J and M. 
(Adapted from K. Lewin, Principles of Topological Psychology, p. 181.) 


propriate to the statement that the life space 
is "a product of the interaction between the 
person and his environment." Similarly, as 
Leeper (1943) has pointed out, this meaning is 
appropriate to such statements as the following: 


The motor and perceptual region has the posi- 
tion of a boundary zone between the inner per- 
sonal regions and the environment . . . Two 
groups of facts stand in favor of such a represen- 
tation. 

1. Needs or other states of the inner personal 
regions can influence the environment only by 
way of a bodily expression or a bodily action, that 
is, by way of a region which one can call the 
motor region. 

2. We find an intermediate region again when 
we consider the influence in the opposite direc- 
tion, namely, psychological changes of the inner 
personal region resulting from changes of the 
environment. This intermediate region cor- 
responds to the perceptual system in the broadest 
sense of the word, that is, to hearing, seeing, etc. 
(Lewin, 1936a, pp. 177-179) 


Lewin also uses the term "environment" to 
refer to the "psychological environment," which 
is conceived to be the environment as it "exists" 
for the individual. The psychological environ- 
ment is a part of the life space and, hence, its 
properties are determined not only by the 
characteristics of the objective environment but 
also by the characteristics of the person. The 
fact that the psychological environment is an in- 
teractive product is most evident in instances 
where the psychological environment is not ve- 
ridical to the objective environment — e.g., con- 
sider any of the common perceptual distortions 
under intense need. In addition, a little thought 
will reveal that regions of the psychological en- 


vironment often have certain characteristics 
which do not exist in the objective environment 
independently of its relation to a given person 
— eg. the attractiveness or repulsiveness of 
certain objects. 

Person. Lewin employed the term “person” 
in three ways. First of all, he used the term to 
refer to those properties of the individual (e.g., 
his needs, his beliefs and values, his perceptual 
and motive systems) which in interaction 
amongst themselves and with the objective en- 
vironment produce the life space. One might 
say that here Lewin is referring to the indi- 
vidual or organism considered “psychologically” 
rather than physiologically. One could say that 
“person” in this sense is the life space at a time 
just past which, as a consequence of its inter- 
action with a subsequent objective environment, 
has produced the present life space. In a second 
usage, Lewin also employs the term "person" 
as being equivalent to “life space.” His repre- 
sentation of the "person" (see Fig. 1) and his 
representation of the “life space" (Fig. 2) may 
be viewed as different ways of representing 
essentially the same psychological facts. Figure 
2 is more convenient for representing changes 
in the psychological environment, and Fig. 1 
for representing changes in tension systems. 

Lewin also used the term "person" to refer 
to the "person in the life space." The person 
in the life space or "the behaving self," to bor- 
row a more expressive term employed by Tol- 
man (in Parsons and Shils, 1951), is the 
individual as related to the other entities in 
his life space. Just as the "psychological en- 
vironment” may be thought of as the “perceived 
environment," so the "behaving self" may be 
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(a) 


Fic, 2. THE REPRESENTATION OF THE LIFE SPACE. 
FINITELY STRUCTURED FORCE FIELD CONSISTING OF 
MACROSCOPIC REGIONS. (a) Eating situation in case 
of disliked food. C, child; Re, real eating; h, putting 
hand on table; sp, taking spoon; fo, putting food on 
spoon; hw, bringing spoon halfway to mouth; m, 
bringing spoon to mouth; i, taking food into mouth; 
ch, chewing; sw, swallowing; eating has a negative 
valence [Va(Re)<0]; the force away from real 
eating, fc, increases stepwise with the decrease 
of distance between C and Re. 

(b) Change of direction of forces after the child 
started real eating. C, child; Re, real eating; St, 
struggle with adults; Fr, freedom; fore force in the 
direction to freedom; Tea, force away from 
struggle, 

(c) In a later stage of “learning” to eat the dis- 
agreeable food, the situation might be restructured 
so that bringing spoon to mouth (m) is now seen as 
a subpart of the region of real eating (Re). 

(Adapted from K. Lewin, The Conceptual Rep- 
resentations and Measurement of Psychological 
Forces, p. 117.) 


thought of as the individual's perception of his 
relations to the environment he perceives. The 
psychological environment and his behaving self 
are interdependent components of the life space. 
Most observable behavior can be thought of as 
paralleling changes of position of the behaving 
self from one activity region to another in the 
life space. It should be stressed that the proper- 
ties of the behaving self are completely 
expressed in a statement of its relations and rela- 
tive position to the other objects in the indi- 
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vidual's life space. Thus, there are no internal 
relations within the behaving self, and hence 
it may be considered as an undifferentiated re 
gion for representational purposes. 


There is some evidence to indicate that Lewin 
would have disagreed with our view that he employs 
the term "person" in the different meanings de- 
scribed above. For example, in his Principles of 
Topological Psychology he has a chapter entitled 
“The Person as a Differentiated Region in the Life 
Space” and in this chapter he advances two major 
reasons why the “person in the life space” should be 
considered as a highly differentiated object. He 
states that different parts of the body often carry out 
different activities at the same time and, secondly, 
that a person cannot be considered a unit, since a 
change in one need does not change all needs in the 
same direction and to the same extent. His second 
reason clearly explains why the “person” of Fig. 1 
must be considered as being differentiated. (The 
differentiation in the life space would, of course, 
reflect the differentiation in the person.) The first 
reason — i.e., the possibility that "the person in 
the life space" is engaging in more than one activity 
simultaneously — is adequately handled by assuming 
that the behaving self is in an overlapping situation, 
a concept much employed by Lewin, or that there is 
more than one behaving self. Apart from many other 
considerations, the notion of a differentiated “per- 
son in the life space" makes it rather difficult to 
apply the concept of a force acting upon the person, 
since the point of application of the force is am- 
biguous. 


Behavior. The term "behavior" has been em- 
ployed to refer to any change in the life space 
— ie, any change which is subject to psycho- 
logical laws. Thus, not all movements of the 
person nor all changes in the environment re- 
sulting from actions of the person are considered 
behavior. For example, if a child is moved from 
an automobile to his home while he is asleep, 
his movement does not represent behavior; or 
if a boy hits a baseball and the baseball acci- 
dentally breaks a window, the "breaking of the 
window" cannot be considered to be the boy's 
behavior. Behavior, in this sense, takes place in 
the life space rather than in an observable space 
— or, to use Koffka's (1935) terms, in the be- 
havioral rather than in the geographical en- 
vironment. It is clear that in this usage 
"behavior" is not directly observable; it must 
be inferred. That is, just as attractiveness of 
any region in the life space must be inferred, so 
too must change of attractiveness be inferred 
and, similarly, change of position of the behav- 
ing self as well as its initial position must also 
be inferred. Lewin has also used "behavior" 
in its more common-sense meaning of the ob- 
servable interaction between the individual and 
the objective environment. 
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The characteristics of the life space (or per- 
son) are deduced from observed behavior in an 
observed environment. However, since we know 
that many features of the life space tend to 
remain the same over long periods of time (e.g. 
a person's enjoyment of tennis on certain types 
of occasions) despite momentary variations in 
other characteristics, we can — knowing these 
characteristics and knowing the observable be- 
havior — make inferences about the objective 
environment. For example, knowing that 


- +. the people who feel at home in a given 
community, who seem to fit best, are either highly 
dependent or authoritarian in personality pro- 
vides some insight into the characteristics of the 
community. In a similar way, if, upon investigat- 
ing the individuals who feel thwarted by a situa- 
tion, we find that they are the young and 
ambitious people, the individuals with consider- 
able personal initiative, the nonconformists, we 
would also be likely to develop some appreciation 
of the nature of this situation. (Jahoda, Deutsch, 
and Cook, 1951, p. 46) 


The dimensions of the life space. Lewin 
points out that the life space of the newborn 
child may be described as a field which has rela- 
tively few and only vaguely distinguishable 
areas. Future events or expectations do not 


(a) 


Ps past 


(b) 


Ps present 
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exist; the child is ruled by the situation imme- 
diately at hand; he has no conception of his past 
experiences. The life space, in other words, has 
no time dimension. 

An outstanding characteristic of the change 
of the life space during development is an in- 
creasing differentiation. This includes an in- 
creasing differentiation in the time dimension 
(see Fig. 3 for a representation of the time 
dimension in the life space). Plans extend 
further into the future, there is an increased 
tolerance of delay, and activities of increasingly 
longer duration are organized as one unit. As 
the individual matures, his picture of the possi- 
bilities which exist in the future play an in- 
creasingly important role in determining his 
morale. 

The differentiation of the life space brings 
with it a differentiation of the reality-irreality 
dimension (see Fig. 3). The different degrees 
of irreality correspond to different degrees of 
fantasy. They include both the wishes and the 
fears. A daydream, a vague hope, has in general 
less reality than an action; an action has more 
reality than a “talking about it"; a perception 
is more real than an image; a faraway “ideal 
goal” is less real than a goal that determines 
one’s immediate action. Dynamically, the level 


Ps future 


More 


Ps future 


Fic. 5. DIMENSIONS OF THE LIFE SPACE. The life space at two developmental stages. The upper draw- 
ing represents the life space of a younger child. The lower diagram represents the higher degree of differen- 
tiation of the life space of the older child in regard to the present situation, the reality-irreality dimension, 
and the time perspective. C, child; R, level of reality; Z, level of irreality; Ps past, psychological past; Ps pres- 


ent, psychological present; Ps future, psychological future. 


cial Science, p. 246.) 


(Adapted from K. Lewin, Field Theory in So- 
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of irreality corresponds to a more fluid medium 
and is more closely related to the central layers 
of the person. This fact is particularly impor- 
tant for the understanding of the use of play 
techniques and projective methods in the study 
of personality. 


. . -+ The level of irreality in the psychological 
future corresponds to the wishes or fears for the 
future; the level of reality to what is expected. 
. . . Hope corresponds to a sufficient similarity 
between reality and irreality somewhere in the 
psychological future; guilt to a certain discrepancy 
between reality and irreality in the psychological 
past. In the young child, truth and lying, percep- 
tion and imagination, are less distinguished than 
in an older child. This is partly due to the fact 
that the younger child has not yet developed that 
degree of differentiation of the life space into 
levels of reality and irreality which is characteris- 
tic of the adult. (Lewin, 1951b, p. 245, or 1946a) 


Psychological ecology. Physics treats its space 
as a closed system of causes and effects in the 
sense that all physical changes are conceived 
to be the results of conditions or changes within 
the same physical space. The matter is not so 
simple in psychology, since it is clear that the 
life space is affected by events which originate 
outside the life space. As a consequence, psy- 
chology must take into account physical and 
social facts which obey nonpsychological laws 
and which determine the external events that 
will impinge upon the perceptual motor regions 
of the person. The problem of taking into ac- 
count the physical-biosocial world (the non- 
psychological milieu) in which the person is 
immersed and of understanding the way it is 
transformed into a psychological environment 
has been termed the task of psychological ecol- 
ogy (Wright and Barker, 1950). 

The nonpsychological milieu cannot have di- 
rect effects upon behavior because its parts are 
alien to psychological environments, except as 
its parts get transformed into goals, barriers, and 
other psychological facts through perception. 
However, as Wright and Barker (1950, p. 15) 
point out: 


It is nonetheless true that the nonpsychological 
milieu affects behavior indirectly. It provides the 
physical and social raw materials for psychological 
habitats; and by supplying these resources it goes 
far to determine just what behavior shall and 
shall not be possible for all who live within it. It 
is possible for a [second-grade child in Midwest] 
to have a psychological habitat that brings about 
fraternizing with adolescents at school because 
the Midwest grade and high schools are in the 
same building. It is [nearly] impossible for 
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[a child in Midwest] to have a psychological habi- 
tat that brings about balancing on train rails 
because Midwest has no railroad. 


A characteristic of the nonpsychological 
milieu is that many of its features which are 
essential to behavior are relatively constant or 
predictable through long sequences of behavior. 
The chair on which you sit, the floor on which 
your feet rest, the light which enables you to 
work, remain relatively constant, and hence it 
is not necessary to adjust your behavior as you 
work to constantly changing conditions in the 
nonpsychological milieu. 

An important problem for psychology is that 
of considering what should be included in the 
nonpsychological milieu and how the included 
features should be characterized. It seems clear 
that the interests of the psychologist in charac- 
terizing the nonpsychological milieu is not 
similar to that of other scientists — ep. the 
meteorologist, the economist, the physicist, the 
biologist, the sociologist — concerned with as- 
pects of this milieu. For one thing, the psycholo- 
gist, as a psychologist, is not interested in the 
origins or causation of the conditions that go to 
make up the nonpsychological milieu. He is 
not concerned with why a given chair will hold 
only a person weighing less than 200 pounds, 
but rather with the fact that the chair will not 
support the "sitting behavior' of a person 
weighing above this amount. That is, the psy- 
chologist is interested in characterizing the 
various components of the nonpsychological 
milieu in terms of the perceptions and manipu- 
lations they will support or permit, the percep- 
tions and manipulations they will coerce, and 
the chain of events in the nonpsychological 
milieu which will be released by psychological 
interaction with any of its components. 

‘There are various ways of characterizing the 
nonpsychological milieu. One can characterize 
it as Chein (1954, p. 116) has done in terms 
of "the viewpoint of a psychologically so- 
phisticated observer who presupposes psycho- 
logical individuals with wants and desires, 
but who is not at the moment particularly 
concerned with any one person." Chein dis- 
tinguishes the following features of the non- 
psychological milieu: I. Stimuli (anything 
which is capable of initiating a change in the 
stream of activity) ; II. Goal objects and noxi- 
ants (objects or situations which can serve as 
satisfiers or which can produce pain or un- 
pleasantness) ` III. Supports and constraints 
(features of the environment which make par- 
ticular behaviors feasible or infeasible); IV. 
Directors (features of the environment which 
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tend to induce specific directions of behavior) ; 
V. Global features of the environment (e.g., its 
stability, its degree of organization) . 

The nonpsychological milieu can be charac- 
terized in terms of the different kinds of “ob- 
jects” which compose it. Thus, Parsons and 
Shils (1951, p. 254) list the following types of 
objects toward which action can be relevant: the 
self, others as personalities, collectivities or 
groups, organisms (including one’s own and 
others’), physical objects, cultural objects (be- 
liefs, symbols, standards, etc.) — and they indi- 
cate some of the means of characterizing the 
action-relevant properties of these different types 
of objects. 

Heider (1953), in a very stimulating unpub- 
lished monograph, deals extensively with the 
question of “changes whose source does not lie 
in the person but in the environment.” Follow- 
ing Angyal’s (1941) usage he has termed these 
changes “heteronomous changes.” He charac- 
terizes the difference between physical objects 
and people or groups in terms analogous to 
the distinction between perception and action. 
He points out that in perception, the initial 
focus (the object in the environment) does not 
guide the causal process through to the final 
focus (the perception) ; the fate of the waves 
emanating from the object is not controlled by 
and is therefore independent of the object. In 
action, the initial focus (the plan) guides the 
causal process to its final focus (the event). 
Action, conceived of as a certain kind of struc- 
ture of events in the system comprising the 
person and the environment, is characterized 
by an equifinality, the invariance of the end 
with the variability of the means. The end 
state is enforced so that a series of independent 
pushes tends to lead to a unitary effect. 

A person (or group) is the cause or the source 
of heteronomous change in a different sense 
than is a physical object, since he has greater 
control over the causal lines emanating from 
him. Heider indicates that if you meet a person 
who has certain intentions toward you (suppose 
he wishes to kill you), it means that your en- 
vironment contains conditions which are con- 
vergently directed toward this state and, if the 
person has enough power, this state will sooner 
or later be brought about, whatever you do. 
Compare this, on the other hand, with avoiding 
falling stones on a mountain trail; the stones 
don't change their direction of fall and follow 
you if you jump out of their way. r 

In the inorganic world any complex process 
with specific properties depends on a great many 
conditions, and it will occur only if all these 

conditions are met. However, if a person is 
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involved (assuming he has sufficient power) 
there is only one condition necessary for an 
event to occur: an appropriate intention. The 
fact of personal causality, that a person can 
bring about an effect if he intends to do so 
regardless of widely varying circumstances, 
makes people uniquely significant components 
of the nonpsychological milieu. Physical objects 
do not “wish” to nor are they able to create 
events which have as their terminal focus a 
specified change in an individual's life space. 
Only people can have the desire and power to 
determine the contents of an individual’s psy- 
chological environment. Since the human being 
is from birth the terminal focus of actions of 
others, it is evident that the understanding of 
his behavior requires that he be understood in 
his interrelations with others. 

Wright and Barker (1950) have described a 
useful concept for characterizing those parts of 
the nonpsychological milieu which tend to ''co- 
erce" similar psychologial environments or 
habitats in different people. They use the term 
"behavior setting" to refer to any part of the 
nonpsychological milieu that is generally per- 
ceived by the people of a community as ap- 
propriate for particular kinds of behavior. 
Some examples of a behavior setting are: a doc- 
tor's office, a social party, a classroom, a free 
play period, a fishing trip, a movie, the status 
position of a schoolteacher, breakfast, dressing, 
the status position of a father, homework, 
Fourth of July celebration. Behavior settings 
include positions in a social structure (e.g., boss, 
foreman, and worker) as well as regions which 
are more easily visualized in terms of geographi- 
cal locus. 

It is evident that behavior settings are "co- 
ercive" primarily because there exist shared 
frames of reference or collective intentions as 
to what is appropriate behavior in a given be- 
havior setting or with a given behavior-object. 
As defined by Wright and Barker, a behavior 
setting always has two sides. On the one side, 
there is always a part of the nonpsychological 
milieu, which may be physical, social, or both, as 
in the respective instances of a screwdriver or a 
vacant lot, a traffic law or a social clique, and a 
writ of habeas corpus or a school classroom. 
On the other side, there is always a social norm 
or a shared frame of reference that defines the 
setting as one which is appropriate for particu- 
lar kinds of behavior. 

It would be misleading if our discussion in 
the foregoing paragraphs led to the interpreta- 
tion that a nonpsychological milieu would 
coerce different people to have similar psycho- 
logical reactions only if it has interpersonal 
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significance. The fundamental identity in the 
functioning of our respective nervous systems 
"coerces" individuals with similar experiences 
who are exposed to similar environments to per- 
ceive and to react similarly. As Asch (1952, p. 
129) has pointed out, we are able to enter rela- 
tions with others only because 


. the space that surrounds us possesses the 
same directions for all people; the structure of 
time — its extension into the past and the future, 
the order of events that mark off its periods — is, 
or can be, the possession of all; the relations of 
cause and effect are similarly perceived. When 
we act with others we do so on the basis of this 
objective, open field that surrounds us and which 
stands in a similar relation to each of us. 


STRUCTURAL CONCEPTS 


"Topology may be defined as a branch of geom- 
etry which investigates the properties of figures 
that remain unchanged under continuous trans- 
formation or "stretching." These are the quali- 
tative relationships of connection and position. 
Early in his career, Lewin felt that topology 
might be particularly fitted to the special prob- 
lems of psychology. The concern of Gestalt 
theorists with part-whole relationships, with 
“belongingness” and “membership character” 
suggested the relevance of topological concepts 
to psychological questions. 

Although topology is a young mathematical 
discipline which has been developed almost ex- 
clusively within the past 50 years, it is now a 
complex and high-powered mathematical tool. 
The “topological concepts” employed by Lewin 
are, from the point of view of mathematical 
topology, of a rudimentary nature and are torn 
out of their mathematical context. Mathemati- 
cians have sometimes seen little relationship 
between the “topological concepts” employed by 
Lewin and mathematical topology. Lewin, how- 
ever, has written in implicit answer to mathe- 
matical objectors that: 


The mathematician will have to realize . . . 
that to apply a system of mathematical concepts 
in an empirical field one does not necessarily have 
to prove directly the adequacy of the basic mathe- 
matical axioms of this system one by one. It is 
well to prove the fruitfulness of some of the de- 
sired propositions of this mathematical system 
for the representation of the empirical properties 
of the field in question. If the representation of 
spatial relations in physics by Euclidean geometry 
had not been permitted until its axioms (such 
as the divisibility ad infinitum of any part of the 
space) were proved one by one to hold also for 
the physical space, physics could never have used 
Euclidean geometry. All one can say is this: if 
one coordinates certain physical processes to cer- 
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tain geometrical entities one can make certain 
physical predictions . . . Exactly the same pro- 
cedure is followed if certain psychological proc- 
esses (such as social locomotion) are coordinated 
to certain entities of topological or hodological 
geometry (such as path). There can be no other 
meaning and no other proof of the applicability 
of these geometries to psychology than the fruit- 
fulness of predictions based on such coordination. 
(Lewin, 1951b, p. 22, or 1940) 


Lewin's answer does not adequately meet the 
objections of his mathematical critics; they feel 
that Lewin, although employing some of the 
terms of topology, has not used any of the 
theorems of mathematical topology. However, 
regardless of the mathematical rigor of Lewin's 
"topological" concepts, they serve an important 
function in his theoretical system. In general, 
one may say that their function is to allow one 
to determine which events are possible in a 
given life space and which are not. Dynamic 
concepts are necessary, in addition to those of 
topology, in order to determine which of the 
possible events will actually occur in a given 
case. 

Let us illustrate Lewin's use of topological 
concepts to characterize "which events are pos- 
sible" by reference to Fig. 4, which depicts the 
life space of a student who is considering how 
to spend his evening. Let us assume that he 
envisages only four possibilities: seeing a movie, 
visiting a friend in the hospital, writing a term 


Fic. 4. TOPOLOGICAL REPRESENTATION OF THE PSY- 
CHOLOGICAL SITUATION OF A STUDENT CONSIDERING 
FOUR WAYS OF SPENDING HIS EVENING. The individual, 
P, is in the position of choosing among four activi- 
ties: visiting a sick friend, a}; writing a paper, b4; 
seeing a movie, c,; or earning some money, d. He 
sees the path to a, as involving calling the hospital, 
as, and traveling to the hospital, a. The path to 
b, is twofold, consisting of either going to the 
library, b,; reading at the library, bg; taking notes 
at the library, by; or of working with the books and 
notes already at home, b’. The path to the goal of 
seeing a movie is seen to require a trip downtown, 
ca, and the buying of a ticket, c5. The individual 
perceives no path leading from his present position 
to region d. 
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paper, and earning some money; that is, only 
these alternatives exist psychologically and, 
therefore, only they are part of the life space. 
From the diagram, one can further see that for 
this particular student “going to a movie” re- 
quires that he “go downtown” and “buy a ticket 
to the movie" before he "sees the movie.” "'See- 
ing a movie," in other words, is only possible if 
it is preceded by the activities of "going down- 
town" and "buying a ticket." We would expect 
that if by chance he found that he did not 
have the money to buy the ticket after going 
downtown that he would not see the movie, 
since the only region connected with "seeing 
the movie" is "buying a ticket." The regions 
of "sneaking into the movie" or of "getting 
someone else to buy his ticket" do not exist 
psychologically and, thus, if he cannot buy a 
ticket, it is impossible for him to see the movie. 
The other alternative ways of spending the eve- 
ning could be similarly discussed. For example, 
Fig. 4 indicates that even if the library were 
closed, another way of going about writing his 
term paper exists as a possibility. On the other 
hand, for this student, it does not make sense 
to travel to the hospital to visit his friend with- 
out first calling up to see whether or not visiting 
is permitted. It is also evident that the activity 
of "earning money" will not be engaged in, since 
there doesn't exist, psychologically, a linkage 
or path between what the student is now doing 
and this particular activity. In other words, he 
doesn't see how he can get to "earning money" 
from his present position. 

Let us employ Fig. 4 to illustrate some of 
Lewin's specific topological concepts. One of 
his basic concepts is that of region, which may 
be defined as any distinguishable part of the life 
space (or person). Regions of the psychological 
environment refer to present or contemplated 
activities rather than to the objective areas in 
which activities occur or to the external objects 
with which activities are linked (Leeper, 1943, 
pp. 92-95) . A child who is using the back porch 
as a stage and a chair as a throne is in the region 
of “play acting" (on the porch with a chair as 
a throne), rather than in the region of the 
"porch." It should be obvious, however, that 
the characterization. of any activity will be 
facilitated by knowledge of its time and locale 
and of the objects with which it is concerned. 
In Fig. 4, all of the areas enclosed by a line 
represent regions (e.g., the region of considering 
how to spend the evening, the region of going 
downtown, the region of buying a ticket). Some 
regions include subregions (e.g., visiting one's 
friend in the hospital may include subregions 
of giving him a persent, encouraging him, amus- 
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ing him). However, psychological activities are 
not thought to be infinitely divisible. The 
degree of differentiation of a region refers to the 
number of subparts within a region. Any region 
which has no distinguished subparts may be 
called a cell. 

The position of the behaving self, P, is de- 
fined by the activity region in which it is lo- 
cated, Any change of position of any region 
within the life space is conceived to be a loco- 
motion. The term “locomotion,” however, is 
primarily used in reference to locomotion of the 
behaving self rather than in reference to loco- 
motion of parts of the psychological environ- 
ment. In Fig. 4, a locomotion would take place 
if P moves out of the region of “considering how 
to spend his evening” into any other region, or 
if some rearrangement of the psychological 
environment took place so that, for example, the 
student now views “visiting his friend” as lead- 
ing to, rather than away from, “seeing a movie.” 
(This might occur if he called up the hospital 
and found out that they were showing movies 
to the patients and their visitors.) 

Locomotion from one region to another in- 
volves movement of the behaving self from its 
present to its terminal position through a path 
of neighboring regions. Two regions are neigh- 
boring if both regions have a common boundary 
and are otherwise foreign to each other. The 
boundary of a region consists of those cells in 
the region for which there is no surrounding 
boundary that lies entirely within the region. 
Regions 4,, 45,. . ., A, are said to form a path 
if A, is touching 45, A, is touching 45, . . s 
A,—1 is touching A, In Fig. 4, there is no 
psychologically existing path between “‘consid- 
ering how to spend the evening" and “earning 
some money,” while there are two possible paths 
between “considering how to spend the evening” 
and “writing a paper.” The distinguished path 
between any two regions is the path along which 
the individual anticipates that he will locomote 
if he chooses to proceed from one region to 
another. It is, in other words, the preferred or 
psychologically best path. 

As we have pointed out in an earlier section, 
Lewin developed a geometry he termed “hodo- 
logical space" to represent direction in the life 
space. As in other geometries, the concept of 
direction is based upon the concept of distin- 
guished path. However, it is evident from the 
preceding paragraph that the concept of dis- 
tinguished path in hodological space takes into 
consideration the cognitive structure and dy- 
namic properties of the life space. A change 
in the psychological environment (ie, in the 
cognitive structure) or of the attractiveness of 
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its various features is a sufficient condition for 
changing the direction between regions. 

Direction at A toward B is defined as the re- 
lation between the regions A and B which is 
determined by the beginning step of the dis- 
tinguished path from the region A to B. Four 
types of direction of locomotion can be distin- 
guished (the definitions which follow represent 
an adaptation of Leeper's modifications of Lew- 
in's original definitions) : (1) direction toward 
— any locomotion from one region to another 
which increases the force toward or decreases 
the distance from the terminal focus of locomo- 
tion (or goal-region); (2) direction away — 
any locomotion from one region to another 
which decreases the force toward or which in- 
creases the distance from the terminal focus of 
locomotion; (3) opposite direction — two 
paths have opposite direction if the distin- 
guished path for locomotion toward one 
goal-region corresponds with the most unfavor- 
able path for locomotion toward the other 
goal-region; and (4) direction at “right angles" 
(Leeper, 1943, p. 205) — a path is at right angles 
to another path if locomotion along it does not 
change the strength of force toward the terminal 
focus of the other path or if locomotion along 
it does not change the distance to the terminal 
focus of the other path. 

In addition to the preceding types of direc- 
tion, Lewin distinguishes between equal and 
partly equal directions. If one compares two 
paths from one region to different goals which 
share the same first step(s), the degree of 
equality of direction at the starting point ap- 
proaches zero if these paths show a high degree 
of psychological unity (all steps in the path 
tend to be fused as part of an inseparable 
whole) as well as if each shows a very low 
degree of unity (so that each step is viewed 
as relatively separate and independent from 
the other steps in the path). Forces with partly 
equal directions summate, but not as completely 
as forces in fully equal directions. 

Without elaborating, let us point out that 
Lewin's discussion of the psychological unity 
of a path requires that a path, such as the chain 
of regions A-B-C-D-E connecting A and E, be 
conceived as a series of overlapping paths which 
consist of A-B, A-B-C, A-B-C-D, and A-B-C-D-E. 
If the potency of A-B-C-D-E is high then there 
may be no distinguishable subunits on the path. 
If, on the other hand, the potency of any sub- 
path is high, the total path A-B-C-D-E may not 
exist as a psychological unit (for a discussion 
of the concepts of overlapping situations and 
potency see several paragraphs below). 
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While the factor of psychological distance is 
of crucial importance in studying problems of 
the strength of forces, Lewin developed the con- 
cept of distance only sufficiently to indicate that 
one path is longer than another if it includes 
the other path within itself; however, the fact 
that one path is longer than another does not 
imply that it includes the shorter path within 
itself. However, in a subsequent Analysis of the 
Concepts: Whole, Differentiation, and Unity 
(Lewin, 1951b, or 1941), he defines the mini- 
mum distance between any two cells as the 
minimum number of boundaries crossed by a 
path from one of these cells to the other. This 
later definition, which has also been employed 
by Bavelas (1948) in a modified form, runs into 
the difficult question of whether it is fruitful to 
consider all cells (or undifferentiated psycho- 
logical activities) as being of the same size. 

We have indicated in a preceding paragraph 
that direction in the life space is dependent 
upon the cognitive structure. Thus, if the in- 
dividual has no clear knowledge of the sequence 
of steps necessary to achieve a given objective 
(e.g., to solve a mathematical problem, to influ- 
ence a policeman not to give him a speeding 
ticket, to escape from a burning building), the 
direction of locomotion necessary to obtain his 
goal is not known. Most new situations are 
cognitively unstructured, since the individual is 
likely to have no knowledge of “what leads to 
what.” Hence it follows that in such situations 
the individual is unlikely to know what direc- 
tion he should follow to achieve his objective. 
As he strikes out in any given direction he does 
not know whether it is leading him toward or 
away from his goal. His behavior will be ex- 
ploratory, trial-and-error, vacillating, and con- 
tradictory rather than parsimonious. If reach- 
ing the goal has positive significance and not 
reaching it has negative significance for the 
individual concerned, then being in a region 
that has no clear cognitive structure results in 
psychological conflict, since the direction of 
the forces acting upon him are likely to be both 
toward and away from any given region. There 
will be evidences of emotionality as well as 
cautiousness in such situations. In addition, the 
very nature of an unstructured situation is that 
it is unstable; perception of the situation shifts 
rapidly and is readily influenced by minor cues 
and by suggestion from others. 

This characterization of some of the psycho- 
lógical implications of cognitively unstructured 
situations is of considerable usefulness to social 
psychology. For example, Lewin has employed 
this concept to give insight into the situation of 
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the adolescent (1939a or 1951b). He pointed 
out that the change from child to adult is a 
shift to a more or less urrknown position: 


. .. Entering a new social group can mean 
something very similar to being thrown into a 
cognitively unstructured field, being forced to 
stand on unfirm ground and not knowing whether 
the "right thing" is being done. The uncertain 
character of the adolescent's behavior and his 
conflicts can partly be explained by the lack of 
cognitive clarity concerning the adult's world 
which he is going to enter. It follows that this 
uncertainty is greater the more the individual 
has previously been kept out of the adult world 
and has been kept in the dark about it... [In 
addition] the time of sexual maturity brings with 
it changes which make this part of the life space, 
which is so close and vital to the individual, 
strange and unknown. In this case the change 
does not mean merely the usual uncertainties of 
a new and strange environment; but, in addition, 
a region which previously appeared to be well 
known and reliable becomes now unknown and 
unreliable. This change necessarily. shakes the 
belief of the individual in the stability of the 
ground on which he stands and perhaps even in 
the stability of the world at large. Since the 
region of the body happens to be very important 
and central for everyone, this doubting might be 
rather fundamental. (Lewin, 1951b, pp. 138-139, 
or 19392) 


The concept of lack of cognitive structure has 
also been fruitfully applied to characterize the 
situation of members of an autocratic group 
where decisions are arbitrarily made (or made 
on a basis which is unclear to the members) by 
the person in authority. It also has been used to 
give insight into the psychological situation of 
minority group members (Lewin, 1948a, or 
19352) , of people suffering from physical handi- 
caps (Barker, Wright, Gonick, 1946), of the 
nouveaux riches, or of other persons crossing 
the margin of social classes. It may be applied 
to any situation in which the consequences of 
behavior are seemingly unpredictable or uncon- 
trollable, in which benefits and harms occur in 
an apparently inconsistent, fortuitous, or arbi- 
trary manner, in which one is uncertain about 
the potential reactions of others to being in a 
certain activity region. 

The concept of overlapping situations has 
been employed to characterize instances where 
the person is on more than one path at the 
same time. The simplest example is that of 
divided attention: a student eating his lurch 
and trying at the same time to study for an 
examination. One could analyze most instances 
of experienced conflict as being examples of 
overlapping situations — €g- the role conflict 


experienced by some career women between 
being “professional” or being “feminine” at any 
particular instant. 

Barker, Wright, and Gonick (1946) have 
pointed out five aspects of overlapping situa- 
tions: (1) Consonance. The extent to which 
the overlapping situations require activity that 
is more or less congruent is called their conso- 
nance. Overlapping situations can vary from 
situations leading to identical behaviors (i.e., 
paths have same directions) to situations re- 
quiring completely antagonistic behavior (i.e., 
paths have opposite directions). (2) Potency. 
The influence of one situation relative to all 
simultaneously acting situations is called the 
relative potency of the situation. A person in 
the overlapping situations of “reading the pa- 
per” and “conversing with wife,” in which the 
latter situation is of relatively higher potency, 
will predominantly converse with his wife, with 
perhaps an occasional glance at the headlines 
or a slight feeling of annoyance. (3) Valence. 
An important characteristic of overlapping 
situations is their relative valence or desirabil- 
ity. (4) Barriers. A fourth characteristic of 
overlapping situations is the nature of the per- 
ceived barriers or obstacles in each of the situa- 
tions. (5) Extent of common parts. This char- 
acteristic refers to the extent of the part common 
to the overlapping regions. 

While the representation which is employed 
in topological diagrams such as Fig. 4 is useful 
in depicting the possibilities which exist in the 
life space, it obviously does not tell us which 
of the alternative possibilities will be chosen. 
The selection among the possibilities will be 
determined by such dynamic factors as the rela- 
tive strength of the goals of “seeing a movie,” 
“visiting the friend in the hospital,” “writing 
the paper"; the attractiveness of the activities 
which are necessary to reach each of the dif- 
ferent goals; the probabilities that the goals will 
be reached by the available paths. 

Lewin's "topological" representation — (of 
which Fig. 4 is a typical example) is most ade- 
quate for describing the activities that an in- 
dividual perceives as being necessary for the 
attainment of a given objective and, thus, for 
depicting alternative courses of action as they 
appear in the life space of the acting individual 
at the planning or decision-making stage. It 
should be made clear that the relations repre- 
sented in a topological diagram can be trans- 
lated without loss of exactness into statements. 
However, the diagram is often a more con 
venient way of expressing the relations among 
a limited number of facts than would be a series 
of statements. On the other hand, the topologi- 
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cal diagram is very awkward and, at times, even 
a hindrance when one attempts to depict certain 
types of perceived possibilities. This is true 
when the means-ends relations are numerous or 
complex. Moreover, Heider (unpublished 
manuscript) and Leeper (1943) have pointed 
out that the “topological” diagram is incon- 
venient when one wishes to represent the suf- 
ficient rather than the necessary conditions for 
an event. For example, a child may perceive 
"eating jam" as being a sufficient but not a 
necessary condition for being punished. The 
fact that “eating jam" irrevocably leads to pun- 
ishment in the child's view is difficult to repre- 
sent without the use of arbitrary devices; 
on the other hand, there is no difficulty in 
representing high school graduation as a neces- 
sary condition for entering college. Heider has 
also pointed out that the “topological” diagram 
is inconvenient when the perceived source of 
change in the life space is not the actor but is 
somebody else or an impersonal source. Thus, 
it is difficult to represent topologically the ex- 
pectation that “if my boss gives me a raise, I 
will be able to buy a new car.” Similarly, it is 
difficult to represent topologically instances 
where one is the recipient of an event — e.g. 
that a child expects to have a good time when 
his grandparents visit. 

In addition to the foregoing difficulties often 
encountered in “topological” representation, 
Lewin's attempt to go beyond topology to treat 
problems of distance and direction in a manner 
suitable for psychology in his ""hodological 
space" was not mathematically successful. For 
example, although the concept of distance is 
basic to the definitions of direction which are 
employed in hodological space, it was never 
more than partially defined. Lewin was well 
aware of the provisional nature of “hodological 
space" and in his last years was engaged, with 
the aid of some of his students and some mathe- 
maticians, in attempts to develop a more ade- 
quate geometry. Bavelas (1948), one of the 
most ingenious of Lewin's students, developed 
a mathematical model which has considerably 
more mathematical rigor to it than “hodological 
space." Instead of defining the distinguished 
path in terms of dynamic factors, the criterion 
of a distinguished path is purely a positional one 
— that of "minimal regions to be crossed by a 
path from a to b." Employing five basic assump- 
tions and a series of definitions developing from 
these assumptions, Bavelas is able to give un- 
equivocal meaning to a number of concepts 
which seem to have applicability to personality 
and group theory: "structures," “outermost re- 
gion of a structure," "innermost region of a 


structure," "central region of a structure," 
"peripheral region of a structure," “maximum” 
and "minimum" distances within a structure, 
"steps toward or away, or neither toward nor 
away from a specified cell,' "a straight line 
path," etc. Some of these concepts have already 
been employed fruitfully in experimental stud- 
ies of different types of communication struc- 
tures in groups (Leavitt, 1951). Harary and 
Norman (1952), working at the Research Cen- 
ter for Group Dynamics, have employed the 
mathematical theory of graphs (a branch of 
combinatorial topology) to the study of com- 
munication networks and group structures. 

Let us conclude this section by indicating 
that although Lewin's "topological" and “hodo- 
logical" concepts are not much more than 
diagrammatic representations of his brilliant 
theoretical and experimental insights, it would 
be foolish to neglect the suggestive value of his 
imaginative attempt to develop a geometry suit- 
able for psychology. He pointed out the need 
for a new mathematics based on axioms dif- 
ferent from those of the mathematics developed 
for the physical sciences. He indicated some 
of the properties that would be required of a 
geometry adequate to handle psychological 
space and he stimulated a more widespread in- 
terest in the development of such a geometry. 


DYNAMIC CONCEPTS 


In Lewin's system, dynamic concepts have the 
function of enabling one to determine which 
of the possible psychological events will occur. 
Just as his geometrical concepts are not inde- 
pendent of dynamic factors, so too his dynamic 
concepts are defined in a way which presup- 
poses an adequate geometry. However, despite 
this interdependence of concepts in his formal 
"topological and vector psychology,” many of 
Lewin's basic dynamic concepts were formu- 
lated prior to his explicit use of topological 
concepts. 

The concept of tension system. The begin- 
nings of Lewin's dynamic conceptions can be 
traced back to his controversy with Ach. Ach 
(1910) had conducted a series of experiments 
concerned with the learning of paired nonsense 
syllables in which the learned association be- 
tween the previously paired nonsense syllables 
(e.g., between rik-tik) was put in conflict with 
the test instructions which called for a different 
wpe of response than that required in the learn- 
ing situation. For example, the subject would 
be asked to invert the stimulus syllable and thus 
be required to respond to "rik" with "kir" in- 
stead of the "tik" which he had learned to 
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associate with “rik” during the initial learning 
situation. Ach found that reaction time in the 
test situation was influenced by whether the “‘as- 
sociative tendency" acquired in the initial 
learning coincided or conflicted with the “de- 
termining tendency” resulting from the in- 
structions in the test situation. While Ach’s 
experiments convincingly demonstrated conflict 
and delayed reaction time, Lewin argued that 
the conflict was not between “association” and 
a determining tendency but between two de- 
termining tendencies. It was his belief that 
there is no “force” within mere association to 
lead to reproduction, that reproduction itself 
must be motivated. 

To prove his contention, Lewin conducted a 
series of experiments (Lewin 1917, 1922), that 
were much like Ach’s except for a few minor 
modifications. They demonstrated that the ex- 
istence of an association is not, per se, sufficient 
for recall of the associated syllable on presenta- 
tion of the stimulus syllable and, thus, for inter- 
fering with a different response to the stimulus 
syllable. He reaffirmed that an association (or, 
to use his terms, a cognitive structure) does not 
provide the "motor" for mental activity. A 
tension system is always necessary for activity, 
including the activity of reproducing previously 
learned nonsense syllables. 

The concept of tension system plays a central 
role in Lewin’s theorizing, and it has been the 
stimulus to a series of ingenious experiments 
on recall and resumption of interrupted activi- 
ties, substitute activities, and satiation. It is 
most appropriately applied in connection with 
the person as represented in Fig. 1 (see p. 190). 
A system in a state of tension is said to exist 
within the individual whenever a psychological 
need or an intention (sometimes referred to as 
a quasi-need) exists. "Tension is released when 
the need or intention is fulfilled. Tension has 
the following conceptual properties: (a) it is a 
state of a region or system $ which tries to 
change itself in such a way that it becomes 
equal to the state of its surrounding regions 
$1,82...,$9, and (b) it involves forces at the 
boundary of the region S in tension. 

There exists a definite relation between ten- 
sion systems of the person and certain properties 
of the psychological environment. In particular, 
a tension may be related to a positive valence 
for activity regions in the psychological en- 
vironment which are perceived as tension-re- 
ducing, and a negative valence for the region ih 
which the behaving self is at present. However, 
the existence of a region of positive valence (a 

goal region) depends not only upon the exis- 
tence of tension but also upon whether or not 
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there are perceived possibilities for reducing the 
tension. The individual who has no conception 
of how to reduce a tension, and who therefore 
perceives no goal region, will have his behavior 
determined by his desire to leave his present 
negative region for another, and to leave that 
for another, and so on. His behavior will be 
characterized by locomotion in the direction 
away from his present region — or, in observ- 
able terms, by restless movement. Search be- 
havior, occurring in the situation where the 
individual knows a goal region exists but does 
not know its location, has many similarities 
to restless activity but is distinguished from 
it by the fact that the direction toward the goal 
is dominant and the leaving of the present re- 
gion is only a means to that end (Lewin, 1938, 
pp. 62-63). 

When a goal region which is relevant to a 
system in tension exists in the psychological 
environment, one can assert that there is a force 
acting upon the behaving self to locomote to- 
ward the goal. A tension for which there is 
a cognized goal leads not only to a tendency to. 
actual locomotion towards the goal region but 
also to thinking about this type of activity. 
This may be expressed by saying that the force 
on the person toward the goal exists not only 
on the level of doing (reality) but also on the 
level of thinking (irreality) . 

From the foregoing assumptions about sys- 
tems in tension it is possible to make a number 
of derivations. Thus, it follows that the tendency 
to recall interrupted activities should be greater 
than the tendency to recall finished ones. Zei- 
garnik (1927) and many others have conducted 
experiments in which subjects are given a series 
of tasks to perform and are then prevented from 
completing half of them (selected at random) . 
Later, the subjects are asked to recall what tasks 
they had performed. The results are presented 
in the form of a quotient, commonly called the 
Zeigarnik quotient (ZQ): 


unfinished tasks recalled (RU). 
completed tasks recalled (RC) 


Zeigarnik predicted a quotient of greater than 1. 
The obtained quotient was approximately 1.9, 
clearly supporting Lewin's assumptions. How- 
ever, since many completed tasks were also re- 
called, it was obvious that additional factors 
were involved. Analyzing the situation of the 
subject at the moment of recall, she concluded 
that in addition to the force on the person to 
think about, and hence to recall, the incom- 
pleted tasks, there was also present a force to 
recall both incompleted and completed tasks 
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induced upon the person by the experimenter's 
instructions to "try to recall the tasks you 
worked on earlier" The Zeigarnik quotient 
could be viewed as a function of the relative 
strengths of the induced force to recall all tasks 
and of the force to recall the incompleted tasks. 
As the strength of the induced force increases 
in relation to the force toward the task goals, the 
quotient should approach 1; as it decreases in 
relative strength, the quotient should increase 
beyond 1. These additional predictions, which 
follow from an analysis of the situation at 
recall, were borne out in experiments by Zei- 
garnik and others. Thus, if the strength of 
motivation associated with the interrupted task 
is relatively high, or the strength of the experi- 
menter's pressure to recall is low, or if the task 
is interrupted near its end, the Zeigarnik quo- 
tient will be high. 

A number of more recent experiments have 
indicated that the situation of recall is fre- 
quently even more complex than indicated 
above. When not finishing a task is interpret- 
able as a personal failure (e.g., in an experiment 
where the tasks are presented as measures of a 
socially esteemed ability) and when recall of 
failure threatens one's self-esteem, or when the 
recall of success raises a lowered self-esteem) , 
the Zeigarnik ratio tends to be less than one. 
For a more thorough discussion of the effects 
of self-esteem involvement on the Zeigarnik 
quotient see Alper (1946, 1948), Glixman 
(1948), Prentice (1944), Rosenzweig (1943), 
and Zeller (1950). 

It should be noted that the terms "unfin- 
ished" and "completed" in the Zeigarnik quo- 
tient should refer to the subject's view of the 
tasks rather than to whether or not they were 
objectively completed. If the subject achieves 
his goal in relation to the task even though 
objectively it is unfinished, one would expect 
no force toward recall and, contrariwise, if the 
subject's goal is not attained even though the 
task is completed, one would expect the tension 
to persist. Data from Zeigarnik's experiment and 
from an experiment by Marrow (1938) support 
the notion that "psychological" reality rather 
than "objective" reality is what is significant 
here in predicting an individual's behavior. 

From the general assumption that when there 
is a need present for a certain goal there cor- 
responds a force causing a tendency to locomote 
toward the goal, it follows that as long as the 
tasks are psychologically incomplete the subject 
should continue to try to perform the task (pro- 
viding there are no sufficiently strong counter 
forces). Ovsiankina (1928) created conditions 
such that the subject thought he was inter- 


rupted by chance and at a later time he was left 
free to do as he wished in an experimental room. 
There was 100 percent resumption of the in- 
terrupted task. When the interruption ap- 
peared to be intentional, there was still 82 per- 
cent resumption. 

Since the construct of tension refers to the 
state of one system relative to the state of sur- 
rounding systems, it presupposes a geometric 
representation of the person and a distinction 
between functional subparts or "systems" within 
the person. Lewin (1935b), in conceiving of 
the person as a series of differentiated subsys- 
tems, is rejecting the catch phrase "everything 
is related to everything else" as misleading, since 
there is a great range of interdependence be- 
tween the subsystems of the person. Much of 
the research of Lewin and his students is con- 
cerned with the question of what factors deter- 
mine the interdependence of tension systems 
and with the "fate" of tension with varying 
conditions of action. 

Lewin postulated that the change in the ten- 
sion difference between any two tension systems 
would depend upon the time interval and the 
degree of interdependence of the two systems. 
With increased time, one would expect the ten- 
sion differences between interdependent systems 
to decrease. Thus, Zeigarnik in her experiments 
on the recall of interrupted activities and Ov- 
siankina in her experiments on the resumption 
of interrupted activities both found that as time 
elapses there is a decreasing tendency to recall 
or resume the interrupted activities. 

The degree of interdependence of two tension 
systems is conceived to be a function of the 
degree of fluidity or rigidity of the person and 
of the structural relations or connections be- 
tween the two systems. The fluidity of the per- 
son is affected by general states of the person. 
For example, a person is considered to be more 
fluid when tired, when he is undergoing a wave 
of emotional tension, when he is young, when he 
is in a “make-believe” or irreal situation, when 
he is more intelligent, etc. Experiments by 
Zeigarnik have indicated that in more fluid 
states, the tension differences between systems 
tend to dissipate more rapidly. 

Experimental studies of rigidity by Kounin 
(1941) , in which he compared three groups of 
subjects of equal mental age (an old feeble- 
minded group, a young feeble-minded group, 
and a normal group) , led him to conclude that 

D 


. any performance which requires a certain 
degree of communication between neighboring 
regions—, is to such an extent made difficult for 
the older and/or more feeble-minded individual. 
The task may be predominantly of a cognitive 
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nature . . ., or a motor nature, .. . or of a 
volitional nature. If a task is facilitated by a lack 
of communication between neighboring regions, 
such a task will be more efficiently and accurately 
performed by an older and/or more feeble- 
minded individual. 


The structural relations between systems refer 
to such questions as whether or not two systems 
are subparts of a larger system, whether or not 
two systems have a relationship of simple de- 
pendence, interdependence, or organizational 
dependence, and the proximity of or distance 
between regions. While the problem of charac- 
terizing structural arrangements in logically 
strict conceptual terms has received only scant 
attention (Bavelas, 1948), (Lewin, 1951b, Ap- 
pendix), a number of interesting experiments 
have been conducted which deal with structural 
relations, The experimental studies of substi- 
tution are relevant in this connection. 

Lissner (1933) initiated the study of the value 
which one activity has for reducing a tension 
originally connected with another activity by 
a technique of resumption; some of the other 
experimenters have employed the technique of 
recall. The substitute value is measured by 
the amount of decrease in resumption or recall 
of the interrupted original activity after a sub- 
stitute activity has been completed. The results 
of the experiments on substitute value can be 
summarized briefly as follows: (1) Substitute 
value increases with the perceived degree of 
similarity between the original and the substi- 
tute activity and with the degree of difficulty 
of the substitute activity (Lissner, 1933). (2) 
Substitute value increases with increasing tem- 
poral contiguity between the original and the 
substitute activity and with the attractiveness 
of the substitute activity (Henle, 1942). (3) The 
substitute value of an activity (e.g, thinking, 
talking, or doing) depends upon the nature of 
the goal of the original task. Tasks that are 
connected with the goal of demonstrating some- 
thing to another person (e.g., the experimenter) 
require an observable substitute activity (not 
merely "thinking" without social communica- 
tion) ; "realization tasks," in which the building 
of a material object is the goal, require “doing” 
not only telling how it can be done; for intel- 
lectual problems, talking (or telling how it can 
be done) can have a very high substitute value 
(Mahler, 1933). (4) “Magic solutions," “make- 
believe solutions," or solutions which observably 
violate the requirements of the task have little 
substitute value for tasks at the reality level. 
However, if the situation is a make-believe or 
play situation, make-believe substitutions will 
have substitute value (Sliosberg, 1934; Dembo, 


1931). (5) A substitute activity which is iden- 
tical with the original activity will have little 
substitute value if it does not serve the same 
goal Thus, building a clay house for John 
will have little substitute value for building a 
clay house for Mary. If the emphasis is on 
building a clay house and not upon the for 
somebody, then, of course, substitution will 
occur (Adler and Kounin, 1939). (6) Having 
somebody complete a task that the subject is 
interrupted in tends to have little substitute 
value, particularly when completion of the task 
by oneself is related to self-esteem. However, 
when pairs of individuals work cooperatively 
on a task, the completion of the task by one's 
partner has considerable substitute value (Lew- 
is, 1944, and Lewis and Franklin, 1944) . 

The research findings with respect to substi- 
tute value have implications for a wide range 
of problems in psychology — from the relative 
gratification value of individual versus socially 
shared projective or fantasy systems to the de- 
velopment of specialized roles within a group. 
Let us briefly illustrate with the very important 
finding that the actions of another person can 
be a substitute for one’s own actions if there is 
a cooperative relationship. The fact of substi- 
tutability enables individuals who are working 
cooperatively on a common task to subdivide 
the task and to perform specialized activities, 
since none of the individuals in a cooperative 
situation has the need to perform all the activi- 
ties by himself. In contrast, the individual in 
a competitive situation is less likely to view the 
actions of others as substitutable for similarly 
intended actions of his own. Thus, when a com- 
petitive situation exists in a group, there is less 
likely to be a development of specialization in 
activities (Deutsch, 1949a, b) . 

The concept of tension systems has also been 
fruitfully employed in experimental studies of 
satiation. One can distinguish in regard to most 
needs a state of hunger, of satiation, and of 
oversatiation. These states corresponds to a posi- 
tive, a neutral, and a negative valence of the 
activity regions which are related to a particular 
need or tension system. Karsten (1928) has 
studied the effect of repeating over and over 
again such activities as reading a poem, writing 
letters, drawing, and turning a wheel. The main 
symptoms of oversatiation appear to be (1) the 
appearance of subunits in the activity leading 
to the disintegration of the total activity and a 
loss of meaning of the activity; (2) increasingly 
poorer quality and greater frequency of errors in 
performing the task; (3) an increasing tendency 
to vary the nature of the task, accompanied by 
a tendency for each variation to be quickly 
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satiated; (4) a tendency to make the satiated 
activity a peripheral activity with an attempt to 
concentrate on something else while doing the 
task — this attempt is usually not completely 
successful and the mind wanders; (5) increas- 
ing dislike of the activity and of similar activities 
with an increased valence for different tasks; 
(6) emotional outbursts; (7) development of 
“fatigue” and similar bodily symptoms which 
are quickly overcome when the individual is 
shifted to another activity. 

Satiation occurs only if the activity has, psy- 
chologically, the character of marking time or 
of getting nowhere. If the activity can be viewed 
as making progress toward a goal, the usual 
symptoms of satiation will not appear. Em- 
bedding an activity in a different psychological 
whole so that its meaning is changed has prac- 
tically the same effect on satiation as shifting 
to a different activity. The rapidity with which 
satiation occurs depends upon (1) the nature of 
the activity — with increasing size of its units 
of action and with increasing complexity satia- 
tion occurs more slowly, (2) the degree of cen- 
trality of the activity — other things being 
equal, activities which are of more significance 
to the person are more quickly satiated than 
peripheral activities, (3) the state of the person 
— the more fluid the state of the person the 
more quickly he is satiated. The rate of satia- 
tion and cosatiation of similar activities (i.e., 
the spread of satiation effects from one activity 
to similar activities) decreases with age and 
with lack of intelligence (Kounin, 1941). 

Most of the phenomena of satiation can be 
explained by assuming that continued perform- 
ance of a task leads to a lowering of the tension 
level in the system corresponding to the task; 
with increasing repetition the system corre- 
sponding to the task may be represented as 
reaching a lower level of tension than the sur- 
rounding systems, This should lead the person 
to turn away from the repeated task to other 
activities. With time, the person will also turn 
away from activities which are related to tension 
systems that are interdependent with the sati- 
ated system and turn toward activities which are 
not so related to the satiated activity. Thus, the 
tendency is to turn away from similar activities 
and to turn toward dissimilar activities. The 
breakdown of the satiated activity into disor- 
ganized units fits in very well with Lewin's 
analysis of the “differentiation and unity of a 
whole based on simple dependence” (Lewin, 
1951b, pp. 305-335, or 1941). 

The concept of tension system has also been 
employed to give insight into some of the effects 
of frustration. Barker, Dembo, and Lewin 
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(1941) in their study of Frustration and Regres- 
sion have indicated that psychological develop- 
ment can be characterized as resulting in the 
following structural changes: (a) an increased 
differentiation of the person into subsystems, 
which is reflected in a wider variety of behavior, 
emotions, skills, knowledge, etc.; (b) a change 
of organization of the person so that subsystems 
of the person are hierarchically related to one 
another rather than related to one another by 
simple interdependence. ‘Thus, as the individual 
grows older some systems become more domi- 
nant and other systems serve as their “tools”; 
in infancy, interdependence of systems is more 
characterized by the simple diffusion type of in- 
fluence. Barker, Dembo, and Lewin believed 
that the structural changes brought about by 
psychological development could be reversed, 
at least momentarily, by creating a high level 
of tension in a subsystem of the person and by 
not permitting this tension to be reduced 
through activity toward a relevant goal. Pre- 
senting a frustrating situation, i.e., one in which 
an individual is prevented from reaching a de- 
sired goal, is one way of creating such tension. 

To study the effects of frustration, they cre- 
ated an experimental situation where children 
(ages 3 to 5) who had been allowed to play 
with some extremely desirable toys were taken 
to another part of the room which separated 
them from these toys by a wire partition. Here 
they were allowed to play with less desirable 
toys. Their behaviors, including the construc- 
tiveness of their play activities, were observed 
and compared during the period when they 
could play with the desirable toys (free play 
period), during the period when they could 
not play with the toys (frustration period) , and 
during a later period when they were again 
allowed to play with the desirable toys (the post- 
frustration period). The results of the experi- 
ment indicate that frustration produces a change 
in the constructiveness of play activities (i.e., 
of the degree of organization and differentiation 
in the play activities) equivalent to an average 
regression of 17.3 months of mental age. 

The dedifferentiation which was produced 
by the experiment is explained in terms of two 
related notions, which are developed at some 
length by Lewin (1951, Appendix, or 1941) : 
(1) The degree of differentiation of a whole 
(eg, the person) is inversely related to the 
strength of tension when this passes beyond 
certain limits that are determined by the 
strengths of boundaries of the subsystems com- 
prising the whole, and (2) a decrease in the 
variety of behavior must also occur if a part of 
the whole is kept in a fixed state. The amount 
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of decrease depends upon the extent of the fixed 
areas, their degree of centrality, and their diver- 
gence from the normal level. Frustration keeps 
a certain portion of the person in a state of 
more or less permanent tension and the variety 
of behavior therefore decreases. The change in 
degree of organization of play activities results 
from a tendency for diffuse spreading of tension 
when tension is high and thus a bypassing or 
disruption of organizational relations and, in 
addition, from the conflict for control over the 
perceptual-motor regions between the “frus- 
trated” tension system and the tension system 
related to playing with the less desirable toys. 

Let us conclude our discussion of tension sys- 
tems by pointing out that the concept of tension 
system and the various experimental studies on 
the recall and resumption of interrupted activi- 
ties, substitution, satiation, and frustration have 
direct relevance to many problems of social 
psychology (although, as yet, their relevance has 
not been fully exploited). The concept of 
tension system is applicable to socially derived 
needs and intentions, to motives which develop 
from belonging to a group and from participat- 
ing in group activities, to interpersonal influ- 
ences, etc. The various effects of tension upon 
psychological processes provide criteria for de- 
veloping insights into the social and group 
factors which produce individual motivation as 
well as insight into the social and group factors 
which facilitate the reduction of individual 
tensions. 

Dynamic properties of the psychological en- 
vironment. The construct force characterizes, 
for a given point of the life space, the direction 
and strength of the tendency to change. The 
combination of a number of forces acting at 
the same point at a given time is called the re- 
sultant force. Whenever a resultant force (dif- 
ferent from zero) exists, there is either a 
locomotion of the behaving self in the direction 
of that force or a change in cognitive structure 
equivalent to this locomotion, 

Lewin distinguishes between driving and re- 
straining forces. Driving forces correspond to 
a relation between at least two regions of the 
life space: the region of present activity and 
the region of a goal. Driving forces tend to 
lead to locomotion, hence the statements in the 
preceding paragraph, strictly speaking, refer 
only to driving forces. Restraining forces, as 
such, do not lead to locomotion, but they do 
influence the effect of driving forces. Any region 
which offers resistance to locomotion, i.e, any 
barrier to locomotion, is characterized by re- 
straining forces at its boundary. The strength 
of the restraining forces reflects the relation be- 
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tween the perceived nature of the barrier region 
and the perceived ability of the individual. 
The same social or physical obstacle corre- 
sponds, therefore, to different restraining forces 
for diflerent individuals. It is an important 
characteristic of the force field, which corre- 
sponds toa barrier, that it does not extend, or at 
least does not extend to any important degree, 
outside the region of the barrier. In other words, 
there are only forces at the barrier away from 
the barrier but not forces in any other region 
which have the direction away from the barrier. 

We have employed the term “valence” in our 
discussion of tension systems and of driving 
forces; the three constructs are intimately inter- 
related. A region within the life space of an 
individual which attracts or repulses is consid- 
ered to have valence: a region of positive valence 
attracts, a region of negative valence repels. 
Technically, a positive valence corresponds to 
a force field where all forces are directed toward 
the same region — the region of positive val- 
ence. A negative valence corresponds to a force 
field where all forces have the direction away 
from the same region — the region of negative 
valence. 

The discussion of the relation among tension, 
valence, and force may be clarified by comparing 
two types of theories. The first type attempts to 
explain behavior as a result of the inner state 
of the person, including the inner stimuli (which 
in turn may be due to stimuli from outside) . 
This view is characteristic of many of the so- 
called physiological theories. It involves a deri- 
vation of behavior including directed locomo- 
tions from the state of inner personal regions. 
Lewin, advancing the second type of theory, 
holds 


. that such an explanation of behavior is 
not possible . . . 

(1) Forces for locomotions cannot be derived 
logically from tension in the person. 

(2) There exists a relation between certain 
tensions of the inner personal regions of the per- 
son and certain valences in the environment: as 
the tension increases, the strength of the valence 
tends to increase. However, the existence and the 
strength of the valence does not depend only 
upon the tension of the person but also upon 
certain nonpsychological "alien" factors . . . More 
specifically, the valence Va (G) which an object 
or activity G possesses for a person at a given 
time depends upon the character and state of the 
person P, and upon the perceived nature of the 
object or activity G. 

(3) In the same indirect way tension and forces 
for locomotion are related . . . the force is a 
function of the [strength of the] valence (G) and 
of the relative position of P and G (the strength of 
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the force increases as the strength of the valence 
increases and as the distance between the person 
and the valent region decreases] . . . In other 
words, the force for a locomotion depends on the 
need or the tension of the person, on those non- 
psychological factors which affect the existence of 
a valence, and on the relative position of the 
person and the valence. 

. .. Instead of linking the need directly to the 
motoric, the need is linked with certain properties 
of the [psychological] environment. The [psycho- 
logical] environment then determines the mo- 
toric. (Lewin, 1938, pp. 106-108) 


The dynamic concepts which Lewin has uti- 
lized to characterize the psychological environ- 
ment have been brilliantly employed, in 
conjunction with his topological concepts, to 
give insight into the nature of different types 
of conflict situations. Lewin's article, The Psy- 
chological Situations of Reward and Punish- 
ment (Lewin, 1935b, Ch. IV) should be read 
in its entirety to obtain the full richness of his 
important discussion. Here we can give only 
the essential points of his distinctions. 

Lewin first distinguishes between the interest 
and the punishment situation. He points out 
that one can represent the fact that a person is 
interested in a task by a positive valence or a 
field of forces which all have the direction to- 
ward the goal. Hence, if difficulties force the 
person out of his original direction to the goal, 
a new vector in the direction of the goal comes 
into play and initiates behavior toward the goal. 
The behavior thus makes a pronounced goal- 
seeking impression. In the punishment situa- 
tion, the individual finds himself between two 
negative valences arising from the unpleasant 
task and the threat of punishment for failure 
to do the task. If the threat of punishment is 
to be effective, the forces emanating from it 
must not only be strong enough to overcome 
the forces away from the unpleasant task but 
they must be so distributed as to continuously 
hold the person within the field of the task. 
When the forces emanating from the threat of 
punishment do not have this distribution or if 
they are not continuously active, the occurrence 
of a difficulty in pursuing the unpleasant task 
(unlike the case in the interest situation) will 
tend to push the individual in a direction op- 
posite to that of the task to be completed. The 
punishment situation lacks an educationally 
important property that the interest situation 
possesses: when the child is thrust away by diffi- 
culties he does not tend to resume of himself 
the direction of the task. 

Lewin pointed out three fundamental types 
of conflict situation, the punishment situation 
being an example of one of the major types. 
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A conflict is characterized as a situation in which 
oppositely directed, simultaneously acting forces 
of about equal strength play upon the indi- 
vidual. The three fundamental types of conflict 
are: 

1. The individual stands midway between 
two positive valences of approximately equal 
strength. A classical instance of this sort is that 
of Buridan’s ass starving between two stacks of 
hay. However, the classical instance is highly 
unlikely, since this type of conflict is a condition 
of a labile or unstable equilibrium. As the 
individual, due to the play of chance factors, 
moves from the point of equilibrium toward one 
rather than the other goal region the resultant 
force toward that region increases and, hence, 
he will continue to move toward that region 
(and, therefore, away from the point of equilib- 
rium). This follows from the fact that the 
strength of force toward a goal region decreases 
with increasing distance from the goal. Most 
instances of apparent conflict between two posi- 
tive valences (e.g., between going to the theater ' 
and going to the opera) are actually conflict 
between regions each of which is simultane- 
ously positive and negative in valence. One 
wants to go to the opera but one does not want 
to miss going to the theater, or one wants to go 
to the theater but one does not want to miss 
the opera — doing one precludes doing the 
other. 

2. The second fundamental type of conflict 
situation occurs when the individual finds him- 
self between two approximately equal negative 
valences. The punishment situation is an ex- 
ample. This type of conflict situation is very 
much influenced by the structure of the situa- 
tion. Let us illustrate three types of conflict 
between negative valences: (a) A situation 
where the individual is between two negative 
valences but there are no restraints keeping him 
in the situation — e.g., a girl who will have to 
marry an unpleasant suitor or become an im- 
poverished spinster if she remains in her village 
— there is nothing to prevent her from leaving 
her village; (b) a situation where the individual 
is between two negative valences but he cannot 
leave the field — e.g., a group member who is 
faced with the prospect of losing social status 
or of performing an unpleasant task — his situa- 
tion is such that it is impossible for him to 
leave the group; (c) a situation where the 
individual is in a region of negative valence and 
‘an only leave it by going through another re- 
gion of negative valence — e.g., a man is in the 
negative region of being cited for contempt of 
Congress for not testifying whether or not some 
of his acquaintances were members of the Com- 
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munist party — to purge himself of contempt 
he must engage in the unpleasant activity of 
being an “informer.” 

It is evident that the situation depicted in 
(a) will lead to a going "out of the field." Escape 
behavior will follow from the fact that the re- 
sultant of the forces away from the two regions 
of negative valence will tend to push the person 
in a direction that is perpendicular to the path 
connecting the two negative regions. Only if 
restraints prevent the individual from leaving 
the field will such a situation result in more 
than momentary conflict. Restraints as in (b) 
introduce a conflict between the driving forces 
related to the negative valence and the re- 
straining forces related to the restraining bar- 
rier. There is a tendency for a barrier to 
acquire a negative valence which increases with 
the number of unsuccessful attempts to cross 
it and which, finally, is sufficiently strong to 
prevent the individual from approaching it (Fa- 
jans, 1933a, b). Thus, the conflict between 
driving and restraining force is replaced by a 
conflict between driving forces as in (c) above. 
This fact is particularly important for social 
psychology since, in many situations of life, 
the barriers are social. When a person turns 
against the barrier, he is in effect directing him- 
self against the will and power of the person (s) 
or social group to whom the erection of the 
barrier is due. That is, when there is no way 
to escape the conflict between negative valences 
except by overcoming social barriers, the bar- 
riers and the people from whom he perceives 
that they derive will take on negative valence 
as the individual is unable to escape. One of 
the effects of the threat of punishment is to cre- 
ate a situation in which the individual and the 
creator of the barrier stand against each other 
as enemies. 

3. The third fundamental type of conflict 
situation occurs when the individual is exposed 
to opposing forces deriving from a positive and 
negative valence. One can distinguish at least 
three different forms of this conflict: (a) The 
situation in which a region has both positive 
and negative valences (the Freudian concept of 
ambivalence is subsumable under this variety 
of conflict) , e.g., a person wishes to join a social 
group but he fears that being a member will 
be too expensive. (b) The situation in which 
a person is encircled by (but is not actually 
in) a negative or a barrier region and is at- 
tracted to a goal which is outside the negative 
or barrier region — e.g., a person who has to 
go through the unpleasant ordeal of leaving his 
home or his group in order to pursue some 
desired activity. The mature of this situation 


is such that the region of the person's present 
activity tends to acquire negative valence as 
long as the region which encircles the person 
hinders locomotion toward desired outer goals. 
Thus, being a member of a minority group or 
being in a ghetto or in a prison often takes 
on negative valence, apart from the inherent 
characteristics of the region, because one can 
get to desired goals from the region only by 
passing through an encircling region of nega- 
tive valence. (c) The situation in which a 
region of positive valence is encircled by or 
is accessible only through a region of negative 
valence. This type of situation differs from (b) 
in that the region of positive valence rather 
than the region in which the person is to be 
found is encircled by the negative valence. The 
"reward situation," in which the individual is 
granted a reward only if he performs a task. (the 
task is unpleasant) , is an example of this type 
of conflict. Similarly, the initiation rites in 
many social groups — e.g., the hazing proced- 
ures of college fraternities — may be viewed as 
exemplifying this kind of conflict. 

Lewin (1951b, p. 259, or 1946a), as well as 
Miller (1944), has pointed out that the forces 
corresponding to a negative valence tend to de- 
crease more rapidly as a function of psychologi- 
cal distance than do the forces corresponding 
to a positive valence. The amount of decrease 
depends also upon the nature of the region 
which has a positive or negative valence. It is 
different, for example, in the case of a dan- 
gerous animal which can move about than it 
is in the case of an immovable unpleasant 
object. The difference in the gradients of de- 
crease for forces deriving from positive and 
negative valences accounts for the fact that a 
strong fear or a strong tendency to withdraw 
may be taken as evidence of a strong desire for 
the goal despite the apparent paradox. Only 
with a highly positive goal will the equilibrium 
point between approach and avoidance tenden- 
cies be close enough to the negatively valent 
region to produce a strong force away from the 
goal. On the other hand, strengthening the 
negative valence of a region may very well have 
the effect of weakening the forces in conflict, 
since the equilibrium point (ie, the point 
where the force away from the negative valence 
and the force toward the positive valence are 
of equal strength and opposite direction) may 
be pushed a considerable distance away from 
the valent regions. 

In our earlier discussion of overlapping situa- 
tions, we pointed out that one could analyze 
conflict in terms of overlapping situations. The 
usefulness of characterizing conflict in terms 
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of overlapping situations becomes evident when 
one considers more broadly choice situations or 
decision-making situations (of which conflict 
might be considered a subtype). Here it be- 
comes apparent that the relative potency of 
each of the overlapping situations, as well as 
the strength of the conflicting valences and the 
relative position of the person in relation to 
the valent regions, is an important factor de- 
termining choice. Potency may be viewed as a 
factor influencing the strength of the effective 
valence or effective force. Even if a goal is 
highly attractive it will not elicit behavior if 
the potency of the situation to which the goal is 
linked is very low. (The combined potency of 
all the situations in which an individual is si- 
multaneously involved is considered to be equal 
to 1; any one situation will vary between 0 
and ].) Escalona (1940) has indicated that one 
may consider the potency of a situation as vary- 
ing directly with the perceived probability of 
success in obtaining the goal. While it is clear 
that factors other than the subjective probabil- 
ity of success may influence "potency," these 
have never been adequately discussed in the 
writings of Lewin or of his students. 
"Reaching a decision" when there are equally 
attractive and opposing goals may be thought 
of as a process which raises one of the alterna- 
tive paths to a stable, dominant potency. Ex- 
perimental results have indicated that, other 
things being equal: the decision time is greater 
in choice between two negatives as compared 
with two positive valences (Barker, 1942), and 
that it increases the more the opposing situa- 


tions are of equal potency (Escalona, 1940), 
the more the opposing forces are of equal 
strength (Cartwright, 1941), the stronger the 
opposing forces (Barker, 1942), and as a func- 
tion of certain personality characteristics, Cart- 
wright and Festinger (1943), by conceiving the 
region of decision between two positive valences 
to be bound by restraining forces against leaving 
the decision area and by assuming that the 
probability of choosing one alternative or an- 
other is distributed in the form of the normal 
curve, have developed a rather neat quantitative 
theory of decision from which many of the re- 
sults of classical psychophysical studies of choice 
can be derived. 

Let us sum up our discussion of the dynamic 
features of the psychological environment by 
pointing out that Lewin employs three main 
concepts — valence, driving force, and restrain- 
ing force — supplemented by the concept of 
potency. These concepts are dealt with most 
fully in his monograph, The Conceptual Rep- 
resentation and the Measurement of Psycho- 
logical Forces. In this monograph, he also 
considers, as its title suggests, the problem of 
the measurement of psychological forces. He 
points out that the strength of a force can be 
measured either by (1) the strength of opposed 
driving or restraining forces, (2) by the rela- 
tive persistence of directed activity, and (3) by 
the velocity of locomotion or of restructuring. 
Lewin's monograph should be consulted for a 
detailed discussion of these various methods for 
ascertaining the strength of a force. 


Concerts DEALING WITH CHANGE IN THE PSYCHOLOGICAL ENVIRONMENT 


Although little of Lewin's work deals with 
“learning” as traditionally conceived, a good 
deal of the research by Lewin and his students 
has relevance to "learning" more broadly con- 
ceived as any change in the person which 
produces a change in his psychological environ- 
ment. Lewin (1951b, p. 67, or 1942) writes: 


Within what is called learning, we have to dis- 
tinguish at least the following types of changes: 
(1) learning as a change in cognitive structure 
(knowledge), (2) learning as a change in motiva- 
tion (learning to like or dislike), (3) learning as 
a change in group belongingness or ideology (this 
is an important aspect of growing into a culture) , 
(4) learning in the meaning of voluntary control 
of the body musculature. 


It is beyond the purpose of this chapter to 
discuss in any detail the various types of changes 
which might be called "learning." Lewin's writ- 
ings on learning were largely confined to a study 


of “learning” as a change in valences and values, 
and here we shall focus our attention on this 
area. However, insofar as Lewin concerned 
himself with problems of change in cognitive 
structure, his viewpoint was sympathetic to Tol- 
man's approach to learning (White, 1943). He 
emphasized that both differentiation and re- 
structuring (in the sense of separating certain 
regions which have been connected and con- 
necting regions which have been separated) are 
key processes in cognitive change. Further, he 
stressed that changes in the cognitive structure 
may occur in any part of the individual's life 
space, including the psychological future, the 
psychological present, or the psychological past. 
Ft may occur on the reality level or on the irreal- 
ity level of each of these sections of the life 
space. ` 

In preceding sections of this chapter, we have 
discussed many studies bearing on factors af- 
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fecting changes in needs and valences — e.g., 
the studies on satiation and substitution. Here 
we shall confine our discussion to three topics: 
level of aspiration, socially induced changes, and 
social perception. 

Level of aspiration. Perhaps no other area 
of research that Lewin and his students have 
opened to experimental investigation has been 
the subject of so many studies as that of level 
of aspiration. The level of aspiration may be 
defined as the degree of difficulty of the goal 
toward which the person is striving. The con- 
cept of level of aspiration is relevant only when 
there is a perceived range of difficulty in the 
attainment of possible goals and there is varia- 
tion in valence among the goals along the range 
of difficulty. 

In discussing the level of aspiration, it may 
be helpful to consider a sequence of events 
which is typical for many of the experimental 
studies in this area: (1) A subject plays a game 
(or performs a task) in which he can obtain a 
score (e.g., throwing darts at a target) ; (2) after 
playing the game and obtaining a given score, 
he is asked to tell what score he will undertake 
to make the next time he plays; (3) he then 
plays the game again and achieves another score; 
(4) he reacts to his second performance with 
feelings of success or failure, with a continuing 
or new level of aspiration, etc. In the foregoing 
sequence, point (2) (setting of the level of 
aspiration) and point (4) (reaction to achieve- 
ment) are particularly significant for the dy- 
namics of the level of aspiration. 

In outline form, the theory of the level of 
aspiration is rather simple (Lewin, Dembo, Fes- 
tinger, and Sears, 1944). It states that the 
resultant valence of any level of difficulty will 
be equal to the sum of the valence of achieving 
success times the subjective probability of suc- 
cess minus the valence of failure times the 
subjective probability of failure, The level of 
aspiration (ie, the goal an individual will 
undertake to achieve) will be the level of diffi- 
culty that has the highest resultant positive 
valence. The subjective experience of success 
or failure is determined by the relation of the 
individual's performance to his level of aspira- 
tion (providing, of course, that the performance 
is seen to be self-accomplished) and not simply 
by his absolute accomplishments. 

Experimental work on the level of aspiration 
has brought out the variety of influences which 
affect the positive and negative valences cf 
different levels of difficulty. It has indicated 
that cultural and group factors establish scales 
of reference which help to determine the rela- 
tive attractiveness of different points along a 
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difficulty continuum. Some of these influences 
are rather stable and permanent in their effects. 
It has been found, for example, that most people 
of western culture, under the pervasive cultural 
pressures toward “self-improvement,” when 
first exposed to a level of aspiration situation 
give initially a level of aspiration which is above 
the previous performance score, and under most 
conditions tend to keep their level of aspiration 
higher than their previous performance. In 
addition to broad cultural factors, the individ- 
ual's level of aspiration in a task is likely to be 
very much influenced by the standards of the 
groups to which he belongs (Anderson and 
Brandt, 1939; Hilgard, Sait, and Magaret, 1940). 
The nature of the scales of reference set up by 
different group standards may vary. 


In some cases, for instance, in case of the 
ideology underlying the college term “Gentleman 
C," the group standard is equivalent to the maxi- 
mum valence on the scale of success: to be either 
above or below this standard is considered less 
desirable than the standard. The fashion, par- 
ticularly in democratic countries, frequently fol- 
lows a similar pattern of an optimum rather than 
a maximum of elegance as the most desirable level. 
In other cases, the group standard merely indicates 
a level at which the valence gradient is particu- 
larly steep: there is little success valence and much 

.negative valence of failure immediately below 
the group standard, and much success and little 
failure valence directly above group standard. 
(Lewin, Dembo, Festinger, Sears, p. 368) 


Reference scales do not come only from mem- 
bership in a definitely structured social group, 
for they also may reflect the influence of one's 
selfimage, other individuals, or groups that 
either establish certain standards for perform- 
ance or that serve as models for evaluating 
self-performance. "Thus, the level of aspiration 
of a college student with respect to an intel- 
lectual task will vary depending upon whether 
he is told that a given score was obtained by the 
average high school student, the average college 
student, or the average graduate student (Fes- 
tinger, 19422, b). 

Research has given some insight into the 
factors determining the values on the scale of 
subjective probability. A main factor which 
determines the subjective probability of future 
success and failure is the past experience of 
the individual in regard to his ability to reach 
certain objectives. (Jucknat, 1937). If the in- 
dividual has had considerable experience with 
a given activity he will know pretty well what 
level he can expect to reach or not to reach and 
the gradient of values on the subjective proba- 
bility scale will be steep. However, it is not 
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only the average of past performances which 
determines an individual’s subjective probabil- 
ity scale, but also the trend — whether he is 
improving, getting worse, or remaining the 
same. Furthermore, there is experimental evi- 
dence to indicate that the last or most recent 
success or failure has a particularly great in- 
fluence on the individual's expectation of his 
future achievement level. In addition, there is 
evidence that the subjective probability scales 
as well as the performance of others can in- 
fluence the subjects own probability scale. 
Personality factors — e.g., self-confidence — may 
also influence subjective probability. 

The level of aspiration theory has widespread 
implications for many social phenomena. It 
gives insight into the reasons for social apathy 
in the face of pressing political and interna- 
tional problems. People are not likely to at- 
tempt to achieve even highly valued objectives 
when they see no way of attaining them. Simi- 
larly, it sheds some light upon why social revo- 
lution tends to occur only after there has been 
a slight improvement in the situation of the 
oppressed groups — the improvement raises 
their level of aspiration, making goals which 
were once viewed as unattainable now perceived 
as realistic possibilities. 

The level of aspiration technique suggests 
itself as an instrument to compare different cul- 
tures and to characterize their system of values 
in a systematic way. Such questions as op- 
timism-pessimism, the need for success, the fear 
of failure, the influence of specific group stand- 
ards, etc., seem open to the use of this type of 
technique. 

Socially induced changes. Lewin makes a 
distinction between changes due to forces which 
are "imposed" upon the individual and changes 
which reflect directly the individual's own 
needs, In this connection, it is relevant to in- 
troduce the concept of powerfield which, al- 
though not well defined, has been employed in 
many social psychological investigations (Wiehe, 
1935; Arsenian, 1943; Frank, 1944; Lippitt, 
1940; Cartwright, 1950). A powerfield is an 
inducing field; it can induce changes in the life 
space within its area of influence. The induced 
changes may be with respect to the valence of 
regions in the life space or they may be cog- 
nitive in nature. 

The source of a powerfield is usually but not 
necessarily a person. It is possible to speak of 
a person's area of direct social influence as his 
powerfield and to speak of the changes which 
can be or are induced in the life spaces of other 
people by this person's powerfield. For ex- 
ample, in a situation where workers have no 
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intrinsic interest in their work and do not view 
it as a means to some important goal, they 
would be inclined to loaf were it not for the 
presence of a supervisor who can induce forces 
in the direction of performing work. When the 
supervisor is not psychologically present and 
when therefore his powerfield is absent, the 
workers will resume their loafing. 

'The distinction between own and induced 
forces has been found to be useful in explaining 
some of the differences in behavior under auto- 
cratic and democratic leadership (Lippitt and 
White, 1943). Thus, children in a club when 
led by authoritarian leaders (who determined 
policy, dictated activities, and were arbitrary 
and personal in evaluation of activities) tended 
to develop little of their own motivation with 
respect to club activities. Although the children 
worked productively when the leader was pres- 
ent (ie, when his powerfield was psychologi- 
cally effective) , the lack of personal motivation 
toward group goals clearly evidenced itself in 
(1) change of behavior when the leader left 
the club, (2) absence of motivation when the 
leader arrived late, (3) lack of carefulness in 
the work, (4) lack of initiative in offering 
spontaneous suggestions in regard to club proj- 
ects, (5) lack of pride in the products of club 
effort. 

The distinction between own and induced 
forces has also been employed to give insight 
into why workers are usually happier and more 
productive when they can farticipate in the 
decisions which affect their work. Apart from 
other considerations, participation in goal set- 
ting is more likely to create own forces toward 
the goal, and thus there will be no necessity to 
exert continuous social influence to induce 
forces in the direction of the work goal or to 
induce restraining forces to prevent the indi- 
vidual from leaving the work region (Coch and 
French, 1948; McGregor, 1944) . 

A powerfield may be characterized not only 
in terms of its perceived source (e.g, a given 
person, a group, a personal value, a law) but 
also in terms of: the regions that the powerfield 
can affect in the individual's life space (eg, a 
mathematician may influence your views about 
the soundness of a mathematical proof but not 
about the soundness of a political viewpoint) ; 
its strength or power as indicated in the magni- 
tude of the changes which it can induce; the 
conditions under which a powerfield is effective 
(e.g, a child is unlikely to obey his father if 
his mother is telling him to do the opposite) ; 
the nature of the changes induced by the power- 
field (eg, whether barriers or valences are 
affected, whether the induced forces act upon 
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the behaving self or regions of the psychological 
environment); whether or not being in the 
powerfield of another person is positively or 
negatively valent (e.g., does an individual like 
to have decisions made for him or does he like 
to make them himself) ; the perceived qualities 
of the source of the powerfield — friendly or 
hostile, personal or impersonal; the degree of 
correspondence or conflict between induced and 
own forces; the attributes of the source of the 
powerfield which give rise to its power (e.g., 
physical strength, social role, personal attrac- 
tiveness) . 

Social influence and socially induced change 
have been the foci of a series of investigations 
conducted under the direction of Ronald Lip- 
pitt and Fritz Redl (Grosser, Polansky, and 
Lippitt, 1951; Polansky, et al., 1949; Polansky, 
Lippitt, and Redl, 1950a, b; Lippitt, Polansky, 
and Rosen, 1952). These studies were con- 
ducted in three children's camps: a camp for 
emotionally disturbed boys, a camp for emo- 
tionally disturbed girls, and a camp for middle- 
class, nondisturbed boys. Two different types 
of social influence processes were investigated: 
behavioral contagion, which is defined as "the 
spontaneous pickup or imitation by other chil- 
dren of a behavior initiated by one member of 
the group where the initiator does not display 
any intention of getting the others to do what 
he did," and direct influence, which is indicated 
by behavior which has "the manifest objective 
of affecting the behavior of another member of 
the group." Various kinds of data were col- 
lected through sociometric questionnaires and 
through direct observation of the children in 
their activities. The data included measures of 
the power attributed by other children to each 
child, self-perception of power, of frequency 
of initiation of behavioral contagion and of 
direct influence, frequency of pickup of con- 
tagion and of recipiency of direct influence. 

Some of the major results may be summarized 
as follows: (1) A child is more likely to “con- 
tage" from the behavior of another group 
member the higher the attributed power of the 
other group member. (2) A child is more likely 
to accept direct attempts to influence him the 
higher the attributed power of the group mem- 
ber trying to influence him. (3) The average 
child tends to initiate more deferential, ap- 
proval-seeking behavior toward group members 
with high attributed power than toward those 
whose attributed power is low. (4) The more 
attributed power a child perceives himself to 
have the more likely he is to initiate direct- 
influence attempts. (5) The children with high 
attributed power make more attempts to influ- 


ence others and are more successful in these 
attempts than the children with low power. 
(6) While neither “height” nor "weight" have 
an unequivocal relationship to attributed 
power, children who have the highest attributed 
power are also perceived to have the best fight- 
ing ability and to have superiority in "camp- 
craft skill" (7) Children who have high 
attributed power are liked better and identified 
with more than are other group members. 

Social perception. Heider (1953) has used 
the concept “heteronomous event" to include 
the phenomena referred to by the concept of 
powerfield but more broadly to refer to all 
changes whose source does not lie in the person 
but in the environment. He points out that the 
psychological significance of a heteronomous 
event is very much influenced by its perceived 
source. Thus, you are more likely to be angry 
when somebody steps on your toes if you see 
this event as having its source in the inten- 
tions of another person than if you see it as 
one of those things which happen in a crowded 
bus. The focus in the study of social percep- 
tion, as Heider defines it, is not so much the 
study of social influences upon the perception 
of the physical world but rather the study of 
the conditions which influence our perception 
of other people (groups, etc.) and which deter- 
mine the characteristics and relations that we 
attribute to them (Heider, 1944; Heider and 
Simmel, 1944; Heider, 1946) . 

Heider indicates that the attribution of re- 
sponsibility to people as sources of events serves 
the function of providing a cognitive stability 
to our psychological environment. ‘The motives 
and other personality characteristics of other 
persons provide the underlying causes which 
give coherence and meaning to the diverse “‘sur- 
face facts” with which we are confronted, The 
life space would be in a state of confusion if 
cognitive awareness and meaning were deter- 
mined by the molecular events or proximal 
facts, e.g., the physical movements of parts of 
the body, without regard to the more distal, 
underlying causes which give rise to and deter- 
mine the direction and significance of the 
movements. In effect, Heider is asserting that 
people develop their own “naive” psychology as 
a means of giving meaning and cognitive or- 
ganization to the surface actions of others, In 
this naive or everyday psychology, there are 
concepts which deal with perception, action, 
wanting, trying, being able to, experiencing or 
suffering, ought and may, attitudes, belonging. 
Much of the task in the study of social percep- 
tions may be defined as that of determining the 
conditions which govern the development and 
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application of these concepts in our perception 
of other people. 

Heider indicates that the attribution of re- 
sponsibility for an event to another always 
implies that the person and event are seen as 
“belonging” together in a unit. In the most 
primitive, global form of attribution, a person 
is seen as the source of an event merely when 
the event can in any way be seen as being con- 
nected with him. With increasing sophistication 
in the attribution of responsibility the meaning 
of an event and person "belonging" together 
is determined by conceptions of the person’s 
abilities and his wants or intentions. Thus, a 
more sophisticated form is to attribute an event 
to the person if he were a necessary condition 
for the happening even if he could not have 
foreseen the outcome. A still more sophisticated 
form is attribution to a person of any effect for 
which he was a necessary condition, whether 
intended or unintended, as long as he could 
have foreseen the event. A more differentiated 
attribution occurs when the event is attributed 
to a person only if it was intended by the person. 
The most differentiated form occurs when there 
is perception that even the intentions of the 
person are dependent partly on his environ- 
ment. 

The nature of attribution which occurs in 
any particular instance will be influenced by 
the need to prevent cognitive imbalance. Cog- 
nitive stability requires a congruence among 
causal expectations with respect to related ob- 
jects. That is, for a state of complete cognitive 
harmony to exist, the various implications of a 
person’s expectations or judgments of any aspect 
of the cognized environment cannot contradict 
the implications of his expectations or judg- 
ments of any other aspect of the cognized en- 
vironment. Thus, if a person judges X to be 
of potential benefit to his welfare, he cannot 
at the same time judge that Y (which is also 
judged to be of benefit to his welfare) and X 
are antagonistic and still have a stable or bal- 
anced cognitive structure. (Let X and Y be 
things, people, the products of people, or the 
characteristics of people.) When the cognitive 
structure is in a state of imbalance or is threat- 
ened by imbalance, forces will arise to produce 
a tendency toward locomotion so as to change 
the psychological environment or to produce 
a tendency toward change in the cognition of 
the environment. Under conditions which do 
not permit locomotion, the tendency for cog- 
nitive change is enhanced. i 

In the preceding example of X and Y, cog- 
nitive change could occur by changing one's 
judgment of X or of Y, or of the relationship 


between X and Y, or by differentiating X or Y 
into independent or segregated subparts. Thus, 
instead of continuing to like both X and Y, who 
hate each other, one may come to dislike either 
X or Y, or to feel that they don't hate each other, 
or to feel that I like X because she is beautiful 
and that Y hates X because she is wealthy, and 
that there is no interaction directly or indirectly 
between the two characteristics of X — beauty 
and wealth. In general, the nature of the cog- 
nitive changes resulting from an imbalance will 
tend to produce the most congruence and least 
changes in the perceptual-cognitive field. For a 
discussion of the Gestalt principles determining 
cognitive organization and reorganization and 
some of the implications of these principles for 
social perception see Krech and Crutchfield 
(1948) and Asch (1952). 

From assumptions which are similar to those 
outlined in the preceding paragraphs, and with 
three hypotheses about the conditions of cog- 
nitive imbalance, Heider has been able to de- 
velop a keen insight into some of the conditions 
which determine our perceptions in interper- 
sonal situations. He indicates that: (a) In re- 
gard to attitudes directed toward the same 
entity, a balanced state exists if positive (or 
negative) attitudes go together; a tendency 
exists to see a person as being positive or nega- 
tive in all respects, (b) In regard to attitudes 
toward an entity combined with belongingness, 
a balanced state exists if a person is united with 
the entities he likes and if he likes the entities 
he is united with; the converse is true for nega- 
tive attitudes. (c) If two entities are seen as 
parts of a unit, a balanced state will exist if 
the parts are seen to have the same dynamic 
character (positive or negative). If the two 
entities have different dynamic characters, a 
balanced state can exist if they are seen to be 
segregated. 

One may consider the above hypotheses to be 
true for various types of positive attitudes — 
ep, to like, to love, to esteem, to value — and 
for various types of negative attitudes. Simi- 
larly, the statements about belonging or being 
part of a unit are meant to refer to different 
types of units — €g., membership, belonging, 
causality, possession, similarity, or proximity. 
However, it should be emphasized that the un- 
balanced cognitive situation produces only a 
tendency to change; whether or not locomotion 
or cognitive change will actually occur depends 

.upon the strength of the other forces which are 
also at work in the situation. 

The reader may demonstrate for himself some 
of the implications of Heider's hypotheses by 
considering three interrelated entities — one’s 
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self (p), another (0), and a third object (x). 
Specify the perceived attitude (positive or nega- 
tive) between any two of the entities and you 
can determine what the perceived relationship 
is likely to be between the other two. Thus, if 
p likes x, and p dislikes 0, then one could pre- 
dict a tendency to perceive o as disliking x; if 
p likes x and also likes o, one can predict 
a tendency to perceive o as liking x. The 
reader may introduce some additional factors 
by considering the relation of belongingness. 
For example, given that p likes o, and that o 
caused x, one could predict a tendency for p 
to like x, and so forth. The reader might find 
it of particular interest to consider x as an as- 
pect of the self and to see some of the 
predictions one would make, from Heider's 
hypotheses, about attitudes toward aspects of 
the self under varying conditions. 

We do not have the space to elaborate on 
the many insights which Heider's approach 
gives to social and self-perception and to the 
psychology of interpersonal relations. However, 
let us point out that the nature of "moral" or 
“ought” forces and of various socio-psychologi- 
cal emotions — e.g., envy, jealousy, revenge, 
altruism, benefiting, harming — are clarified 
by Heider's formulations. 

While as yet little research has been con- 
ducted to test the various implications of 
Heider's theory, several studies (Heider and 
Simmel, 1944; Horowitz, Lyons, and Perlmut- 
ter, 1951; Jordan, 1953) are of relevance and 


provide supporting evidence. The experiment 
by Heider and Simmel demonstrated the sig- 
nificance of spatial arrangements and of kinetic 
structures in determining the qualities and 
relationships we attribute to social objects. 
Their experiment gives some insight into the 
spatial and kinetic factors making for the at- 
tribution of relationship of belonging, of “ini- 
tiating" versus "receiving" an event, etc. The 
investigation by Horowitz, Lyons, and Perl- 
mutter tested some of the implications of 
Heider's hypothesis in a study of a discussion 
group and obtained data which support Heider's 
theory. Jordan, utilizing a questionnaire, had 
students rate 64 situations (each of which was 
composed of specified relationships among p, o, 
and x) for their degree of pleasantness-unpleas- 
antnes. His data, although suggesting some 
refinements in Heider's theory, provide support 
for Heider's notion that a situation of cog- 
nitive imbalance produces a state of tension. 
An ingenious experiment by Pepitone (1950) , 
although not conducted in the framework of 
Heider's theory, is relevant to some of the more 
complex interpersonal perceptions dealt with 
by Heider. This experiment indicates how the 
perception of another person is influenced not 
only by the other person’s attitude toward the 
perceiver but also by the other person's power 
to benefit or harm and by the perceiver's 
strength of motivation to receive the benefits 
which the other person can bestow. 


Action RESEARCH AND GROUP DYNAMICS 


Lewin's work on the study of motivation from 
the first heightened his awareness of the im- 
portance of social factors for practically all 
aspects of psychological life. Experimental 
studies on such topics as goal setting, conflict, 
level of aspiration, play, and recall and resump- 
tion of activities clearly revealed the effects of 
social influences. In his first explorations of 
social psychology, Lewin focused his attention 
upon the individual in his social environment. 
He analyzed the social influences playing upon 
the child (Lewin, 1931), the differences in na- 
tional «character between Americans and 
Germans (Lewin, 1936b, or 1948) the socio- 
psychological problems of being a member of 
a minority group (Lewin, 1935a, or 1948) and 
the situation of the adolescent (Lewin, 19392, 
or 1951b). These diverse studies stimulated an 
interest in the study of the conditions which 
affect group life. This interest saw its first fruits 
in the group atmosphere studies conducted at 
the University of Iowa and began to fructify in 


earnest with the establishment of the Research 
Center for Group Dynamics at the Massachu- 
setts Institute of Technology in 1945. 

Lewin's work on social psychology reflected 
two basic personal orientations which Lippitt 
(1947) has summed up in the term "scientific 
citizen." Action research and group dynamics, 
two terms which are closely associated with the 
work of Kurt Lewin in social psychology, in a 
sense reflect each of Lewin's personal orienta- 
tions. 


ACTION RESEARCH 


As a citizen deeply sensitive to the world in 
which he lived, aware of the personal tragedies 
caused by oppression and prejudice, Lewin 
devoted much of his scientific work to further- 
ing the understanding of the practical day-by- 
day problems of modern society. His book, Re- 
solving Social Conflicts, presents many of his 
socio-psychological analyses which deal with 
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such important social problems as the effects of 
prejudice, methods of facing oppression, conflict 
in industry, conflict in marriage, morale in time 
of war, changing prejudiced groups. Lewin felt 
that the social scientist could not only con- 
tribute to the solution of social problems but 
also that the study of attempts to produce 
change in social conditions would enable scien- 
tific insight into social processes which might 
not otherwise be attainable. These feelings led 
Lewin to help stimulate an interest in action 
research among various social agencies con- 
cerned with reducing prejudice. 

“Action research” is a term with many mean- 
ings (Chein, Cook, Harding, 1948) but let us 
here elaborate only on the two major comple- 
mentary values which Lewin felt might be 
advanced by action research. First of all, Lewin 
(1946a, p. 34, or 1948) pointed out that many 
social agencies and civic groups concerned with 
eliminating and preventing community prob- 
lems are inffective, despite their efforts and 
good will. “They feel in a fog on three counts: 
1. What is the present situation? 2. What are 
the dangers? 3. And most important of all, 
what shall we do?” 

One of the consequences of this unclearness 
is the lack of standards by which to measure 
progress. As Lewin (1946a, p. 35) points out: 


This lack of objective standards of achievement 
has two severe effects: 

1. It deprives the workers in intergroup rela- 
tions of their legitimate desire for satisfaction on 
a realistic basis. Under these circumstances, satis- 
faction or dissatisfaction with his own achieve- 
ment becomes mainly a question of temperament. 

2. In a field that lacks objective standards of 
achievement, no learning can take place. If we 
cannot judge whether an action has led forward 
or backward, if we have no criteria for evaluating 
the relation between effort and achievement, there 
is nothing to prevent us from making the wrong 
conclusions and to encourage the wrong work 
habits. Realistic fact-finding and evaluation is a 
prerequisite for any learning. 


In addition to the value that action research 
might have for social agencies, Lewin felt that 
the linking of research to social action might 
give the social scientist access to basic social 
processes which otherwise he would be unable 
to study. Further, Lewin's orientation to dy- 
namics in individual and group psychology led 
him to the conclusion that change studies were 
necessary to reveal underlying processes. Since 
social scientists are rarely in the position to 
create social change of their own initiative, they 
have much to gain through cooperation with 
social agencies that attempt to produce social 
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and community changes. Through such co- 
operation, the social scientist is enabled to do 
research in the community and he escapes the 
limitations of social psychological research on 
the much abused “college sophomore.” Con- 
siderable experience with action research, 
however, has indicated that the goals of action 
and the goals of research may often be incom- 
patible. The danger that confronts the research 
worker, in such situations, is the possibility 
that his research design or methodology will 
be sacrificed to the achievement of the social 
action objective. 

By working closely with various social agen- 
cies, educational institutions, and industries 
Lewin was able to stimulate a number of field 
experiments on such important topics as leader- 
ship training (Bavelas, 1942), intergroup rela- 
tions (Lippitt, 1949), and group productivity 
(Coch and French, 1948). Lewin's emphasis 
on field experiments in diversified community 
settings, his stimulation of social psychologists 
to leave the sheltered comfort of the college cam- 
pus and to carry their experimental procedures 
into settlement houses, workshops, offices, and 
factories, are not the least of his contributions 
to this field. 


GROUP DYNAMICS 


While Lewin stressed the importance of re- 
lating action and research, he was perhaps even 
more deeply concerned with the relationship 
between research and theory. Since the 1930's, 
and stimulated by World War II, developments 
in social psychology, sociology, and cultural 
anthropology have brought these apparently 
separate disciplines into closer linkage. Lewin 
(19472, b) felt that a fruitful integration of the 
social sciences could occur, however, only by 
moving from the description of social and group 
life to a theoretical analysis and experimental 
study of the dynamic problems of changing 
group life. This theoretical analysis and ex- 
perimental study he referred to by the title 
group dynamics. 

It is well to realize that Lewin's first writings 
in the area of group dynamics (Lewin, 1939a, 
b) occurred at a time when psychologists com- 
monly denied the existence or reality of 
"groups." Only "individuals" were real, and 
to refer to characteristics of groups — e.g. 
"group atmosphere," "group goals," etc. — was 
wiewed as being "nonscientific" or “mystical.” 
One of Lewin's major contributions was to help 
make the concept of group acceptable to psy- 
chologists, that is, to lead psychologists to accept 
the notion that groups, per se, have characteris- 
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tics, and that the behavior of an individual is 
greatly influenced by the various groups to 
which he belongs. 

Among psychologists, the taboo against be- 
lieving in the reality of the group was mainly 
shattered by demonstrating that “something 
could be done with” groups; that is, by handling 
groups experimentally (Lewin, 1948). The 
concept of group is, of course, not new; it has 
been the key concept in sociological writings 
for decades. Nevertheless, by performing ex- 
periments with groups, Lewin performed for 
psychologists the service of taking this concept 
from the plane of irreality to the plane of real- 
ity. For sociologists, the introduction of ex- 
perimental studies of groups also served a 
radical function by opening up many of the 
theoretical writings of sociologists to the possi- 
bility of experimental testing. Lewin and his 
students have helped to demonstrate that ex- 
perimental studies of small groups could shed 
light on large-scale social processes. The early 
experiments of Lippitt (1940), Lewin, Lippitt, 
and White (1939), French (1944), and Fes- 
tinger (1947) were instrumental in initiating 
and giving momentum to the experimental in- 
vestigations of group life, which are now ac- 
cepted as a commonplace in psychology and 
sociology. 

Concepts of group dynamics. Apart from 
papers dealing with group decision and social 
change (Lewin, 1947a and b or c and 1948), 
Lewin actually wrote very little on the theory 
of group dynamics. However, from the research 
investigations of his colleagues and students at 
the Research Center for Group Dynamics a 
formidable array of concepts has been gradually 
emerging. Such people as Bavelas, Cartwright, 
Festinger, French, and Lippitt have made lead- 
ing contributions, (The discussion of concepts 
in group dynamics that follows has been consid- 
erably stimulated by an unpublished document 
of Dorwin Cartwright's.) 

These concepts have been largely developed 
in isolation from the body of sociological litera- 
ture which deals with related topics. Neverthe- 
less, the sociologically oriented reader will be 
able to detect many parallels between the con- 
cepts of group dynamics and of sociology. 
Whether or not the isolation from prior so- 
ciological theorizing has been sensible is hard 
to judge. The group dynamicist might argue 
that the isolation has produced concepts which 
have stimulated experimental research studies 
— studies which did not flow from the more 
general sociological conceptions. The sociologist 
might argue that the studies of group dynami- 
cists might have been more sophisticated if they 


had been acquainted with the prior, relevant 
writings of sociologists. 

Let us begin our discussion of the concepts 
of group dynamics by briefly considering the 
concept of group. Lewin (1948, p. 54, or 1943d) 
wrote that: 


The essence of a group is not the similarity or 
dissimilarity of its members, but their interde- 
pendence. A group can be characterized as a 
"dynamical whole"; this means that a change in 
the state of any subpart changes the state of any 
other subpart. The degree of interdependence of 
the subparts of members of the group varies all 
the way from a “loose” mass to a compact unit. 


French (1944) pointed out that in addition 
to interdependence, membership in a group 
presupposes identification with the group. 
Deutsch (1949a) indicated that the interde- 
pendence is that of promotive or cooperative 
interdependence rather than, for example, com- 
petitive interdependence. Thus, a group may 
be defined tentatively as being composed of a 
set of members who mutually perceive them- 
selves to be cooperatively or promotively inter- 
dependent in some respect (s) and to some 
degree. Our discussion of groups will through- 
out refer to psychological groups rather than 
to collections of individuals who may be ob- 
jectively interdependent without, at the same 
time, being subjectively or psychologically 
interdependent. 

A group may be thought of in terms of its 
external relations (i.e. its relationship to its 
external environment) in a manner analogous 
to that of the behaving self in the psychological 
environment, or it may be considered in terms 
of its internal relations (ie, the relations 
among the members composing it) in a manner 
analogous to the discussion of the person and 
the tension systems composing it. In group 
dynamics, there has been very little discussion 
of the external relations of groups, of "group 
spaces." This is, of course, a profound neglect, 
comparable to the earlier tendency in psychol- 
ogy to consider the individual without relation 
to his environment. Most of the conceptualiza- 
tions and research in group dynamics have been 
on the internal characteristics of groups — on 
their internal structural and dynamic proper- 
ties. 

One of the key concepts, which has been 
the subject of much experimental investigation, 
is that of cohesiveness. Intuitively, cohesiveness 
refers to the forces which bind the parts of a 
group together and which, thus, resist disruptive 
influences. Hence, the study of conditions af- 
fecting group cohesiveness and of the effects 
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upon group functioning of variations in group 
cohesiveness is at the heart of the study of group 
life. 

Festinger, Schachter, and Back (1950) have 
defined cohesiveness, in terms of the group mem- 
ber, as “the total field of forces which act on 
members to remain in the group." The nature 
and strength of the forces acting upon a member 
to remain in the group may vary from member 
to member. These investigators make the sug- 
gestion that group cohesiveness "be related to 
the average magnitude of this force in all parts 
of the group." Deutsch (1949a) in a definition 
which has essentially the same implications as 
that of Festinger, Schachter, and Back, relates 
group cohesiveness to the degree of perceived 
cooperative interdependence among members 
and to the strength of goals about which the 
members are cooperatively interdependent. 
"Membership motive" in the individual is de- 
fined so as to be the counterpart of cohesiveness 
in the group. In the foregoing definitions, the 
relationship between the group attribute of 
cohesiveness and the individual attribute of 
membership motive (or cohesiveness) is.not 
adequately handled. It is obvious, however, 
that the average of the various individuals’ 
membership motives is an inadequate measure 
of group cohesiveness. Intuitively, group cohe- 
siveness will be affected by the distribution of 
membership motives in a group — i.e., whether 
there is much variation or little variation in 
membership motive among the members and 
whether "important" members ór "unimpor- 
tant" members have high or low membership 
motives. Parenthetically, it should be noted 
that in the definitions of cohesiveness the "forces 
to remain in the group" are a resultant of the 
forces to remain in the group and of the forces 
to leave the group. For simplicity of presenta- 
tion, the discussion of cohesiveness neglects the 
forces to leave the group. 

The force acting upon an individual to re- 
main in the group may derive from the attrac- 
tiveness or positive valence of the group, the 
negative valence of not being in the group, 
and/or the barriers or restraints against leaving 
the group; the forces acting upon the individual 
may be induced or own forces. Little research 
has been done to determine the group conse- 
quences of these different types of membership 
motivation, Back (1951), however, has shown 
that it makes no difference with certain aspects 
of group functioning whether attractiveness to 
a group is based upon (1) liking the other group 
members, (2) liking it because it is prestigeful 
to belong to it, or (3) liking the group because 
it is a means of obtaining personal goals. 
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In Back' experiment, subjects (in pairs) 
worked cooperatively in interpreting a set of 
pictures. The groups of pairs were formed so 
as to be more or less attractive to their members; 
the three different bases of attraction indicated 
in the preceding paragraph were employed. 
Back found that in the more cohesive groups, 
the members made more effort to reach agree- 
ment and were more influenced by the discus- 
sion than were members of the lesser cohesive 
groups no matter what the basis of attractive- 
nes. However, discussion was influenced by 
the source of attractiveness to the group. The 
behaviors of the people in the groups composed 
of people attracted to the group by a liking 
for the other group members, were more chatty 
than the other groups; by the prestige of the 
group, were more cautious and less related to 
other members; by the group as a means to a 
goal, were more impersonal and task-oriented. 

While there have been few direct studies of 
the factors determining group cohesiveness, a 
number of experimental investigations inci- 
dentally bear on this area. Thus Back (1951) 
found that he could produce high, as contrasted 
with low, cohesiveness in experimentally created 
groups by stressing to the subjects how much 
they would like each other, or how important 
it was for the group to do well on the task since 
the task was a test of ability, or by stressing the 
prestige of the group. French (1941) found that 
groups composed of students who belonged to 
the same athletic teams were more cohesive than 
groups composed of students not previously 
acquainted. Deutsch (1949b) and Levy (1953) 
found that groups rewarded on a cooperative 
basis were more cohesive than groups rewarded 
on a competitive basis. Lippitt and White 
(1948) in a study of the effects of autocratic, 
democratic and laissez faire group leadership in 
boys clubs found that democratically led 
groups were most cohesive. Schachter (1951) 
produced clubs with high cohesiveness by 
composing them of students who expressed mod- 
erate or high interest in their activities; he 
created low cohesive clubs by composing them 
of students who expressed little or no interest 
in their activities. Thibaut (1950), in the most 
direct study of factors affecting cohesiveness, 
found that being in a persistently high-status 
group increases cohesiveness; on the other hand, 
being in a persistently low-status group tends 
to decrease the attractiveness of one’s own group 
but also increases the hostility toward the high- 
status group. Those members of the low-status 
group whose hostility toward the high-status 
group is most aroused, the most central mem- 
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bers, may as a consequence of their rejection of 
the other group become more cohesive. 

Various measures of “cohesiveness” have been 
employed in the foregoing experimental inves- 
tigations: desire to remain in the group, the 
ratio of "we" remarks to "I" remarks during 
group discussion, ratings of friendliness, evalua- 
tions of the group and its product, acceptance 
of each other's ideas, in-group sociometric choice 
versus out-group sociometric choice, etc. Deutsch 
(19492), in a theoretical paper, provides a 
rationale for the use of a wide variety of meas- 
ures of membership motive by developing the 
hypotheses that members of more cohesive (co- 
operative) groups as compared with members 
of less cohesive (competitive) groups would, 
under conditions of successful locomotion: (a) 
be more ready to accept the actions of other 
group members as substitutable for simi- 
larly intended actions of their own (and 
therefore would not have to perform them 
also); (b) be more ready to accept inductions 
Oe, be influenced) by other members; and (c) 
be more likely to positively cathect the actions 
of other group members. From these core hy- 
potheses, with the addition of more specific 
assumptions, it is possible to derive the influ- 
ence of more or less cohesiveness upon many 
aspects of group functioning. Thus, from the 
substitutability hypothesis, it is possible to pre- 
dict that more specialization of function, more 
subdivision of activity, more diversity of mem- 
bership behavior would occur in the more 
cohesive groups. The inducibility hypothesis 
leads to the prediction that members of more 
cohesive groups would be more attentive to 
one another, be more understood by one an- 
other, be more influenced by one another, be 
more likely to change and have more internali- 
zation of group norms than members of less 
cohesive groups. The cathexis hypothesis leads 
to predictions of greater friendliness, greater 
ratio of in-group sociometric choices, etc., in 
the more cohesive groups. Data in a variety of 
experiments (Deutsch, 1949b; Levy, 1953; 
Schachter, 1951; Back, 1951; Gerard, 1953) sup- 
port the foregoing predictions. 

In a well-integrated program of research, Fes- 
tinger and his co-workers have developed a 
series of fertile hypotheses (Festinger, 1950) and 
conducted some ingenious experiments on the 
communication process within groups. In brief, 
these investigators have been concerned with 
three sources of pressures to communicate with- 
in groups: (a) communications arising from 
pow toward uniformity in a group (Back, 

; Festinger and Thibaut, 1951; Schachter, 


1981); ; (b) communications arising from forces 
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to locomote in a social structure (Kelley, 1951; 
Thibaut, 1950); and (c) communications aris- 
ing from the existence of emotional states (Thi- 
baut, 1950; Thibaut and Coules, 1952). Most 
of the theoretical writings and experimental 
work of Festinger and his associates have been 
concerned with (a) above. 

Festinger (1950) has indicated two major 
sources of pressures toward uniformity in a 
group: social reality and group locomotion. He 
indicates that when there is no simple objective 
basis for determining the validity of one’s be- 
liefs, one is dependent upon social reality (i.e., 
upon the consensus of judgment among people 
whose judgments one will respect) to establish 
confidence in one's beliefs. For example, the 
belief that “Negroes adversely affect property 
values when they move into a white neighbor- 
hood” is difficult for any white property owner 
to test in a simple, objective way; thus, he relies 
upon the judgment of others as a test of its valid- 
ity. Lack of agreement among members of a 
group provides an unstable basis for beliefs 
which depend upon social consensus for their 
support, and hence (in line with Heider's dis- 
cussion of the tendency toward cognitive bal- 
ance) forces will arise to produce uniformity. 
Pressures toward uniformity among members 
of a group may also arise because such uniform- 
ity is desirable or necessary in order for the 
group to locomote toward some goal. Greater 
uniformity in opinion within a group can be 
achieved in either of the following ways: (a) by 
actions (i.e., communications) that are directed 
at changing the positions the members occupy 
relative to one another — e.g, by attempting 
to influence others to change their opinions 
or by changing one's own views; or (b) by 
actions to make others incomparable in the 
sense that they are no longer effective as a com- 
parison for one' 5 opinions — Cg. by rejecting 
or excluding people with deviating opinions 
from the group. 

Experiments have shown that increasing the 
attraction to the group, and thus increasing the 
importance of the group as a comparison object, 
increases the amount of influence which is 
attempted and the amount of opinion change 
which occurs when there is opinion discrepancy 
within a group (Back, 1951). It has also been 
demonstrated that there will be more pressure 
for uniformity in a group, the more relevant 
or important the opinion is for the functioning 
of the group (Schachter, 1951; Festinger and 
Thibaut, 1951). Also, as may be expected from 
theoretical considerations, there is evidence that 
when pressures toward uniformity exist, the 
concern is mainly with those members of the 
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group who have opinions most divergent from 
one’s own. Thus, members exert influence 
mainly on those whose opinions are most diver- 
gent from their own (Festinger and Thibaut, 
1951; Schachter, 1951). While there has, as 
yet, been little research to indicate the con- 
ditions which will lead the pressure toward 
uniformity to manifest itself in a rejection of 
deviates rather than in an attempt to influence 
them to change, experiments by Gerard (1951) 
and by Festinger and Thibaut (1951) suggest 
that as the heterogeneity of a group increases, 
rejection of deviates also increases. 

The experiments revealing the tendency to 
direct communication toward deviants in the 
group, and through communication to exert 
social pressure upon them, provide support for 
Lewin’s theory of group decision and social 
change. Lewin (1947c) began his analysis of 
change by pointing out that the status quo 
in social life is not a static affair but a dynamic 
process that flows on but still keeps a recogniz- 
able form. He borrowed the term “quasi-sta- 
tionary equilibria” from physics to apply to 
such on-going processes, which are kept at their 
present level by fields of forces preventing a rise 
or fall. The field of forces in the neighborhood 
of the level of equilibrium presupposes that the 
forces against going higher than the equilibrium 
level increase with the amount of raising and 
that the forces against lowering increase (or 
remain constant) with the lowering. Thus, if 
we assume that a group standard is operating 
to determine the level of worker productivity 
in a factory, any attempt upon the part of the 
worker to deviate from the standard by higher 
productivity will only result in stronger forces 
being induced upon him by his co-workers to 
push him back into line. That is, as the Fes- 
tinger experiments have demonstrated, the 
deviant will be exposed to stronger pressures 
toward uniformity the more he deviates. How- 
ever, as Lewin points out, the gradient of forces 
may change at a distance from the equilibrium 
level so that after an individual has gone a 
certain distance from the equilibrium level, the 
forces may push him away rather than pull him 
toward the group standard. 

Lewin's analysis of the status quo as a quasi- 
stationary equilibrium has two major implica- 
tions. First of all, it points out that change 
from the status quo can be produced either by 
adding forces in the desired direction or by 
diminishing opposing forces. However, the two 
methods of producing change have different 
consequences: by adding forces, the process on 
the new level would be accompanied by a state 
of relatively high tension, since the strength of 
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forces in opposition would be greater; by de- 
creasing the opposing forces, the new level 
would be accompanied by lower tension. Sec- 
ondly, it highlights the difficulties of attempting 
to change individual conduct and attitudes that 
are rooted in groups by efforts which are di- 
rected at the individual and not his group. If 
one endeavors to change the prejudices of an 
individual without changing the prejudice of 
the group to which he belongs, the individual 
will either be estranged from his group or will 
be under pressure from his group to revert to 
his initial attitude. Isolated individuals may 
perhaps change their attitudes because of their 
individual experiences, but the person who is 
deeply enmeshed in the social life of his com- 
munity is unlikely to be able to resist the pres- 
sures to conform on matters of community im- 
portance if he wishes to continue in good 
standing. 

Considerations such as the foregoing have led 
to a series of experiments in various settings — 
in the school, with neighborhood groups, in 
industry, in an interracial workshop — on the 
relative efficacy of changing behavior by efforts 
directed at individuals or at a group (Lewin, 
1948). A typical procedure has been to com- 
pare the results of a lecture or individual in- 
struction in changing behavior with respect to 
the use of certain foods with the results of a 
group discussion on the merits of the food com- 
bined with a group decision favoring the use. 
Results have clearly indicated that the group 
decision method produces more change. How- 
ever, most of the experiments on group 
decision were unfortunately not designed 
in such a way as to rule out the possibility that 
such factors as interest or decision, per se, were 
not the factors producing the greater change. 
A more carefully controlled experiment by Ben- 
nett (1952) suggests that the apparent advan- 
tages of the group-discussion method result 
primarily from the fact that group discussion 
facilitates decision and perception of consensus. 
She concludes that group discussion, other 
things being equal, was not more effective than 
lecturing, nor did public identification of in- 
dividuals’ decisions contribute appreciably to 
obtained differences. In Bennett's experiment, 
the subjects were college students in introduc- 
tory psychology classes and the content of their 
lecture or group discussion was concerned with 
a request to volunteer as experimental subjects 
for an unspecified psychological experiment. 
The “groups” were not very cohesive (intro- 
ductory psychology sections) nor was the issue 
of volunteering group-rooted in the sense of 
being clearly relevant to group norms. The cu- 
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mulative evidence as well as everyday experi- 
ence supports Lewin's basic contention that to 
change group-rooted individual attitudes one 
must, in many instances, change the group to 
which the individual belongs. 

Horowitz (1950) performed an experiment 
which demonstrates some of the psychological 
consequences of group decision. He designed 
an experimental setting where it was possible 
for members to set up goals for the group and 
to perceive where the group stood in regard to 
the attainment of the goals. In the course of 
working toward the attainment of some of the 
group-agreed-upon goals the group was inter- 
rupted. In others, the group was allowed to 
complete the task. After working on a number 
of different tasks, a Zeigarnik-type test for mem- 
ory of completed and of interrupted tasks was 
given to all members of the group. The experi- 
ment was conducted in such a way that each 
individual indicated by vote whether or not he 
wanted to complete each task. The results of a 
secret ballot for each task were also announced 
to the group. (The experimenter actually an- 
nounced a falsified group vote in order to be 
able to study its effects under equated condi- 
tions) The results clearly indicated the 
motivational effect of the group-agreed-upon 
goals: in general, more interrupted than com- 
pleted tasks were recalled. In addition, there 
was clear evidence that the announced group 
vote (i.e. group decision) whether to complete 
or not to complete the task affected recall: when 
the announced group vote was to abandon the 
task recall was smaller than when the an- 
nounced group vote was to complete the task. 

Much of our discussion of group dynamics, 
so far, has considered the group as made up of 
homogeneous parts. In reality, of course, 
groups are differentiated structures, and the 
behavior of any group member will be largely 
determined by his position in the group struc- 
ture. Although the concepts of group structure 
and of position within the group have not been 
well defined in the writings of group dynami- 
cists, a good deal of research related to these 
concepts has taken place. Thus, Lewin, Lip- 
pitt, and White's (1939) pioneering study on 
the effects of different types of leadership be- 
havior has demonstrated how crucial the 
position of leadership is in determining the 
atmosphere of the group. Bavelas (1951) and 
Leavitt (1951) have demonstrated the signifi- 
cance of position in a communication network 
in determining member behavior and have in- 
dicated how the structure of a communication 
network may affect group productivity and indi- 
vidual satisfactions. Kelley (1951) has experi- 


mentally shown the significance of position in 
a status hierarchy in determining the nature and 
direction of communication. Polansky, Lip- 
pitt, and Redl (1950), in a study of behavioral 
contagion in groups, have explored the rela- 
tionship between status (as defined by various 
criteria in sociometric choice) and the ability 
to influence others in the group. Studies such 
as the foregoing provide the rationale for use 
of the communication process (who communi- 
cates with whom, about what, with what fre- 
quency, in what manner, in what circumstances, 
and with what effects) as a key instrument for 
characterizing group structure and for locating 
the occupants of various positions within this 
structure, 

The very diversity of these studies indicates 
some of the varied meanings that the term 
"position" may have. "Position" has been em- 
ployed to refer to the functions a member per- 
forms in a group, to an individual's locus in a 
communication network, to a person's ability 
to induce forces, to a person's prestige in the 
group, etc. Gerard and others (unpublished 
memorandum) provide insight into the reason 
for the diversity in meanings of "position." 
"Fhey have found it useful to conceptualize the 
position of a person in a group "as a locus 
which describes his relationship to other group 
members with respect to a single dimension. 
At any given time each member may have one 
and only one position on a specified dimension." 
A dimension refers to any group-psychological 
aspect which may be useful in describing the 
relations of one group member to another. 
Examples of dimensions are (a) being able to 
communicate with, (b) having power over, 
(c) having prestige for, (d) having more skill 
than. In some social settings one dimension 
may become more salient than others and thus 
one's locus on that dimension becomes most 
important for the individual. This saliency is 
termed dimension dominance; the dominant 
dimension is most powerful in determining a 
given range of behavior. Thus, to characterize 
adequately any group member's relations to 
others in the group over a period of time and 
in different social settings one will have to 
locate him along a number of dimensions, i.e., 
in a number of different positions. 

As we indicated in our earlier discussion of 
psychological ecology, a position in the group 
may be thought of as a "behavior setting" — 
i.e., as a part of the nonpsychological milieu 
which induces forces having certain magnitudes 
and directions upon an individual who is lo- 
cated in it. Behavior settings have their coercive 
and restraining powers as a result of a social 
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norm or group standard that defines the be- 
havior setting as a locus which is appropriate for 
particular kinds of behavior. It follows that the 
ability of a behavior setting or a position in 
the group to induce behavior will, in part, be 
dependent upon the strength of membership 
motive in the individual. 

Let me conclude the discussion of group dy- 
namics by reminding the reader that the experi- 
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mental study of small group functioning began 
in earnest a little over fifteen years ago. There 
has been an extremely rapid proliferation of 
studies and of theory since then. While Lewin 
and his students did much to give momentum 
to the development of “group dynamics,” it is 
no longer their exclusive domain — it is an in- 
tegral part of social psychology. 


SUMMARY 


In this chapter, we have attempted to present 
a survey of the theoretical approach to psychol- 
ogy which is commonly termed “field theory." 
While many theorists in psychology might be 
discussed under the title “field theory in social 
psychology," we have limited our discussion 
to the work of Kurt Lewin and the people who 
have been closely associated with him. We dis- 
cussed Lewin's approach to theorizing in some 
detail because of our belief that his extraordi- 
nary creativeness as theorist and as experimenter 
reflected “a way of thinking.” We also surveyed 


the basic concepts of Lewin's “topological and 
vector” psychology and some of the research 
studies which grew out of these concepts. Fi- 
nally, we discussed action research and group 
dynamics. 

The large sweep of Lewin's work, the brilliant 
innovations in many areas of psychology, cannot 
help but be impressive. Yet, like many trail 
blazers, the paths he opened up were left rough- 
hewn and unfinished. For orderly, safe passage 
brigades of scientific workers are required to 
fill in the holes and widen the paths. 
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CHAPTER 6 


Role Theory 
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Role theory attempts to conceptualize human 
conduct at a relatively complex level. In a sense 
it is an interdisciplinary theory in that its vari- 
ables are drawn from studies of culture, society, 
and personality. The broad conceptual units 
of the theory are role, the unit of culture; posi- 
tion, the unit of society; and self, the unit of 
personality. We define position as a system of 
role expectations. The discussion of this defi- 
nition will attempt to show the interrelatedness 
of role and position. 

Coordinate with role is the concept of the 
self. Its origins and its dimensions are described 
in cognitive terms. The ultimate units of the 
self are inferred qualities, the conceptualization 
of which is aided by the use of qualifying terms 
such as adjectives. The self-and-role-in-inter- 
action thesis prevails throughout the chapter 
and we shall report empirical observations that 
focus on the interactional nature of conduct. 

Since we hold the position that the value of 
a theory can be assessed only through empirical 
verification of hypotheses drawn from the the- 
ory, we review a number of representative 
studies, The results of these studies provide at 
least some evidence for the utility of theory. 

The theory set forth in this chapter leans 
heavily on the concepts of George Herbert 
Mead (1934). It may properly be called an 
interaction formulation, However, it differs 
in certain respects from conventional inter- 
action theories which are predicated on the 
notion that action occurring between persons 
is reciprocal. The general formula is: when 4 
initiates an action to B, B's response to 4 serves 
as a stimulus for A, etc. The unit of action is 
the person. It would not be too far from the 
truth to say that all strictly sociological theories 
accept implicitly or explicitly this formula of 
interaction. Analysis which follows from such 
interaction theories would allow inferences at 
the first of the three levels of complexity de- 
scribed by Murphy (1947) . At this level per- 
sons are studied as distinguishable entities 
without reference to internal structure. 

In role theory, the person as the broad so- 


ciological unit of interaction is retained, but 
a somewhat finer unit, the role, is added. Thus 
role theory embraces reciprocal action between 
persons, but these actions are organized into 
roles. If this were the only addition, we would 
have no more than an extension of traditional 
sociological theory. A second kind of inter- 
action has been added, however, which marks 
role theory as a unique social psychological for- 
mulation, namely, the interaction of role and 
self. Such a theory aims at problems at the 
third of Murphy's levels of complexity, where 
there is structure within the organism and 
structure within the environment. It is to the 
investigation of these structures and their inter- 
action that role theory addresses itself. In 
broad perspective, contemporary role theory 
regards human conduct as the product of the 
interaction of self and role. Not dissimilar is 
Parsons and Shils' idea of need dispositions and 
role expectations (1951) . 

Unlike the field-theoretical school or the psy- 
choanalytic school, whose concepts have his- 
tories more or less directly traceable to the 
original theorists, Lewin and Freud, role theory 
has developed as the result of the work of inde- ; 
pendent investigators who "discovered" the 
need for such synoptic concepts as self, role, and 
interaction. This does not imply that these 
conceptions have no history. At the turn of the 
century James M. Baldwin, G. Stanley Hall, 
William James, and John Dewey had begun 
to place the self on an empirical footing. Not 
long afterward Charles H. Cooley and George 
H. Mead further developed the notion of the 
self, and Mead began his analysis of the role- 
taking proces. With a few notable exceptions, 
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however, the concept of the self disappeared 
from learned psychology until it was revived 
by Allport (1937). 

The concept of role, until recently, was no 
more than an armchair abstraction and made 
little, if any, impression on the currents of 
American psychological thought. In the last 
decade, under the influence of the sociometrists 
and group dynamics workers, and as a result 
of interdisciplinary work with sociologists, role 
has been accepted as a concept with research 
possibilities, Interaction has always been recog- 
nized as a common-sense principle, but psy- 
chological experimentation and conceptualiza- 
tion were designed primarily in unidirectional 
stimulus-response terms. 

At this writing, there is a growing number of 
psychologists who utilize self, role, and inter- 
action as focal concepts in their theoretical 
formulations. Among these are Newcomb 
(1950) , Cameron (1943, 1947) , Sargent (1950), 
McClelland (1951), and Sarbin (1943). Men- 
tion should also be made of a number of 
sociologists whose contributions have been in- 
valuable: Cottrell (1942), Merton (1940), 
Hughes (1928), Waller (1932), Parsons (1951), 
Parsons and Shils (1951), and K. Young (1952). 

It should be noted here that most of the 
literature is discursive and anecdotal or ab- 
stractly theoretical. Although we shall refer 
at the appropriate places to some of these 
authors, wherever possible we shall review em- 
pirical or experimental studies in order to 
document our argument. Parenthetically, Cot- 
trell's presidential address before the American 
Sociological Society (1950) sketches a number 
of crucial problems awaiting empirical investi- 
gation, most of which focus on self, role, and 
interaction, 

The exposition of role theory requires a 
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broad overview against which our psychological 
concepts may be delineated. To this end, a 
conceptual framework is sketched below which 
embraces culture, society, and personality. 

With Linton (1947), we regard a culture as 
an organization of learned behaviors and the 
products of behavior which are shared and 
transmitted. When analyzed, these behaviors 
appear to be no more and no less than the or- 
dered actions of persons, e.g., “mother feeds 
infant by forcing premasticated food into its 
mouth.” Moreover, persons are always mem- 
bers of a society (defined as an aggregation of 
persons with common goals) , and these societies 
are structured into positions or statuses or 
offices. The positions are collections of rights 
and duties designated by a single term, e.g., 
mother, village chief, teacher, etc. The actions 
of persons, then, are organized around these po- 
sitions and comprise the roles. Role and posi- 
tion are conjoined. Roles are defined in terms 
of the actions performed by the person to vali- 
date his occupancy of the position. In sum, all 
societies are organized around positions and 
the persons who occupy these positions perform 
specialized actions or roles. 'These roles are 
linked with the position and not with the per- 
son who is temporarily occupying the position. 

Furthermore, the person is characterized by 
an internal organization of qualities or disposi- 
tions, the residue of an individual's experiences 
as a participant in the culture, In Linton's 
analysis this aspect is called personality and em- 
phasizes individual differences. In role theory, 
this internal organization of qualities (traits, 
attitudes, and habits) is conceptualized as the 
self. The term “personality” is reserved for a 
broader concept: action systems arising out of 
the interplay of self and role. 


ROLE 


Although the term role has recently come to 
have the beginnings of a stable core of meaning, 
we are a long way from univocality. In a review 
of over 80 sources in which the concept role 
was used, Neiman and Hughes (1951) con- 
cluded that: 


The concept role is at present still rather 
vague, nebulous, and nondefinitive. Frequently 
in the literature, the concept is used without any 
attempt on the part of the writer to define or 
delimit the concept, the assumption being that 
both writer and reader will achieve an immediate 
compatible consensus . . . (p. 149) 


As employed by Mead (1934) and a number of 
other sociologists such as Park and Burgess 


(1924) , role was regarded as the unit of sociali- 
zation. This usage leads to a definition of 
personality or of the self as the internalization 
of roles. Whether the role is made up of overt 
or covert elements or both is difficult to say 
from their statements. Role is regarded by 
them as "a pattern of attitudes and actions" 
which a person takes in social situations, One 
objection to this formulation is that it is not 
conceptually pure. Attitudes are inferences 
made about certain kinds of covert behavior, 
while actions are more directly observable. A 
unit which includes such operationally hetero- 
geneous conceptions makes difficult the isola- 
tion of variables for research. The lack of 
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specificity in the above definition of role may 
be responsible in part for both the paucity of 
experimental studies and the plethora of arm- 
chair theorizing associated with it. 

Whatever agreement has been attained in the 
use of the term role centers around the organ- 
ized actions of a person coordinate with a given 
status or position, One of the sometimes con- 
fusing results of thus regarding roles and posi- 
tions as intimately related is in their being 
treated as identical. They are interdependent 
concepts, but not identical. In the paragraphs 
which follow an attempt is made to translate 
position into psychological concepts and to in- 
dicate its relationship to role. 


DEFINITION OF POSITION 


What is a position? — or its equivalent, “a 
system of rights and duties"? The writer would 
regard a position in a social structure as a set 
of expectations or acquired anticipatory reac- 
tions. That is to say, the person learns (a) to 
expect or anticipate certain actions from other 
persons and (b) that others have expectations 
of him. The child, for example, learns to ex- 
pect to be fed, bathed, fondled, etc., by an adult 
who, of course, is operating within an existing 
role organization. These expectations become 
organized into a concept later verbalized as 
“mother.” The adult expects certain responses 
from the child. These latter expectations be- 
come organized into a concept such as "mother's 
darling." Another illustration is the job speci- 
fication of contemporary personnel work. The 
job or position is described in terms of the 
actions expected of the occupant of the posi- 
tion (his obligations) and the actions which 
he may expect from others such as foremen, 
subordinates, paymaster, etc. (his rights). In 
other words, a position is a cognitive organiza- 
tion of expectations, a shorthand term for a 
concept embracing expected actions of persons 
enacting specified roles. "These expectations, 
organized as they are around roles, may justi- 
fiably be called role expectations. Thus, a po- 
sition is a cognitive organization of role expec- 
tations. 

In somewhat different language but probably 
related to this conception is the statement of 
Parsons and Shils (1951). Although roles and 
role expectations are used interchangeably, the 
referent appears to be the same. 


The allocative foci of social systems are roles 
or role expectations. The social system is in a 
sense composed of a variety of roles or role ex- 


225 


pectations; each of these assures that some need 
of the social system will be met. The referent 
of a role... is a set of concrete orientations; 
the role or role expectation is an inferred entity 
in exactly the same fashion as is the needdis- 
position. When orientations are grouped (or 
abstracted) according to the roles or role expecta- 
tions that control them, and according to the 
interacting groups to which they belong, we are 
dealing with social systems. (p. 92) 


DEFINITION OF ROLE 


Cottrell's concise definition is worth quoting: 
"A role (is) ... an internally consistent series 
of conditioned responses by one member of a 
social situation which represents the stimulus 
pattern for a similarly internally consistent 
series of conditioned responses of the other (s) 
in that situation. Dealing with human be- 
havior in terms of roles, therefore, requires 
that any item of behavior must always be placed 
in some self-other context" (Cottrell, 1942, p. 
617). 

A role is a patterned sequence of learned 
actions or deeds performed by a person in an 
interaction situation. The organizing of the 
individual actions is a product of the perceptual 
and cognitive behavior of person A upon ob- 
serving person B. B performs one or a number 
of discrete acts which 4 observes and organizes 
into a concept, a role. On the basis of this 
conceptualization of the actions of B, 4 expects 
certain further actions from B. This expecta- 
tion is covert, and is the equivalent of saying 
“locates or names the position of the other." 
Once having located or named the position of 
the other, 4 performs certain acts which have 
been learned as belonging to the reciprocal 
position; these actions are conceptualized as 
A's role. Although language imposes upon us 
the necessity for defining role in this rather 
deliberate way, the reader should bear in mind 
that this "locating the position of the other" 
may be placed somewhere on a continuum from 
deliberate to automatic, from witting to un- 
witting. 

In attempting this definition of role and po- 
sition, the interdependent relationship of the 
two concepts is indicated. A role, the organized 
actions of a person in a given position, may be 
a person's own, or it may be the actions of other 

ersons not immediately involved in the recip- 
rocal behavior (Cameron, 1950) . The role may 
be that of teacher, the reciprocal role that of 
student, and both teacher and student may be 
reading about a third role which belongs to 
neither of them, for instance, scientist. The 


226 


interest of the investigator, then, can focus 
upon the actions of the teacher and the role 
expectations of the student, or vice versa, or 
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upon the influence of the actions of the scientist 
on the role expectations of either teacher or 
student, 


RoLe EXPECTATIONS 


A position in a social structure is equivalent 
to an organized system of role expectations. In 
Tolman’s (1952) scheme, the latter would be 
variables intervening between the stimulating 
conditions and the performances or acts. These 
role expectations are cognitive and, of course, 
must be inferred either from phenomenological 
report or from the analysis of other kinds of 
data. Two general kinds of expectations are 
found: rights and obligations. Rights are role 
expectations in which the actor of the role 
anticipates certain performances from the actor 
of the reciprocal role; e.g., the child's right to 
be protected by his mother. Obligations (or 
duties) are role expectations in which the actor 
of a role anticipates certain performances di- 
rected toward the actor of the reciprocal role; 
e.g. the mother's obligation to provide protec- 
tion for the child. 

A person cannot enact a role for which he 
lacks the necessary role expectations. These 
must be acquired through experience. In the 
following pages we shall consider the ways in 
which persons acquire the prior experience 
which makes possible the enactment of roles. 


THE ACQUISITION OF ROLES 


The actions which are patterned into roles 
are learned through two broadly defined proc- 
esses, intentional instruction and incidental 
learning. It should be explicitly stated that 
both forms of acquiring roles may operate con- 
jointly. Any set of role expectations is probably 
an admixture of expectations acquired through 
intentional instruction and incidental learning. 
In intentional instruction the culture carriers 
in any society attempt to teach prescribed acts, 
Intrinsic to these methods is the belief in some 
form of reinforcement theory of learning. Some 
cultures stress reward for the commission of a 
desired act, others stress punishment for the 
commission of an undesired act. In either case 
there is an intentional program designed to 
teach the child certain role behaviors. 

Early in life the child is taught acts which 
are differentiated, for example, according to 
sex. Little boys are rewarded or punished for 
certain kinds of actions, little girls are rewardec 
or punished for other kinds of actions. Thus 
begins the acquisition of actions for the later 
performance of one’s generalized sex role. 

Acts are learned as patterned behaviors, not 


as isolated muscle responses. Further, these 
acts are taught through the agency of other 
persons; thus, stimulus events always include 
social objects who in turn respond to the child's 
acquired actions. This is the paradigm of in- 
teraction for which the role is the unit. The 
child learns not only the motoric actions ap- 
propriate to the given situation, he also learns 
that the other person has expectations into 
acl his overt acts must fit, Concurrently, in 
the sequential interactions, he comes to expect 
certain responses from the other. Thus is born 
‘his own system of expectations. 

In incidental learning, the child adopts the 
T of others in his environment. Cameron 
and Magaret (1951) refer to this as "adoption 
of the prevailing pattern." DuBois (1941) re- 
fers to “absorptive” disciplines in child rearing. 

Several forms of acquiring role behavior 
through incidental means may be identified. 
Among them, play is probably the most widely 
observed. Mead (1934) was among the first to 
recognize the importance of play in the adop- 
tive learning of social roles. At least two 
resultants follow from the play acting of chil- 
dren: (a) the acquisition of roles (truncated, 
of course, because of maturational limitations) , 
and (b) the acquisition of skill in shifting 
roles. In play, the child can shift from role 
to role without observing the formal logic of 
the adult. This movement from role to role 
leads to oscillatory shifting of sets, thus en- 
abling the child to take both his own role and 
the role of the other. As will be shown later, 
the absence of a number of different standard 
roles, as well as the absence of skill in taking- 
the-role-of-the-other, retards socialization and 
leads to invalid role enactments. 

The imaginative processes are central in play 
acting. They are likewise central in covert 
processes, such as fantasy. The silent rehearsal 
of roles appropriate to real or imagined posi- 
tions, and of roles appropriate to the position 
of the other, provides a large reservoir of ex- 
perience. In all cultures, a good part of the 
content of imaginative behavior is institu- 
tionalized in the form of folk tales, myths, and 
other story forms. The imaginative process is 
central likewise in that form of acquiring roles 
which has been variously named identification, 
introjection, empathy, and taking-the-role-of 
the-other. Dependent on the ability of the per- 
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son (child or adult) to engage in as if processes, 
identification provides numerous avenues for 
acquiring roles. The number and kinds of per- 
sons with whom one may identify, of course, is 
limited by the number and kinds of persons in 
the environment and by cultural practices. If 
a child’s social environment is made up prin- 
cipally of the mother, there are fewer opportu- 
nities for identification than if the environment 
contains many persons. The “other” need not 
be in the visual field of the acting person. In 
fact, he may be a cultural hero, contact with 
whom has been established vicariously through 
storytelling. The importance of the imagina- 
tive processes in role taking has been empha- 
sized by Mead (1934), Moreno (1946), Cam- 
eron (1947), and Sarbin (1950). 

Superficially similar to identification, and 
probably antecedent in the development of 
the child, is the deliberate copying of the 
actions of others in the absence of intentional 
instruction. Although imitation as an ex- 
planatory device has been repudiated, the 
observations that children deliberately copy the 
behavior of other children and adults cannot 
be gainsaid. What is copied, of course, depends 
on the child's maturational level, availability 
of models, and favorable motivation (Miller 
and Dollard, 1941). Unlike identification, the 
model has to be present in the imitator's be- 
havioral field. 

Controlled experimentation in the learning 
of roles has not yet begun. Anecdotal material 
is available which provides evidence for the 
plausibility of the hypothesis that adult role 
behavior depends on the prior acquisition of 
role expectations. Especially cogent are the 
reports of situations in which the absence of the 
opportunities for learning to enact roles ap- 
propriate to defined positions leads to deviant 
conduct (Cameron, 1947; Gough, 1948) . 


ROLE EXPECTATIONS AS ACTIONS AND AS QUALITIES 


Both kinds of role expectations, rights and 
obligations, may be analyzed in terms of actions 
and qualities. Analysis of the role expectations 
of mother, for instance, may reveal the poten- 
tial action: mother provides food when child 
cries. In addition, certain qualities of the per- 
son enacting the role are expected: warmth, 
gentleness, friendliness, etc. Both the expec- 
tations of actions and the expectation of quali- 
ties are residua of prior experience. This con- 
struction is similar to Parsons and Shils’ notion 
of performance and qualities. 

It is possible to orient objects either (1) in 
terms of characteristics they possess regardless of 
their performances, or (2) in terms of charac- 
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teristics they possess by virtue of their perform- 
ances . . . A social object is a complex of quali- 
ties when the actor overlooks . . . performances 
and focuses on "attributes" as such . . . A social 
object is a complex of performances when the 
actor . . . focuses on the processes of action and 
their outcomes rather than its qualities or at- 
tributes . . (1951, p. 65) 


Role expectations as actions. Role expecta- 
tions as anticipated actions or performances 
can be studied by means of questionnaires. A 
number of questionnaires appropriate to such 
study have been devised. One of these, a role- 
conception inventory, has been developed by 
Motz to determine (l) the expectations of 
actions of self in the marital relationship, and 
(2) the expectations of actions that other occu- 
pants of the same position should perform. 
Preliminary interviews with married students 
indicated the presence of two types of marital 
roles for each sex: traditional and companion- 
ship roles. These types were conceptualized 
according to six areas of actions: housework, 
employment, financial support, child care, com- 
munity participation, and schooling. State- 
ments in the inventory were devised to get at 
these action expectations in the traditional role 
and in the companionship role. The items were 
so written that scores could be obtained for 
personal expectations ("I should do thus and 
so”) and public expectations ("Wives should 
do thus and so"). The variability in expecta- 
tions among the members of the group, of 
course, reflects the degree of clarity of role 
expectations. 

Cottrell (1942) has listed a number of hy- 
potheses regarding the relationship of role 
expectations to adjustment. One of these 
hypotheses is concerned with the specificity and 
definiteness of role expectations. "The degree 
of adjustment to roles which a society assigns 
to its agesex categories varies directly with the 
clarity with which such roles are defined." As 
the result of a questionnaire study of under- 
graduates, Rose (1951) concluded that the 
adult role is unclear to the average college girl. 
Role expectations about children, household 
chores, servants, civic work, employment, etc., 
showed lack of definitions. That such a state 
of affairs may lead to ineffectual role enact- 
ments is a safe guess. A person must move 
cautiously and uncertainly when role expecta- 
tions of others are partly known or entirely 
‘unknown (or unknowable). Rolerole and 
self-role conflicts are likely to follow from am- 
biguous role expectations. The persisting need 
for solution of such conflicts may lead to so- 
cially invalid role enactments. 
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Lack of specificity and definiteness in role 
expectations may be related to discontinuities 
in female expectations. Benedict's (1938) often- 
quoted observations about the effects of 
discontinuity in cultural conditioning are ap- 
posite here. The analyses by Komarovsky 
(1946), Wallin (1950), Nottingham (1947), 
and Hacker (1951) on the effects of historical 
changes on role expectations emphasize the 
actual and potential problems of women in 
contemporary American society. 

Emphasis on the lack of specificity and defi- 
niteness in women’s role expectations should 
not lead the reader to assume that this condi- 
tion does not apply elsewhere. Persons who 
are mobile from class to class or culture to 
culture may likewise be plotted on this dimen- 
sion. The American adolescent’s occupational 
role expectations are likewise characterized by 
indefiniteness and diffuseness (Carter, 1944) . 

Role expectations as the independent vari- 
able in conflict analysis has been used by 
several investigators. Role conflicts occur when 
a person occupies two or more positions simul- 
taneously and when the role expectations of 
one are incompatible with the role expectations 
of the other. Ort (1952) used the concept of 
role expectations in an empirical investigation 
of the relationship of social class to role con- 
flicts. The subjects were 100 adolescent boys, 
and the data were determined simply by count- 
ing the number of unfulfilled role expectations. 
An example of an unfulfilled role expectation 
would be where a youth answers “Yes” to the 
question “Should a boy be home by a certain 
time at night?" and “No” to the question “Are 
you home by a certain time at night?” 

Social class indices were derived from Warn- 
ers scheme. The correlation between role 
conflict and class was positive and high; as 
predicted, the middle-class adolescents expressed 
more role conflicts than the lower-class boys. 

This research is cited only incidentally to 
point out the influence of class position on 
role conflict. The significant point is that it 
demonstrates how a cognitive variable, action 
expectations, is subject to empirical research. 

Less frequently used than the questionnaire 
or the interview, but perhaps possessing more 
ecological validity (Brunswik, 1947), are be- 
haviors observed in enacting contrived roles. 
Inferences are drawn from the observation of a 

erson actually engaged in the enactment of 
a role. Of course, the investigator must make 
a correction (implicitly) for the fact that the 
subject, in addition to responding to the task 
set for him, is also enacting the role of "subject- 
in-an-experiment." 
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Brown (1952) reported an experiment in 
which this role-playing method was used. He 
was interested in the effects and interactions 
in role-taking situations in terms of the follow- 
ing characteristics: sex of person enacting the 
role of another person, sex of the "other" (the 
reference person), the sex orientation of the 
cultural content of the situation, and incidental 
learning. His main hypothesis was that "there 
exists a partial disjunction between the cul- 
tural worlds of males and females in American 
society which will be exhibited in the partial 
failure of role taking" (Brown, 1952, p. 568). 
In other words, the results of learning role 
expectations of males and females are such that 
role enactment of an opposite-sex peer will be 
inadequate or distorted. In general, Brown 
concluded that (a) males are slightly better 
than females in role-taking skill and that (b) 
role taking across the sex line is more difficult 
than role taking within sex categories. 

Role expectations as qualities. Role expecta- 
tions as qualities or attributes rather than as 
actions or performances may also be studied 
empirically. The significance of inferred quali- 
ties in building up role expectations is derived 
from a study by Sullivan (1950). He investi- 
gated conceptual behavior with the aid of a 
set of 40 pictures of men. The subjects were 
asked to sort or categorize the persons. The 
principles upon which the sortings were made 
were analyzed. Among the findings were sex 
differences in the way social concepts are 
formed. Women differed from men in that they 
were more inferential, more dichotomous and 
specific in the manner in which they assign 
qualitative role expectations to others. For 
example, women used traits (e.g., moody) and 
expressions (e.g, quizzical) as concepts; men 
used more descriptive categories (e.g, direc- 
tions, clothing) . 

Sullivan's results suggested a further hy- 
pothesis: that women would use inferential 
terms depicting “underlying” traits more fre- 
quently than men in their role expectations of 
"live" subjects. Conversely, men would be ex- 
pected to use terms more continuous with the 
raw data: nouns describing positions, or adjec- 
tives describing external characteristics of the 
person. In a study conducted by the author 
as a class exercise, 131 upperclassmen served as 
subjects. They were told that a version of the 
word-association experiment would be per- 
formed. Instead of presenting a word as a 
stimulus, however, a person would be the stimu- 
lus. The subjects were to write the first three 
words that "came to mind" upon the presenta- 
tion of a live social object for ten seconds. "This 
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was repeated with two other persons as stimulus 
objects. The social objects merely rose from 
their seats — being required to say nothing. 
The descriptive words used by the respondents 
were then grouped into two categories: those 
representing several degrees of inference, such 
as shy, aggressive, warm, hostile, logical; and 
those representing descriptive categories, such 
as student, tall girl, grey (color of clothing), 
etc. The hypothesis derived from Sullivan's 
study was verified (P — .02) , men favoring the 
use of the latter class of words, women, the 
former. Thus, qualitative features of role 
expectations may be approached experimen- 
tally (Sarbin, 1952b). 

Naboisek (1953) made a start at a more 
ecological method of isolating qualitative 
characteristics of role expectations by predict- 
ing, in a limited way, interpersonal behavior 
to be expected from members of small therapy 
groups. The subjects filled out personality 
inventories and adjective checklists made up 
of 96 interpersonal adjectives. Naboisek then 
classified the patients according to what he con- 
sidered the prevailing qualitative expectations. 
Four classes of role expectations were used: 
friendly-dominant, hostile-dominant, friendly- 
submissive, and hostilesubmissive. After six 
sessions of group therapy (a reasonable time 
in which to become acquainted) each subject 
checked interpersonal adjective checklists for 
every other member in his group. The assess- 


ments were made about other persons enacting 
the patient role each in his idiosyncratic way. 
'The agreement among their assessments veri- 
fied the predictions made by the experimenter 
prior to therapy. For example, on the basis 
of pretherapy self-descriptions a person was 
classified as having role expectations of others 
as “friendly-dominant.” After six weeks’ ac- 
quaintance a summing of his checklists about 
the role enactments of others revealed that he 
actually checked adjectives about the others 
that were classifiable as friendly-dominant. In 
this setting, the roles are described qualita- 
tively in adjectival terms, the substantive “pa- 
tient" applying to all members of the group. 

Developmental studies in role expectations. 
Genetic accounts of role expectations (either as 
anticipated actions or as qualities) are virtually 
nonexistent. Hartley and associates (1948) re- 
port a set of experiments on role expectations. 
By the use of pictorial materials and interro- 
gation procedures, data were gathered to 
answer questions about multiple group mem- 
bership. Children from age 3.5 to 10.5 were 
the subjects. Role expectations of father and 
worker were studied. Constancy of the parental 
role appears at an earlier age than constancy 
of the worker role. Whether the earlier con- 
stancy of the parental role is due to affective 
factors (child's need for a parent) or cognitive 
factors (greater familiarity) is a problem for 
future investigation. 


Rote PERCEPTION 


If we accept the premise that perception is 
an intraorganismic response of the organism 
to stimulus objects and events, then the exten- 
sion to the social act is obvious. The perception 
of roles is an organized response of a person 
to stimuli in a social context. This is no place 
to review the literature on what is organized. 
Most psychologists agree that what is organized 
is a contemporaneous event and the residua of 
the organism's prior experience. This may be 
conceptualized as trace and process, concept 
and percept, hypothesis and information, creed 
and deed, expectations and acts, etc. In role- 
taking theory, acts of persons are seen as or- 
ganized against a cognitive background of 
role expectations. Role perception may be 
thought of as a sequence of behaviors in which 
the perceptual response is the first part of a 
social act: the (usually) silent naming or locat- 
ing the position of the other (from observed 
actions or inferred qualities) , which serves to 
locate the position of the self. The second part 
of the social act is the motoric response, the role 


enactment, in which the actor performs actions 
appropriate to his location of the positions of 
self and other. In real life situations, the se- 
quence of events is seldom as deliberate as this 
analytic description would suggest. The time 
elapsing between the perceptual response and 
the motoric components of the social act may 
be infinitesimal, so that observers may overlook 
the cognitive phase. In infants, of course, the 
two phases are not at all discontinuous. Per- 
ception of an object or event and motor dis- 
charge are continuous processes. In young 
children, the motoric response which follows 
the presentation of a stimulus object is medi- 
ated by some form of gestural activity, such as 
pointing or naming. With the muting of 
speech, the perceptual response becomes covert. 

From this presentation of role perception, it 
lvecomes obvious that the veridicality of one's 
role perceptions and functional adaptation to 
the social world are highly related. If a per- 
son's locating of the position of the other is 
invalid, then his location of the position of self 
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is likely to be invalid. His role enactment, 
then, will be inappropriate and nonadaptive. 

Research in this area has been neglected. 
Perception has, for the most part, been studied 
in “impoverished” settings, where there is 
little ecological generality (Brunswik, 1947). 
Moreno (1946) has suggested procedures which 
utilize social acts in the psychodramatic experi- 
ment. However, these procedures have been 
used more for demonstration and clinical pur- 
poses than for the determination of principles 
of role perception. 

The building blocks of role perception, of 
course, are the overt acts of others. These acts 
may be defined as everything the other does. 
This includes his gross skeletal movements, his 
verbal behavior, what he wears, his facial ex- 
pression, his posture, his gait, his accent and 
intonation, his adornments, visible emblems 
and tattoo marks, etc. The wearing of a uni- 
form and a silver badge, for example, is part 
of the role of policemen. The perception of 
these objects on a person leads to predictable 
role behavior on the part of the perceiver (pro- 
viding the system of role expectations of the 
perceiver is known). Perceiving the position 
of thé other, then, becomes a matter of attend- 
ing to and organizing cues. 

Two recent experiments illustrate a method 
for studying role perception as well as provid- 
ing data for testing certain hypotheses. In the 
first experiment (Sarbin and Williams, 1953) 
we addressed ourselves to the problem of role 
perception on the basis of certain reduced cues, 
namely, single verbal sentences. We recorded 
38 sentences spoken by different persons under 
various kinds of role instructions. For example, 
one sentence was shouted by a middle-aged 
man: "All right, boys, get in there and fight!" 
Another example is a whining little girl who 
cries; "Somebody broke my dollie" A third 
example is a man saying: "I used to sit before 
the fire and read poetry for hours." 

These recordings have been played before 
various groups over the past three years, The 
first of our groups received instructions (a) to 
indicate age, sex, and social or occupational 
role of the speaker and the person spoken to, 
and (b) to specify the action occurring. On 
the basis of 96 protocols, a multiple-choice 
answer sheet was developed which provided 
(a) three age-range choices, (b) male and fe- 
male choices, (c) four role choices, (d) four 
"alter" choices, and (e) four action choices. 
Except for (b), whic h was eliminated as a vari- 
able early in our experiments because of the 
restricted range of choices, the frequency dis- 
tribution for each variable followed a J-curve. 
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The first finding of importance is that dif- 
ferent middle-class groups perceive these re- 
stricted role cues in much the same way. It 
was possible therefore to establish a modal 
response for the four response categories for 
each of the sentences. Although the objective 
validity of the response was practically zero, the 
consensus was high. Actually, the sentences 
were spoken by student and adult actors. Few 
of the respondents perceived the speaker as 
"actor." In the first illustrative sentence cited 
above, over 90 percent of our respondents 
checked “football coach." 

One of the hypotheses confirmed by the ex- 
periment was that “good” role perceivers (those 
who respond in the modal way) would more 
likely have self-characteristics in common, 
while “poor” role perceivers (those who re- 
spond in deviant ways). would have fewer 
self-characteristics in common (more hetero- 
geneity), and would be different from the 
"good" role perceivers. Self-characteristics were 
inferred from adjective checklists filled out by 
the subjects. 

No claim is made for the objective accuracy 
of the self-descriptions. They are limited phe- 
nomenological reports, qualitative expressions 
of the self. The adjectives characteristic of 
"poor" role perceivers (male) are similar to 
the cluster checked by diagnosed neurotics 
(Sarbin and Rosenberg, 1952); the adjectives 
checked by the female “poor” role perceivers 
are often utilized in self-descriptions by non- 
insightful hysterics. The adjectives checked by 
the “good” role perceivers indicate awareness of 
tension-binding qualities (eg, self-denying, 
tense, modest, thoughtful, anxious, civilized, de- 
liberate, patient, reasonable, etc.) . Obviously, 
this conclusion is only suggestive and calls for 
further testing. 

A third finding emerged from the test of the 
hypothesis that males and females will differ 
in role perception. For women a significant 
positive correlation is found between fre- 
quency of perceiving the modal age response 
and frequency of perceiving the modal social 
role response. Age and social role thus appear 
to be intimately related for women. For men a 
significant positive correlation is found be- 
tween the frequency of perceiving the modal 
social role and the modal social situation. Thus, 
for men, social role and social situation appear 
to be intimately related. 

A second experiment (Sarbin and Hardyck, 
1952) followed the same design as the previous 
one. This study was designed to investigate 
the influence of postural behavior (as actions 
of persons) on role perception. The idea of 
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postural cues, of course, goes back to Lipps’ 
doctrine of kinesthetic inference. We were 
able to eliminate cues other than postural by 
the simple expedient of representing postures 
with line drawings of "stick figures." We pre- 
pared 42 such figures and presented them by 
means of lantern slides to various undergradu- 
ate classes. "The respondents were to write the 
action, state, feeling, or other quality that was 
suggested by the stick figure. From the re- 
sponses of the first group, we constructed a 


multiple-choice type test for administration to 
other experimental groups. As in the experi- 
ment on auditory cues, we found that for most 
of the stimulus presentations, the typical curve 
of the distributions of the responses was a J- 
curve. For our sample of undergraduates, then, 
we can conclude that certain postures are per- 
ceived as associated with certain acts or with 
certain qualities of a person. Figure 1 shows a 
few samples and response choices. 


18 
— disinterested 
— describing 
— resigned 
— doubtful 
— questioning 


39 
— shy 
—  self- conscious 
— ashamed 
— modest 
— sad 


45 


— surprised 

— dominating 
— suspicious 
— undecided 

— aloof 


20 
— self-satisfied 
— impatient 
— describing 
— casual 
— angry 


Ficun 1. 


We were interested in testing the hypothesis 
that persons who do not conform to the mode 
in the perception of postural acts differ on 
other criteria. For one sample the subjects 
were asked to state briefly the method they had 
used in forming their judgments, i.e., whether 
they had imagined themselves in the particular 
postures for each figure (empathized), or 


whether they had formed their judgments solely 
on the basis of the checklist categories. 

An initial scoring using simply the number 
of modal responses for each subject indicated 
a significant relationship between modal re- 
sponse score and self-report on empathy-non- 
empathy. The subjects who had checked the 
greatest number of modal responses reported 
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the use of the empathetic method in a signifi- 
cantly greater proportion, the biserial r being 
A41, P = 01. 

For another group, scores on the California 
Psychological Inventory (Gough) were ranked 
and compared with a ranked frequency count 
of the number of modal responses. The rela- 
tionship is quite high, the Spearman rank dif- 
ference correlation being .81. In view of the 
nature of the California Psychological Inven- 
tory, social adjustment would seem to be defi- 
nitely linked to empathic ability. 


HANDBOOK OF SOCIAL PSYCHOLOGY 


A third procedure was to administer the 
stimulus series individually to persons whose 
actions had already demonstrated their lack 
of conformance in the perception of roles, 
namely, hospitalized schizophrenics. They 
checked the nonmodal responses as frequently 
as the modal responses. 

Most significant in this procedure is the fact 
that role perception is subject to empirical re- 
search. Three different samples show rela- 
tionships which are predicted from the theory 
elaborated below. 


Rote ENACTMENT 


The preceding sections have dealt with role 
expectations and role perception, respectively. 
These are the covert processes which, of course, 
must always be inferred. Now we turn to overt 
performances, role enactment. 

A series of propositions has already been 
advanced which facilitates the description and 
exposition of role enactment (Sarbin, 1943, 
1951). As expressed earlier, role enactments 
are the overt performances of persons; these 
performances validate (or invalidate) the ex- 
pectations of the other person or persons in a 
social situation. (Our interest here is in the 
motoric aspect rather than in the cognitive 
aspects of role theory.) Coutu (1951) has re- 
cently inveighed against the confusion of the 
motoric and the cognitive aspects. What we 
call role enactment is equivalent to Mead's and 
Coutu's "role playing." "Role taking" or 
taking-the-role-of-the-other means adopting the 
perspective or attitude of the other. This is 
regarded as a covert process, although there are 
notable exceptions, such as the overt empathiz- 
ing with a halfback making an end run, smiling 
with the hero when he sees his sweetheart after 
a misunderstanding, etc. In these situations, the 
cognitive behavior "spills over" into action. In 
our organization of the theory, we treat em- 
pathy as a separate concept, as primarily covert, 
although we recognize its close relationship to 
role enactment. 

Role enactment [equivalent to Newcomb's 
“role behavior" (1950) ] includes, among other 
segments of behavior, gross skeletal movements, 
the performance of verbal and motoric gestures, 
posture and gait, styles of speech and accent, the 
wearing of certain forms of dress and costume, 
the use of material objects, the wearing of 
emblems or ornamentation, including tattoos, 
etc. In short, role enactment embraces what 
may be called the mechanics of the role-taking 
process. These mechanics can be summarized 
by means of the following concepts: number of 


roles, organismic involvement, and accessibility 
or reportability. 


NUMBER OF ROLES 


The dimension, number of roles, is subject 
to observation and report, providing the ac- 
tions which make up the roles are properly 
conceptualized. The ultimate in attaining 
ecological generality would occur if an observer 
followed a given subject for a representative 
number of days and recorded all instances of 
role behavior. The observer might also rate 
the intensity with which he enacted the various 
roles. This type of observational approach is 
not new to anthropologists (see, for example, 
Ford, 1938) . 

A less difficult approach for getting at the 
number of roles enacted by an individual is the 
use of small-group settings. The discussion 
group has been the focus of a number of re- 
search attempts in recent years and lends itself 
to the laboratory study of role behavior. Benne 
and Sheats (1948) have recognized and named 
a number of positions (systems of role expecta- 
tions) which characteristically arise in the 
discussion group. Some examples are: the 
opinion-giver, the orienter, the encourager, the 
hostile critic, the aggressor, etc. It should be 
pointed out that such roles do not have the 
institutional support of the more traditional 
roles, such as mother, doctor, shortstop, etc. 
Nevertheless, they do emerge in the interaction 
process and can be described with the same 
formal characteristics as the traditional roles. 
Among the roles which emerge in the small 
group are various “leader” roles, such as initia- 
tor, coordinator, director, and so on. The 
dimension "number-of-roles" was applied to 
such leader roles in a recently completed study 
by Sarah Mazelis (1953). She addressed herself 
to the question: Is the productivity of a dis- 
cussion work group related to the diffusion of 
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leader roles among the members of the group? 
The leader roles were defined and categorized 
by actions. The observers recorded and counted 
the number of leader roles enacted by each 
person in each of four independent work 
groups. The hypothesis studied was: produc- 
tivity will be greater among groups where the 
various leader roles are enacted by most of the 
group members; productivity will be less where 
the leader roles are enacted by a small propor- 
tion of the group members. The work products 
were rated on several criteria by experts in the 
field. 

In a group task which required planning, 
the quality of the product, as rated by the 
judges, was directly related to the diffusion of 
the various leader roles among the group mem- 
bers. Spearman rank difference correlation be- 
tween an index of role diffusion and production 
ratings was .75. Although the small number 
of groups makes this finding only suggestive, the 
method appears promising. 

This number-of-roles dimension is implicit 
in the application of the role-taking approach 
to psychopathology. It is a widely accepted 
postulate that the more roles in a person's be- 
havior repertory, the "better" his social adjust- 
ment — other things being equal. Cameron 
and Magaret (1951) have argued most con- 
vincingly that the absence of role-taking skills 
is influential in the development of paranoid 
disorders. Gough (1948) has also implicitly 
used number-of-roles as a dimension in his role- 
taking theory of psychopathy. Following Mead, 
he describes how the residuals of role enact- 
ments (*me's") are integrated into a self-con- 
ception, and how the ability to be self-critical 
is a function of the number of “me's” or the 
internalization of a "generalized other." Fur- 
ther, “. . . deficiency in role playing means the 
incapacity to look upon one's self as an object 
(Mead) or to identify with another's point of 
view .. ." (Gough, 1948, p. 363). Gough and 
Peterson (1952) have already utilized this con- 
ception in an empirical study of psychopathol- 
ogy. 


ORGANISMIC DIMENSION 


Any role may be enacted with different de- 
grees of organismic involvement. This is essen- 
tially an intensity dimension, the intensity of 
the enactment being manifest in the number of 
organic systems involved. At the low end of 
the dimension would be the kind of interaction 
which occurs with little affect and with little 
effort. The role of the customer in today's su- 
permarket involves only minimal participation 


233 


— the saying of a few words, plus a few move- 
ments involved in the exchange of money. 
Contrast this role with the intensity involved 
in enacting the role of the mother of a sick 
child. 

In daily life, of course, most roles are enacted 
with minimal organismic involvement. Be- 
havior would be inefficient indeed if all roles 
were enacted with maximal intensity. Not only 
are cultures organized so that the number of 
maximally intense roles is few, but there are 
autonomic safeguards which limit the time span 
of roles that call for great mobilization of 
energy. The organismic dimension can be un- 
derstood as applying to all organized actions. 
The illustrations we have chosen are reference 
behaviors against which more conventional 
roles may be compared for intensity (Table 1). 

Level I. The low end of the scale is labeled 
"role and self differentiated.” The self, a cog- 
nitive structure (v. infra) , and the role, also a 
cognitive structure, are minimally in contact. 
An example of a casual role, already given, is 
the customer in a supermarket. In fact, it is 
possible to perform this role at the motoric 
level while at the same time vicariously enact- 
ing other roles unrelated to the public role. 

Level II. At the next level of organismic 
involvement, the reference is that of the dra- 
matic actor who performs the motions neces- 
sary for the portrayal of the role assigned to 
him. In mechanical acting, the actor does not 
become involved; the self is relatively autono- 
mous from the role. He must maintain a cer- 
tain degree of consistency in the various 
response systems, which calls for more effort 
and precludes the degree of autonomy of the 
casual role. This degree of involvement is seen 
in many everyday interactions. For example, 
the employee who "puts on a good front" to 
impress the boss. 

Level III. At the third level, “heated” acting 
is used as the illustrative reference behavior. 
Archer (1889) in his classical study of the psy- 
chology of acting, was able to discern this kind 
of acting from the more automatic and me- 
chanical mentioned before. Commonly called 
“living the role,” the actor behaves as if he is 
the character in the drama. Of course, the suc- 
cessful actor maintains some contact with his 
role as actor in order to change his tempo, 
amplitude, intensity, etc., as conditions warrant. 
His involvement includes some affective as well 
as motoric components. In order to portray 
anger, for example, the actor may work up a 
rage by violently shaking a ladder in the wings 
before appearing on the stage. 

Level IV. The role pf the hypnotic subject 
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TABLE 1 
Role and self differentiated Role and self undifferentiated 
Minimal involvement Maximal involvement 
Few organic systems Entire organism 
Little effort Much effort 


I. casual roles 


II. dramatic role, mechanical acting 


Ill. dramatic role, heated acting 


IV. role of hypnotic subject 


V. hysterical fugue, role of the amnestic 


VI. roles involving ecstatic states; mystical 
experiences, possession, religious con- 
versions, sexual climax 


serves as a reference role for a moderate degree 
of organismic involvement. The classical be- 
haviors of the hypnotic subject, the catalepsies, 
the compulsive post-hypnotic actions, sensory 
and motoric changes, and so on, illustrate more 
cogently the operation of the as if mechanism. 
In enacting the role of the hypnotized subject as 
perceived against a background of generalized 
and specific expectations, a person demonstrates 
that more of the organism is responding than 
in play acting. He behaves as if he is blind, 
or deaf, or fearful, or analgesic, or whatever the 
specific instructions call for. 

The hypnotic situation provides a suitable 
laboratory model for the demonstration of 
organismic involvement. An experiment con- 
ducted by Lewis and Sarbin (1943) demon- 
strates this involvement. These investigators 
addressed themselves to the question: Can as if 
behavior, as observed in the role taking of hyp- 
nosis, influence a basic physiological process, 
namely, the hunger cycle? "Ten volunteer sub- 
jects, demonstrating different degrees of hyp- 
notizability, participated in the experiment. 
The apparatus was essentially the same as used 
by Carlson (1916) in his studies of gastric hun- 
ger contractions. Each subject appeared for the 
experimental period each morning without his 


VII. role of the moribund person; object of sor- 
cery and witchcraft (sometimes irreversible) 


breakfast. He was hooked up to the recording ` 


apparatus. He was hypnotized and given no 
further instructions. One full gastric hunger 
cycle was recorded. During the middle of the 
second cycle, he was given a fictitious meal. 
The results indicated that for subjects rated 
high on the role-taking aptitude (see below), i.e., 
the ability to participate in as if behavior, the 
gastric hunger contractions stopped. That is, 
taking the role of the eater was effective in re- 
ducing or eliminating the gastric contractions 
even though no food was ingested. For subjects 
who were not hypnotizable, who were limited 
in their ability to participate in this form of 
as if behavior, no effect was produced on the 
gastric cycle. Thus, a range of the dimension 
of organismic involvement was demonstrated. 

Level V. The behaviors subsumed under the 
rubric hysteria may be used as a reference band 
on the organismic continuum, Responding as 
if one is afflicted with some organic dysfunction 
is the main characteristic of the hysteric. The 
range covered by hysteria overlaps considerably 
the range covered by hypnotic role taking. 
The organismic components in hysteria, how- 
ever, are less self-limiting and more prolonged 
than in hypnosis. Descriptive cases may be 
found in any textbook of abnormal psychology. 
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Common examples are hysterical seizures, hys- 
terical anorexia, hysterical paralysis, and hys- 
terical anesthesia. Both major varieties of 
hysteria, inactivation and autonomy (Cameron, 
1947), show how the role of the invalid may 
involve the entire organism. In the same range 
of intensity of involvement is the couvade. 
In this custom the husband lies-in; the essence 
of the custom is the husband's taking the role 
of the wife (Crawley, 1902) . 

Level VI. Ecstasy, a condition usually in- 
volving suspension of voluntary action, illus- 
trates organismic involvement to a degree 
which is not ordinarily observed in day-to-day 
social interaction. Obviously, such states can- 
not be prolonged over time without damage 
to the functioning of the body. Ethnologists' 
accounts of ecstatic trance experiences, posses- 
sion, religious revivals, conversion experiences, 
and mystical unions show many of the charac- 
teristics of hysteria. But in addition to the 
greater involvement of the skeletal muscula- 
ture, there is a greater involvement of the 
organs that are served by the autonomic nerv- 
ous system. 

In our own wider culture, such roles are taken 
by participants in revival meetings, by adoles- 
cent bobby-soxers who swoon upon hearing 
"the voice," by participants in sexual congress, 
by marathon dancers, etc. What is common to 
these events is the intensity of involvement, the 
apparent relationship of this intensity to the 
activities of the sympathetico-adrenal system, 
and the automatic equilibratory controls. AII 
these activities are terminated through institu- 
tionalized rituals, fatigue and exhaustion, 
and/or autonomic regulation. 

In rites de passage the role of the celebrant is 
characterized by high organismic involvement. 
The manifest purpose of the intense role be- 
haviors of the ritual is to signify the change 
from one position to another in the society; 
the effect of the intensity of role enactment is 
te modify the participant's self-concept so that 
the new role, e.g., adult, may not be incongru- 
ent with the self. If, for example, strength is 
an expected property of the adult person, and 
the rites of passage from adolescent to adult 
includes passing a test of strength, then the 
successful completion of the test allows the 
person to add the adjective "strong" to his self- 
description. Thus he is better equipped to 
occupy the position of adult, not only because 
others know he has strength, but because he 
conceptualizes the self as strong. 

Level VII. The ultimate limit of the inten- 
sity dimension is an extension of the previous 
range. The effects of the sympathetico-adrenal 


235 


system which is excessively activated in such 
role-taking sequences as described above may 
become irreversible under certain stimulus con- 
ditions. The end result of such a performance 
is death. 

Well-authenticated cases of Voodoo death 
allow the employment of the descriptive phrase, 
taking-the-role-of-a-moribund person. Devereux 
(1939) has reported of the Mohave Indians that 
a person who learns that he has violated the 
incest taboo takes the role of the person who 
is about to die. Rivers (1924) reports of the 
Papuans and Melanesians: “Men who have 
offended one whom they believe to have magi- 
cal powers sicken and die, as the direct result 
of their belief; and if the process has not gone 
too far they will recover if they can be con- 
vinced that the spell has been removed" (Riv- 
ers, 1924, p. 50). 

The case of a man dominated by a death wish 
is described by Alexander (1943). Autopsy 
findings revealed no organic pathology. Ap- 
parently he took the role of the dying person so 
intensely that the equilibratory processes failed 
to work. 

The eminent physiologist, Cannon (1942), 
has provided a description of the physiological 
and social events associated with taking the role 
of the doomed person. He uses the description 
of bone pointing among the Australian abori- 
gines provided by Basedow (1925). This is a 
form of black magic in which the sorcerer 
points a sharpened bone at the intended victim. 

Cannon explains the physiological events 
arising from the adoption of the victim's role. 
The fear response acts upon the soma in many 
ways. The sympathetico-adrenal system comes 
into action, exercising control over the viscera 
and blood vessels. The various internal changes 
render the organism ready for physical action 
(struggle or flight). If this state of extreme 
perturbation continues without motoric dis- 
charge, death may follow. But why do the 
equilibratory or internal constancy mechanisms 
fail to re-establish homeostasis, as in ecstatic 
experiences? One answer to this question is 
that the perturbation in the organism is main- 
tained by the enactment of supporting roles by 
other members of the community (Warner, 
1937) . 

'These levels of organismic involvement are 
equivalent to, if not identical with, the concept 
of self-involvement. At one end of the con- 
tinuum, we see relatively automatic responses, 
stereotyped habits, autonomous functioning of 
self and role. At the other end, we see the 
unitary functioning of self and role. 

Dimension of accessibility. Although it 
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would be simpler to use the descriptive term 
“consciousness of role enactment,” the writer 
prefers the term “accessibility,” or reportability. 
The conscious-unconscious polarity carries with 
it connotations derived from psychoanalytic 
theory which assign motivational properties to 
so-called unconscious contents. Cameron (1947) 
has used the expression “not accessible to one's 
self-reactions," which carries the meaning that 
a person cannot make valid observations of his 
own role enactments. 

This dimension applies to events in which the 
actor of the role reports his actions. One can 
immediately think of roles which are enacted 
automatically and where the person cannot 
recall his actions. The illustration given before 
about the person enacting the role of the cus- 
tomer in the supermarket can be applied here. 
He may perform all the actions necessary to- 
ward the selection and purchase of a wide 
variety of articles but be unable to report his 
actions. Another example is that of the hys- 
terical amnesia victim. In enacting this role 
the patient does not recall events prior to the 
amnestic period. In Archer's (1889) study of 
acting, such a dimension was isolated. Some 
actors reported being relatively "unconscious" 
of self in certain roles, while in others they 
were extremely observant of their own role 
enactments. 

The entire psychology of observation and re- 
port is as applicable to one's own role behavior 
as it is to the role-behavior of others. Research 
hypotheses in this area could profit from the use 
of the observations which the depth psycholo- 
gists have conceptualized as repression and 
suppression. 

In what is commonly called self-conscious- 
ness, the actor attends almost exclusively to his 
own actions rather than to the events occurring 
in the broader social field. The self-conscious 
person can report such events as the quivering 
of his voice, the shal.ing of his knees, the drying 
of his throat, etc, It is obvious that being 
greatly involved in the organismic features of 
the role enactment limits the possibility of ob- 
serving the self as a social object. In general, 
one would expect to find a strong inverse rela- 
tionship between the organismic dimension and 
the accessibility dimension. The two dimen- 
sions, however, should be characterized inde- 
pendently. Events that can be placed on the 
organismic dimension are available to the ob- 
servation of others without the necessity of 
phenomenological report, while the accessibil- 
ity dimension can be applied only to the actor's 
report of his own role behavior. 

This dimension has not been studied em- 
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pirically except in clinical settings and in 
hypnotic experiments. The value of this di- 
mension of role enactment would lie in its 
application to the learning of social roles. Self- 
criticism of one's own actions could follow only 
if these actions were accessible or reportable. 
The therapeutic dictum, “making conscious 
what is unconscious’ to effect behavioral 
changes, refers to the same events as making 
accessible to one’s self-reactions inaccessible 
aspects of one’s role enactments in order to 
modify or eliminate these actions. Bowman 
(1949) has utilized this notion in his descrip- 
tion of the development of insight by virtue 
of practice in role enactment using the methods 
developed by Moreno (1934). 

The as if dimension. It is by now a truism 
that the learning of social roles — in fact the 
learning of any concept — is associated with 
the ability to treat an object or event as if it 
is something else. One has to adopt the as if 
set in order to group apparently disparate 
objects or events into a common concept. A 
full philosophical and logical treatment of the 
as if principle is given in Vaihinger (1924). 

In the if statement, or the conditional clause, 
something unreal, untrue, problematical, or 
fictive is stated. With this. assumptive behavior 
is associated the maintenance of the assumption 
in spite of its fictive character. The mainte- 
nance of the assumption is possible only if 
the organism has developed verbal symbols and 
the capacity to bind time and tension. The 
imaginal processes, fantasy, identification, etc., 
are now seen to depend upon the capacity 
to engage in as if behavior. Between the as 
and the if, grammatically speaking, lies a whole 
sentence. The person must “hold the thought" 
or maintain the set established by the condi- 
tional fictive clause, while at the same time 
considering the independent clause. As stated 
earlier, this conceptual process cannot be per- 
formed if the person lacks the ability to bind 
time and tension, a function of the develop- 
ment of the self. Since roles are organized 
concepts, taking-the-role-of-the-other, then, is 
not possible without the as if ability. 

"This as if behavior can be treated as an apti- 
tude or skill in the same way that we treat 
academic aptitude or skill. Operative in ap- 
parently diverse kinds of conduct, the as if 
formulation has been applied by the author 
(1950) to repression as an ego defense, impuni- 
tiveness as a response to frustration, and hyp- 
notizability as a personality trait. These three 
variables (the "triadic hypothesis") have been 
found to be intercorrelated (Rosenzweig and 
Sarason, 1942). In repression the subject be- 
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haves as if an event threatening to the main- 
tenance of the self-concept has not occurred; 
in impunitiveness the person acts as if the frus- 
trating event were no longer frustrating; in 
hypnosis the person behaves as if certain quali- 
ties and events are different from what they 
actually are. 

The as if skill applies to drama, hypnosis, 
fantasy, play — in fact, to all imaginative be- 
havior, To test whether as if behavior has any 
appreciable effect on the performances of a 
subject, we can observe the overt action of per- 
sons under instructions to behave as if some- 
thing is or is not occurring. Some experiments 
along this line have been reported. Jacobson 
(1938) and Schultz (1932) demonstrated the 
influence of the subject’s imagining certain 
events upon bodily functions. Common ex- 
perience illustrates how implicit imaginal proc- 
esses are reflected in overt, muscular movements 
during the act of imagining. A person can feel 
his ears becoming red and his face becoming 
flushed when he imagines a former embarrass- 
ment. Similarly, he may experience nausea in 
imagining something extremely unpleasant or 
disgusting. 

In an experiment on suggestibility Arnold 
(1946) relates hypnotic and suggestion effects 
to vividness of imagery. Her experiment al- 
lowed the inference that the more vivid the 
imaginative process, the more pronounced the 
overt movement associated with the imagined 
act. In a role-theoretical analysis of hypnotic 
phenomena, Sarbin (1950) has presented some 
evidence to support the hypothesis that enact- 
ing the role of the hypnotized person is a func- 
tion of the veridicality of role perception, 
congruence of the role with the self, and role- 
taking aptitude. The last-named item is de- 
fined as success in the use of as if behavior. 

As in the case of the organismic dimension 
and the accessibility dimension, we have no 
readily available measuring instruments. Of 
necessity we must approach this dimension in- 
directly. Imaginal vividness can be studied only 
with the aid of phenomenological report, and 
we are never too sure about contamination with 
linguistic ability in reporting details of the 
covert response. 

As if skill has been assessed indirectly through 
testing the ability of respondents to predict the 
test-taking behavior of the other. The pro- 
cedure is as follows: Smith fills out a per- 
sonality test. This may be a questionnaire, 
trait list, attitude scale, role inventory, or other 
paper-and-pencil device. Smith, or Smith's 
products, such as a handwriting sample, is ob- 
served by Jones. Then Jones, usually on the 
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basis of a limited sample of Smith's behavior, 
fills out the test as he thinks Smith would, as if 
he were Smith. This test may then be extended 
to Smith's predictions of Jones' predictions of 
Smith. In the first application of the test, 
Smith predicts the other's performance; he takes 
the role of the other vis-à-vis a test. In the 
second application, Smith predicts how he 
(Smith) is perceived by Jones; he takes the 
role of the other, vis-à-vis the self. The in- 
structions to the subject are often stated in as if 
terms. “You are to take this test as if you were 
Mr. Jones,” or “Pretend you are Jones," etc. 

Dymond (1949) published an empathy scale 
based on this method. Each subject rated (1) 
self, (2) others, (3) other's self, and (4) 
other's rating of subject's self, on six traits: 
self-confidence, superior-inferior, selfish-unsel- 
fish, friendly-unfriendly, leader-follower, sense 
of humor. Deviations between criterion and 
predictions were computed. The range of scores 
indicated a dimension of personality. The only 
test of validity was a comparison of empathy 
scores with independent judgments of a per- 
son’s empathic skill on the basis of TAT proto- 
cols. The correlations were satisfactory, al- 
though as Dymond says, “hardly evidence on 
which to state that this is a valid test of em- 
pathic ability” (Dymond, 1949, p. 131). 

Notcutt and Sylva (1951) used a similar 
procedure in developing a prediction test. 
Eighteen traits, taken from Murray's system, 
were scaled and checked by husbands and wives 
tor self and other. The predictions of the other 
were significantly greater than chance. Suc- 
cesses were greater on traits where subjects 
made similar self-ratings. 

Dymond's and Notcutt and Silva's studies 
were criticized by Hastorf and Bender (1952) 
on methodological grounds. They point out 
that no correction is introduced for “projec- 
tion.” In their own study, using the Allport- 
Vernon Study of Values, they obtained two 
scores: an empathy score, which was the dif- 
ference between the prediction of the other's 
responses and the criterion (the other’s actual 
test responses) ; and a projection score, which 
was the difference between the prediction of 
the other's responses and the subject’s own re- 
sponses. In their study of 50 subjects, 20 had 
higher empathic scores (predictions were closer 
to the social objects' actual responses) ; 28 had 
higher projection scores (predictions were 
«loser to own responses). Two subjects had 
equal empathy and projection scores. Thus 
projection was more common than empathy in 
this group of subjects. 

These studies illustrate how empathy — the 
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use of as if in social behavior — may be investi- 
gated experimentally. In a later section, some 
additional studies are reviewed which are aimed 
at the origins of variability in taking-the-role- 
of-the-other. The general thesis is that variabil- 
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ity in role perception, role enactment, and 
taking-the-role-of-the-other is a function of the 
self. To establish this thesis, we now turn to a 
psychological analysis of the self. 


Tue SELF 


More than any other single group, the role 
theorists have developed and used the con- 
ception of the self as an intervening variable. 
The historian of modern social psychology 
could trace — in an unbroken line — the rise 
and development of the self from William 
James, Baldwin, Cooley, and Mead through 
Faris, Park and Burgess, and Kimball Young 
to the present. The thesis central to this group 
of writers was that the self arises out of social 
interaction. The self was seen as predomi- 
nantly, if not exclusively, social. The core of 
Cooley's idea, "Each to each a looking-glass 
reflects the other that doth pass," is seen in the 
theory of selfhood advanced by these sociolo- 
gists. 

Among psychologists, Allport, more than any 
other, has emphasized the centrality of the self 
in his conception of trait organization. His 
book (1937) followed by his challenging paper 
(1943) re-awakened interest in the self among 
psychologically oriented theorists. This interest 
was given additional — though not direct — 
impetus by Lewin (1935) and by such Ges- 
taltists as Koffka (1935). Today, no new 
treatise or textbook on social psychology is 
complete without a discussion of the self. (See, 
for example, Asch, 1952; Hartley and Hartley, 
1952; Krech and Crutchfield, 1948.) 

Probably as a result of the convergence of 
normative infant studies and the observations 
of the psychoanalytic child psychologists, 
Mead's notion of the self as originating solely 
in social interaction gave way to the notion 
that the self has earlier origins. Since the 
trend in role theory is in the study of the inter- 
action of self and role as coordinates and not 
as parallels, it behooves us to give a detailed 
account of the origins, functions, and dimen- 
sions of the self. 

Borrowing from James, Symonds (1951) re- 
gards the self as the object of awareness — the 
ego as the acting, knowing, perceiving object. 
Brewster Smith (1950) regards the self as a 
phenomenal entity, the ego as a construct. 
The present writer has used the term self to 
refer to the phenomenal experience of identity, 
an inference arising from a person's experiences 
with things, body parts, and other persons. 
“The referent for the ‘I’ . . . is the cross sec- 
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tion in the present of the total cognitive 
organization that embraces those more-or-less 
enduring substructures, the empirical selves” 
(Sarbin, 1952a, p. 21). The more cumbersome 
term, self-as-inferred-by-others, has been used 
to refer to the products of behavior analysis. 
For convenience, we may substitute the more 
accessible word ego for the latter term. Thus, 
in the discussion which follows, self refers to 
a phenomenal experience (not all of which can 
be communicated via language) ; ego refers to 
inferences made by psychologists (or others) 
about the more central or enduring psychologi- 
cal characteristics of a person. 

Neither self nor ego is directly observable, 
but inferences can be made about both. The 
person can communicate about the phenomenal 
experience of selfhood by the use of I-sentences, 
for example “I am rugged and powerful." The 
observer can likewise communicate about the 
ego of a person by the use of third-person sen- 
tences in which adjectives, such as dominant, 
self-confident, original, reverent, etc., are the 
significant terms. The inferences about ego 
structure can be derived from phenomenologi- 
cal report or from observations of nonverbal 
behavior, Whether making inferences about 
self or ego, the essential terms used stand for 
qualities, rather than actions. These qualities 
may be conceptualized by observers in several 
ways. The present author, following Allport 
(1987), finds the following useful: traits, atti- 
tudes (including sentiments), and habits. 
These are inferences derived from observing 
actions of persons. The trait is a more general 
quality than an attitude, which in turn is more 
general than the habit. Because of space limi- 
tations we shall omit a treatment of attitude 
and habit and, in general, equate quality and 
trait. 

As a cognitive structure, the self develops out 
of interaction between the organism and stimu- 
lus objects and events. In early life, the “moti- 
vating conditions" which lead to action arise 
out of perturbations in the steady state. In the 
neonate, the steady state is maintained via. 
chemical equilibrium or homeostasis. In the 
process of maintaining equilibrium, trace 
residuals are laid down within the nervous 
system — the earliest cognitions. These trace 
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residuals are integrated within the organism; 
the “steady state" is a more complex affair than 
formerly. Homeostasis now serves a more com- 
plicated organism, although still in an auto- 
matic fashion. The beginning of the perceptual 
constancies reflect the equilibrating function 
of the human organism at a still more complex 
level. Object constancy is an adaptive mechan- 
ism — were it not for this phenomenon, a per- 
son would continually be questioning the 
validity of his perceptions as he encountered 
the same objects under different conditions. In 
the same progression, images may be subject 
to the same constancy mechanism as objects. 
In the face of new information to the contrary, 
a person's image of an object or person remains 
relatively constant. Similarly, concepts remain 
constant under diverse conditions. The con- 
stancy of attitudes under planned attempts to 
change them is a further illustration of con- 
stancy of conceptual structures. 

The application of the equilibratory prin- 
ciple from chemical homeostasis to conceptual 
constancy is extended to all cognitive struc- 
tures. The concept of the self and its sub- 
structures are’ subject to the principle of 
constancy. This is the theme in Lecky's Self- 
Consistency (1945). At this juncture, suffice 
it to say that the self as a cognitive structure in 
general functions according to the constancy 
principle. 

Toward the maintenance of what constant 
state does this principle operate in early life 
prior to the apprehension of agents and ob- 
jects? The answer is in part conjectural, but 
available evidence suggests that the stable state 
of early infancy is made up of somesthetic 
qualities. With an immature neuromuscular 
system, the organism has no way of "knowing" 
agents or objects. It does perceive — primi- 
tively — such qualities as tension and quies- 
cence. Qualities, then, are the first cognitions 
— or, in the older language, the first contents 
of consciousness. Before the dawn of gesture 
or language, residues of qualities are laid down 
and form the nucleus of the self. (Freud, 1935; 
James, 1890) . 

The self, then, is seen as an organization of 
qualities at first unverbalized and unverbaliz- 
able, later verbalized in part by gestural or 
linguistic devices such as naming, self-drawings, 
adjectives, I-sentences, etc. The origins of the 
self lie in the resultants of the interaction 
between two series of events. The first is the 
maturational series; the second, the personal- 
social series. The maturational series of events 
may be assumed to be fairly constant from cul- 
ture group to culture group. Within certain 
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limits of variation, the age at which develop- 
ment of the nervous system is complete, the age 
at which ocular focusing begins, the age at 
which dentition occurs, etc, is the same for 
human infants in all societies. 

"The personal-social series of events is the ma- 
jor determinant of psychological variation. 
Persons enacting roles of mother, nurse, father, 
sibling, and others interact with the child ac- 
cording to prescriptions and  proscriptions 
characteristic of the culture, subject to the mi- 
nor variations of individuals. Of particular 
interest are child-rearing practices and their 
influence on the development of the self. This 
has been the focus of study of personality-and- 
culture in the past decade. Of equal impor- 
tance in assaying the origins and structure of 
the self is the time at which specific child-rear- 
ing practices are introduced or terminated. 
Here the concepts of maturational series and 
personal-social series converge. For example, 
child-rearing practices which lead to prolonged 
feeding frustration in early infancy will have 
different effects on the formation of the self 
than the same kind of frustration introduced 
when the child is at a more mature physiologi- 
cal level. Oblique evidence for this assertion 
comes from an attempt at establishing the rela- 
tionship of time of weaning to adult ego struc- 
ture. Goldman (1950), by means of a 19-trait 
scale, placed her subjects on an optimism-pessi- 
mism continuum. Then she compared the 
scores with the age at which breast feeding was 
terminated. This information was obtained 
from the mothers of the subjects. The results 
indicated that subjects weaned before five 
months were rated more pessimistic than sub- 
jects weaned later. 

Although such studies as Goldman's lend 
support to the importance of both the matura- 
tional and personalsocial series in the forma- 
tion of the self, at the present time most of our 
information about the effects of timing infant- 
rearing practices is analogically derived from 
animal studies (Hunt, 1941; Hunt, et al., 1947) . 

Werner (1948) has supplied a number of 
concepts which may be applicable to the study 
of the self. Among these is the dimension of 
differentiation as it applies to perception and 
to motoric behavior. Time binding and tension 
binding are corollary conceptions with which 
cognitive phenomena may be described. For 
example, the newborn infant may be charac- 
ftrized as undifferentiated in action and per- 
ception, with no ability to bind time or tension. 
The mature adult may be characterized as dif- 
ferentiated in actions and perception and, 
within specified limits, able to tolerate time 
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and tension. These dimensions will be dis- 
cussed in the description of the empirical selves 
which go to make up the self structure. 

Another conception necessary for an under- 
standing of the self and its place in role theory 
is that of fixation. The agents of culture, either 
through intentional training procedures or 
through unwitting performances, may prolong 
the organization of the self at a particular level, 
thus preventing the full development of later 
structures. Werner (1948) has pointed out how 
overspecialization at earlier levels inhibits cre- 
ativity at later levels. Thus if a person were 
fixated, either through overlearning or trauma, 
at a developmental level of the primitive-con- 
strued self, the development of the social self 
would be inhibited, and we would expect the 
person to be incapable of performing certain 
adult roles which call for time binding, tension 
binding, and highly differentiated perceptual- 
motor actions. ; 

The concept "fixation" has been used in 
many ways. Our usage of the term is intended 
to convey the meaning of a relatively perma- 
nent change in the structure of the organism, 
which change follows from perturbations in- 
troduced under conditions where the organism 
cannot differentiate cues. The young infant, 
by virtue of its incomplete maturation, is un- 
able to distinguish the environmental events 
associated with the experienced quality. The 
response, in this case a quality (or feeling), is 
linked to any stimulus event which is in some 
way equivalent to the initial stimulus event. 
(Bearing on this notion are the experiments of 
Stellar and Morgan, 1953, and Fredericson, 
1951.) 

With these notions: the constancy principle, 
the maturational and personalsocial series of 
events, the dimensions of cognition, and fixa- 
tion, the early organization of the self may be 
described. Elsewhere, the author has presented 
a more complete analysis of the origins and 
structure of the self. (Sarbin, 1952). A modified 
presentation is offered here. 

As is the case in all organization in nature 
(Sickles and Hartmann, 1942) , residuals result- 
ing from actions designed to maintain con- 
stancy are organized around foci. These foci of 
cognitive organization we shall call empirical 
selves. The first such focus arises out of the 
infant's automatic responses to imbalances 
introduced by the change from the stable intra- 
uterine environment to the unstable extra- 
uterine environment. Unless imbalances in the 
child's physiology are corrected by the inter- 
vention of other persons, the child cannot 
survive. The efforts of persons engaged in child 
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care are directed almost exclusively toward 
maintaining the physiological equilibrium by 
means of temperature control protection 
against natural hazards, and maintenance of 
the food and water supply. We have called this 
first focus the somatic self (S,), which arises 
from the infant's experiences during the first 
few weeks of life when the primary concern of 
the mother or nurse is the maintenance of the 
infant's soma. This inchoate cognitive struc- 
ture lays the groundwork for differentiation of 
the self from nonself. 

At the beginning of the development of this 
first structure, the infant does not distinguish 
between external objects and the various parts 
of its own body. Differentiating its own body 
from the external world — establishing the 
boundaries of the somatic self — is one of the 
chief activities of early-infancy. The infant 
comes to recognize that different sensations 
result when it strikes objects in the external 
world or when it pinches itself. Single stimu- 
lation occurs in the former instance, double 
stimulation in the latter. The Anlage of the 
postural model of the body is laid down as the 
result of differentiating the effects of this dif- 
ferential stimulation. This is the cognitive 
structure which later develops into the body 
image (Schilder, 1935). Overdevelopment of 
this structure through trauma or through pro- 
longed attention to purely somatic disequilib- 
rium may lead to an overspecialization or 
differentiation of this structure. If this occurs, 
it is likely that pathological behavior will result 
— behavior in which the body image is central. 
Severe hypochondrial behavior, narcissism, and 
somatic delusions are examples of adult con- 
duct where the body image or somatic self is 
overdeveloped. 

Not all imbalances are immediately corrected 
by the child-caring persons. Uncorrected 
homeostatic imbalances lead to tensions in the 
organism, These tensions, such as follow from 
hunger, thirst, accumulation of waste products, 
etc., become part of the stimulus field. Recep- 
tor and effector processes lead to tension re- 
duction. The process appears to be similar to 
"drive discrimination." Of course, this means 
that certain maturational events have occurred 
so that the receptor or effector structures are 
functional. By age three or four months the 
infant can respond to external stimuli, although 
in an undifferentiated way. When a child is 
hungry and is given food, the giver can be 
perceived only as the agent of tension reduc- 
tion, not as an object or person. 

With the maturation of the organism and 
with its increased motility, other kinds of 
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stimuli are mediated by means of the receptor 
organs. At this level of self-organization certain 
events, either external or internal, become asso- 
ciated with tension reduction. 

The main characteristic of the organism at 
this time is the differentiation of qualities with- 
in the organism. These qualities are probably 
bipolar: a quality which we designate tenston 
and a quality which we designate as quies- 
cence. The operation of the homeostatic 
mechanism is aided by the actions of the child- 
rearing agents, the result of these actions being 
tension reduction. We can infer only that the 
child perceives the tension and quiescence as 
diffuse generalized qualities, probably the 
equivalent of “feelings.” Chronologically, the 
infant is under five or six months of age when 
this process begins. There is some casual evi- 
dence that prolonged deprivation leads to 
fixation of qualities, which in turn direct the 
human organism's activities towards tension- 
reduction processes. DuBois’ analysis of the 
Alorese (1941) lends credence to this notion. 
The quality of tension associated with pro- 
longed hunger during this phase of develop- 
ment “fixates” the self-organization around the 
receptor-effector focus. The specialized atti- 
tudes toward food and hunger may be consid- 
ered the residue of this overspecialization. 

Since the receptor and effector processes are 
the focus of the child’s rather undifferentiated 
self-organization, we have labeled this second 
focus the receptor-effector self (abbreviated 
Sə). Around this matrix the infant at the 
primitive deductive level (conditioned re- 
sponse) interacts only with that nonself which 
is instrumental in reducing or eliminating ten- 
sions. No time or tension binding is apparent. 
Perception is syncretic, actions are mainly of 
an undifferentiated character. 

The third level of development we have 
called the primitive construed. self ($3). The 
beginning of this focus of cognitive develop- 
ment is about six months of age. Neuromuscu- 
lar maturation has progressed to the point 
where the infant has begun to acquire some 
control over his movements and to express his 
needs through motor activity. He is no longer 
an entirely passive recipient of the tension- 
reducing activities of nonself agents. His in- 
creased motoric activities bring him into con- 
tact with a larger number of stimulus objects. 
Perceptual development has also become more 
differentiated, and the infant can perceive dif- 
ferences between agents instrumental in tension 
reduction and objects instrumental in tension 
reduction. It is doubtful whether the child can 
perceive differences between objects and per- 
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sons qua persons. Bayley (1933) has shown 
that between the ages of four and six months 
differential vocalization to nonself objects oc- 
curs. After six months of age differential 
vocalization to nonself persons begins (Shirley, 
1933) . 

Motoric development is such that the child 
can express pleasure, eagerness, and displeas- 
ure. To a limited extent he can discriminate 
tone and expression. The criterion by which 
a child distinguishes objects from persons is 
the ever-changing nature of the stimulus prop- 
erties of things. The focus of cognitive develop- 
ment here is the laying down of traces of 
perceptions of persons who are selective, dy- 
namic, and somewhat unpredictable. Baldwin 
described the process as follows: ". . . this is 
the child's very first step toward a sense of the 
qualities which distinguish persons. The sense 
of uncertainty grows stronger and stronger in 
its dealings with persons. A person stands for 
a group of experiences quite unstable in its 
prophetic as it is in its historical meaning. 
This we may, ... in assuming it to be the 
first in order of development, call the 'projec- 
tive stage’ in the growth of the child's con- 
sciousness” (Baldwin, 1902, p. 14). The child's 
cognitive organization must now be able to 
select the more enduring stimulus qualities 
from the more evanescent. 

Tension binding has not yet begun. Avail- 
able evidence indicates that perception of 
stimulus qualities and motoric discharge are 
continuous processes. No vicarious trial and 
error is observed. Because of the immature 
neuromuscular development, the infant can no 
more than cry or show diffuse restlessness if 
tensions are not quickly reduced by other per- 
sons. Now that the infant has some control 
over its muscular activities, he can act out his 
responses immediately. Whether the stimulus 
conditions are external, as in the case of a 
brightly colored object, or internal, as in the 
case of hunger, the infant responds immediately 
by reaching, grasping, shouting, crawling to- 
ward, etc. The young infant is still incapable 
of binding time or binding tension. The ability 
to wait, to tolerate delay between stimulus and 
response, does not develop until the beginnings 
of the next level of selfhood. Extrapolating 
from available data on delayed responses in 
children, it is probable that tolerance of delay 
at this stage is not more than a few seconds. 
£n the child of ten or eleven months, delay, 
followed by responses indicating memory, is 
not more than one minute. At this level the 
infant’s behavior is less syncretic than before; 
he can make discrete responses to specific 
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stimuli selected from the undifferentiated mass 
of stimulation received through the several 
sense modalities. 

At this point we infer that the child has three 
foci of cognitive organization: the somatic self, 
the receptor-effector self, and the primitive- 
construed self. These are in the nature of refer- 
ence schemata. They serve as background 
against which incoming stimuli are perceived 
and responded to. ‘They are trace residuals, and 
are derived from interaction with agents, ob- 
jects, and persons, the interactions originating 
in the preservation of constancy. 

In the fourth quarter of the first year the 
beginnings of a new cognitive structure may 
be inferred from the child’s behavior. ‘The 
focus is the gesture (Mead, 1934). With in- 
creasing experiences and further maturation 
the child can associate gestures with things and 
with persons. Now he can discriminate clearly 
between persons and objects. The substruc- 
tures so far described permit the development 
of a new emerging self which we have called 
the introjecting-extrojecting self (S4). The 
development of motor skills, increased motility, 
postural changes, prehension, etc., increase the 
number of nonself objects and persons that can 
be responded to and organized into the cur- 
rently forming empirical self. Words are dif- 
ferentiated and imitated, and the child can 
actively participate in play. 

Many observers of young children have 
described the development of linguistic be- 
havior in relation to the self. Space limitations 
prevent discussion of the contributions of Pia- 
get, Cooley, Lewis, and others. With language 
symbols the acts of others can be grouped into 
a concept “similar to my own" (extrojection 
or projection). Similarly, one’s own acts can 
be grouped into a concept “similar to others" 
(introjection or identification). This reminds 
us of Baldwin's “dialectic of personal growth” 
(1902). He points out how the reactions of an 
infant to the shifting stimulus properties of 
others become assimilated or "subjective" if 
congruent with the mental structures so far 
developed. The corollary process, "ejective 
consciousness," occurs at about the same time. 
This is the same as extrojection and appears 
to result from a primitive analogical process 
which depends upon the use of language. 

The child's cognitive development is such 
that now he can differentiate not only agents, 
objects, and persons, but he can differentiaté 
the acts of persons, Functionally adaptive re- 
sponses to the differential acts of others become 
the order of the day. At first, the responses of 
others are perceived as approval-isapproval. 


Here we have the prototype of learning by rein- 
forcement. We also observe the beginnings of 
tension binding and time binding. The child 
operating under the aegis of approval-disap- 
proval learns to withhold or inhibit certain 
motoric responses upon the perception of in- 
ternal or external stimuli. The discrimination 
between approved acts and disapproved acts is 
the foundation of the sense of shame. His abil- 
ity to form concepts is limited so that the acts 
of others cannot be organized into roles and, 
of course, he cannot yet take the role of the 
other. 

The fact that the child can cognize acts of 
others and of self into groupings reflects the 
beginning of the socius. Fixation at this level 
leads to behavior which is primarily "other" 
oriented. That is to say, actions must first be 
considered in terms of possible approval or 
disapproval, reward or punishment. Extensive 
development of this aspect of behavior, of this 
empirical self, may inhibit the later develop- 
ment of the role-taking ability. Excessive time 
binding and the perception of discrete acts 
rather than organized. wholes prevent the more 
adaptive, relatively rapid taking-the-role-of-the- 
other. 

With the increase in language development, 
particularly the use of the pronoun "L" the 
child attains a more refined concept than be- 
fore, at the same time acquiring a conventional 
symbol for self-reference. Before this, in order 
to communicate about himself, the child was 
restricted to motoric signs or names which were 
used as others use them. With the development 
of the "I" concept, which includes my, mine, 
me, and myself, the child can quickly and easily 
differentiate self from others. His language de- 
velopment also allows for the grouping of the 
actions of other persons into social roles. Cer- 
tain actions are associated with "mother"; 
other actions are organized into the concept 
"brother." The importance of linguistic phe- 
nomena in concept formation needs no further 
emphasis here save to say that, cognitively 
speaking, roles are concepts. Here the social 
self (Sg) emerges, and its differentiations are 
in terms of roles rather than discrete acts as in 
the case of the just preceding empirical self- 
structure. 

Taking-the-role-of-the-other now becomes a 
part of the perceptual and motor behavior of 
the child. The incorporation of "other" roles 
can readily be seen in his play activities. He 
can participate in playing "let's pretend,” the 
first effective test of the as if skill. 

Linguistic devices are the tools which aid in 
the further development of the social self. The 
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learning of language symbols, particularly those 
that have social referents, aids in the perceiv- 
ing, identifying, and conceptualizing of roles 
and role expectations. 

The child can now assign to the self (the 
referent for the "I") the same terms used by 
others. The assignment of qualitative terms, 
such as good, bad, dirty, unkind, stupid, etc., 
are the "reflected appraisals" of others. A theo- 
retical account of this development is given by 
Kimball Young (1945, 1952) , based on Mead's 
writings. [The work of Piaget (1926) should 
also be mentioned for its important observa- 
tional descriptions of the attainment of con- 
cepts.) In summarizing Mead's viewpoint, 
Cottrell and Gallagher (1941) state: 


If we hold to Mead's view that meaning in- 
heres in the ability to call out implicitly in one's 
self the role which one is calling out explicitly 
in the response of another, we can shift our at- 
tention from the philosophical aspects of the 
process upon which he concentrated, and focus 
instead upon the experiences that limit and frus- 
trate this role-taking ability. (p. 27) 


It is to these "experiences that limit and frus- 
trate this role-taking ability" that research 
should be directed. Below are two studies 
which illustrate the effects of different kinds of 
experience on the social self — one conducted 
in another context, the other within the con- 
text of role theory. 

It is well established that the child with ade- 
quate symbolic development is capable of form- 
ing concepts. If the concept of self is the 
reflection of the evaluative actions of others, 
then we would expect to find differences in 
self-concepts among persons who experience 
different kinds of socialization behaviors. Sears 
(1950), for example, has shown the influence 
of different kinds of maternal child-rearing 
actions on the ego structures of first and second 
children. He reports the results of several in- 
vestigations which converge to show a positive 
correlation. between degree of frustration in 
nursing and weaning and dependency behavior 
during the preschool period. Dependency be- 
havior as studied and defined is a quality of 
the more central aspects of the person inferred 
by an observer, hence it may be labeled an as- 
pect of the ego structure. 

Returning to the constancy principle, we 
have already mentioned how a concept, whether 
of a physical object, a social object, or of the 
self, tends to remain constant. The structure 
of the self develops out of experiences with 
events, objects, and persons. If the further 
development of the social self is a result of the 
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“looking-glass” principle, then we would expect 
different kinds of self-concepts in persons whose 
early socializing interactions were limited to 
one or two parent figures from those whose 
early socializing interactions were more exten- 
sive. The more parent figures in the socializing 
process, the more likely that the common quali- 
tative aspects of the culture will be learned. 

This leads to the testable hypothesis that the 
person whose early socialization is mediated 
by reciprocal role taking with many significant 
persons will have a different self-concept from 
the person whose early socialization was the 
responsibility of one or two parent figures. Fur- 
ther, those with multiple parent arrangements 
will be more socially adjusted than the others 
and will share many common traits. As a class 
exercise, 90 upper division students (30 males, 
60 females) were asked to describe the self on 
a 200-word checklist. They were also required 
to furnish certain personal data, among these 
the number of persons responsible for their 
care and rearing during the first five years. It 
is important to note that the criterion for the 
number of parent figures was left to the dis- 
cretion of the subjects and hence was as phe- 
nomenological as the criterion for determining 
which of the 200 adjectives were self-descriptive. 

The distribution of subjects in regard to the 
number of parent figures was as follows: one 
or two parent figures, 54 (12 males, 42 fe- 
males) ; 3 or more parent figures, 36 (18 males, 
18 females). For each of the four groups thus 
derived, the number of checks for each adjec- 
tive was counted. This was converted into a 
proportion, Where the difference in proportion 
between the two groups of the same sex was at 
least .20, the adjective was retained as charac- 
teristic of the self-conceptions of members of 
that group. 

Although the groups with three or more 
parent figures did not check a significantly 
larger number of adjectives, they did check 
more of the same adjectives, indicating that 
those who have more significant “others” in 
their childhood tend to have more similar self- 
concepts. 

When we examine the quality of the adjec- 
tives characteristic of the groups, we find a 
further demonstration of the influence of the 
number of parent figures on the self-concept. 
Both men and women of the one- or two-parent 
figure classification describe themselves as re- 
Stricted and nonsocial. The members of the 
other classification describe the self in a number 
of adjective clusters: sociable, logical, imagi- 
native, affectionate, and argumentative. These 
tentative conclusions are warranted: persons 
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reared by multiple-parent figures have self- 
conceptions that are: (a) more "social" and 
extended, and (b) more homogeneous than 
persons reared by one- or two-parent figures 
(Sarbin and Van Spanckeren, 1953) . 

Space considerations limit exposition of the 
further development of the social self. The 
reader should not interpret the choice of illus- 
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Although the foregoing is by no means the 
final word on the theory of self-development, it 
reflects the tenor of the times. Within the past 
few years such writers as Rogers (1947), Hil- 
gard (1949), and Newcomb (1950a, b) have 
helped re-establish the concept of the self. With 
this re-establishment has appeared the necessity 
for discovering methods for assessing the self. 

Recapitulating, the self is what the person 
"is," the role is what the person "does." When 
interested in the self, we regard the person as 
an organization of qualities. When we study 
roles, we regard the person as an organization 
of acts. Parenthetically, direct observation re- 
veals only action systems resulting from the 
interplay of self and role. 

The quality is not something that is directly 
observed, as in the case of the act. The quality 
must be inferred from acts, In English syntax, 
the usual form for communicating qualities is 
the adjective, Thus the adjective is a symbol 
which can be used in describing a quality of 
the self. At first glance, this seems too simple 
to describe the complexities of the self-struc- 
ture. Oversimplified as everyday language 
appears to be for describing a phenomenal 
datum, it is, as Allport says (apropos of trait 
names) : 

. several grades more adequate than the 
mathematical symbols and neologisms that psy- 
chologists sometimes employ. The nature of our 
problem forces us to seek out, to identify dynamic 
mental structures and substructures, and to name 
them. And this is necessary even though the lexi- 
con of any language is far from offering a perfect 
catalogue of the elements of mental life... 
(Allport, 1937, p. 310) 


If we are interested in phenomenological re- 
ports, of course, we can record a person's auto- 
biographical statements about the self. The 
uneconomical nature of this method for pur- 
poses other than therapy requires no comment 
here. This method may be abbreviated by the 
use of I-statements, such as in personality in- 
ventories or in attitude questionnaires. Often, 
however, the units of such statements confuse 
acts with qualities. A direct and unconfusing 


trative material to mean that postinfantile 
events have no influence on the development 
of the self. However, the self, as organized in 
the earlier years of life along the dimensions 
described before, serves a selective and directive 
function, thereby placing limits on the number 
and quality of role perceptions and role enact- 
ments. 


FOR ASSESSING THE SELF 


approach to qualitative description of the self 
is the use of an adjective checklist (Gough, 
1950). 

To illustrate the method and its advantages, 
Sarbin and Rosenberg (1952) performed three 
experiments. In the first, the self-conceptions 
of diagnosed neurotics and normal controls 
were compared. In the second, the self-concep- 
tions of men and women students were com- 
pared. In the third, the self-conceptions of 
three religious groups were compared. Only 
the findings of the first experiment will be re- 
viewed here. At least 70 percent of the neurotic 
group checked the following qualities: anxious, 
changeable, confused, cooperative, dissatisfied, 
emotional, gentle, imaginative, immature, nerv- 
ous, self-centered, worrying. At least 70 percent 
of the controls checked these qualities: active, 
adaptable, affectionate, civilized, conscientious, 
considerate, curious, dependable, fair-minded, 
friendly, intelligent, sincere, sociable, tolerant. 

If the postulate is accepted that the self is 
cognized as an organization of qualities and 
adjectives are linguistic devices that enable 
persons to communicate about qualities, then 
the adjective-checking method as illustrated 
here has an important place in social psycho- 
logical research. | 

Another promising method for getting at the 
phenomenal experience of self is the W-A-Y 
technique reported by Bugental and Zelen 
(1950). The respondents are simply asked 
"Who are you?" three times in succession. The 
three responses are then classified according 
to 17 categories such as age, sex, social status, 
name, occupation, etc. Differences in self-con- 
ception are noted for objectively different 
groups; eg. older subjects conceptualize the 
self differently from younger subjects in the 
use of age and social status categories. 

The authors advance some tentative conclu- 
sions as a result of their preliminary investiga- 
tions. Among these conclusions are: the 
Írequent use of the "name" category suggests 
the name as a central feature of the self; the 
personal pronoun category was used more fre- 
quently by women and by older persons; the 
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sex category was more frequently used by 
younger than by older subjects — the authors 
suggest the possible relationship between sex 
identification and psychosexual development; 
the age category differentiated among the 
groups — it was used most frequently by per- 
sons who were “at the apparent peak of physical 
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and social abilities" and less frequently by 
younger and older persons. 

In the review of experiments emerging from 
the theory, still other methods will be de- 
scribed. We should make clear that this is a 
representative but by no means complete cata- 
log of empirical methods for assessing the self. 


SOME EXPERIMENTS EMERGING FROM ROLE THEORY 


The worth of any theory lies in its potential 
for suggesting hypotheses. The pragmatic value 
of role theory is indicated by the kind and 
number of hypotheses generated by it which are 
subject to empirical investigation. Role theory, 
with its emphasis on the self, has stimulated a 
number of researches, some of which are re- 
viewed below. 


TENSION BINDING 


One example of a study stimulated by this 
theory of the self is concerned with the dimen- 
sion of tension binding. Simply stated, tension 
binding is the ability to inhibit a motoric act 
once the stimulus to the act has been perceived. 
The dependence of tension binding and time 
binding on the development of as if behavior 
has already been noted. The ability to bind 
tension makes it possible for a person to inhibit 
inappropriate role behaviors. The dimension 
may be thought of in quantitative terms — cer- 
tain people can inhibit motoric acts for long 
periods of time, others can tolerate the tension 
only for brief periods. It has become cus- 
tomary to refer to those who are deficient in 
this ability to bind tension as undercontrollers, 
that is, as lacking in self-control; and to those 
who bind tension more than objective condi- 
tions warrant as overcontrollers. Clinically, 
persons diagnosed as psychopaths exhibit the 
inability to bind tension; they act on impulse; 
they are unconcerned about the approval or 
disapproval of others. Persons diagnosed as 
neurotic characteristically overbind tension; 
they act compulsively rather than impulsively; 
they are overconcerned about the approval or 
disapproval of others. The overt conduct of 
these classes of persons indicates some directing 
or selective organization at work. 

We consider tliis organization to be the self. 
The undercontrollers, those who cannot bind 
tension, are presumably characterized by a self- 
structure where the primitive construed self 
(S3) is the dominant substructure. The over- 
controllers, those who overbind tension, are 
presumably characterized by a self-structure 
where the introjecting-extrojecting self (Sa) 
is the dominant structure. The characteristics 


of these self-structures as defined allow for the 
prediction of role behavior under various con- 
ditions. 

The tendency to act immediately upon the 
presentation of an annoying stimulus may be 
fulfilled or inhibited. In organized social living 
those tendencies to act are constantly being 
brought to the surface by multifarious annoy- 
ing stimuli. Persons with a low threshold for 
binding tension would respond more frequently 
or more intensively to annoying stimuli, while 
persons who have a high threshold for tension 
binding would respond less frequently or less 
intensively to the same stimuli, Furthermore, 
the state of being annoyed is more easily at- 
tained by persons who cannot delay their 
responses, during which delay vicarious trial- 
and-error (VTE) (Tolman, 1939) behavior is 
possible, the content of the VTE’s being ques- 
tions of approval or disapproval by others. 
Conversely, the state of being annoyed is less 
readily attained by persons who can delay re- 
sponses, during which delay they can vicariously 
assess whether the acts in question will gain 
the approval or disapproval of others. Such 
delay is frequently nonadaptive, in that the 
moment for adaptive action may have passed 
while the extensive VTE behavior is occurring. 
(This is the essence of clinically defined neuro- 
sis, particularly the compulsive type. Between 
these two classes of persons are those who may 
be characterized as "socially adequate tension 
binders” ($5). These are persons who can bind 
tension according to objective demands, who 
require less time for VTE behavior because of 
their use of an organized role-taking approach. 
That is to say, the quick and efficient grouping 
of stimuli can be performed with the aid of a 
concept — in this case a role — rather than the 
more time-consuming approach of applying an 
approval-disapproval criterion to every incipi- 
ent response. 

Gough's roletaking theory of psychopathy 
(1948) (alter Mead, 1934) embraces some of 
the same ideas. He asserts, after a review of 
the literature, that the psychopath is fixated 
in such a manner that he cannot perceive the 
self as an object; he cannot take another per- 
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son’s perspective. This aspect of role taking, 
emphasized by Gough, is substantially the 
same as our tension-binding dimension. When 
frustrated, the psychopath cannot take the time 
to put himself in the role of the other. He 
cannot maintain the as if set. Once an act is 
begun, it is carried through to completion. 

In order to test this hypothesis that psycho- 
paths, or poor tension binders ($5), would be 
the most responsive to annoying stimuli, we 
administered a 172-item annoyance question- 
naire, developed by Harsh (1936), to 60 male 
subjects: 20 diagnosed as psychopaths ($4), 27 
diagnosed as neurotic and representing 5,, and 
13 without psychiatric diagnosis (normals) rep- 
resenting Sg. The annoyance inventory was 
divided into thirteen categories: e.g., sensory 
annoyances, inconsiderateness of others, matters 
of dress, etc. The means of the three groups 
were compared on each of the category scores 
and on total scores. On total mean scores, dif- 
ferences between means of psychopaths and 
normals were significant (P — .01); between 
normals and neurotics P — .001. In twelve of 
the thirteen categories, the magnitude of the 
scores was in the predicted direction, the psy- 
chopaths had the highest mean annoyance 
score, the neurotics the lowest, and the normals 
intermediate (Albrecht and Sarbin, 1952) . 


SKILL IN TAKING-THE-ROLE-OF-THE-OTHER 


Taking-the-role-of-the-other is one of the 
qualities absent in persons fixated at the level 
of the primitive-construed self. If the dominant 
cognitive structure in the psychopath is this 
empirical self (S), then we would predict 
poor scores on tasks involving the participation 
of the social self. The most obvious kind of 
performance to test this hypothesis is an em- 
pathy test such as described earlier in this chap- 
ter. 

Baker (1954) performed an experiment on 
a group of male prisoners in an effort to test 
the hypothesis that psychopaths would be less 
able to empathize with their cellmates than 
nonpsychopaths. All the subjects had shared 
cells with their cellmates at least four weeks. 
Assignment to the category of psychopath was 
affected by the use of three MMPI criteria. The 
procedures were carried out in one sitting. 
Each subject and his cellmate filled out four 
adjective checklists: (A) for self, (B) for cell- 
mate, (C) prediction of self-checks of cellmate, 
and (D) prediction of (B), subjects guesses 
of how he appears to his cellmate. The predic- 
tions made by each subject on checklists (C) 
and (D) were compared with the cellmate's 
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actual choices on checklists (A) and (B). 
"These comparisons provide two objective meas- 
ures of empathy: the first, percent correct pre- 
dictions on list (C) , the second, percent correct 
predictions on list (D). (C)-percent in part 
gets at the ability to perceive traits or qualities 
that a person assigns to the self; (D) -percent in 
part gets at the ability to perceive traits or 
qualities others use in assessing one's self. 

Although the samples are small, 21 psycho- 
paths and 13 nonpsychopaths, the differences on 
(C) -percent and on (D) -percent are significant. 
The reduction in statistical reliability intro- 
duced by a correction for subject-object simi- 
larity did not appreciably alter the findings. 
Small positive correlations between empathy 
tests and age and education were also found. 
As a whole, the experiment devised as a result 
of hypotheses arising from role-taking theory 
verified the predictions. 

Cline (1953) has reported some findings 
which tend to confirm hypotheses arising from 
this theory. By means of a hidden camera and 
microphone, he filmed a number of "job appli- 
cants” in a realistic life situation. These "job 
applicants" will be referred to as social objects 
in the following discussion. Seven of the films 
(11 minutes each) were used in his study of 
the assessment of judges of personality. "These 
films were shown to the subjects of the study, 
who were required to check an adjective check- 
list as they thought the social objects had 
checked it. This is one test of taking-the-role- 
of-the-other. A second test involved making 
postdictions about the social objects. For this 
test, Cline made up a series of multiple-choice 
items on the basis of extensive interviews with 
parents, friends, and associates of the social ob- 
jects. Only items were used upon which there 
was consensual agreement. Here is an example: 
When the subject (seen in the film) is in a 
violent argument he usually (a) becomes very 
sarcastic, (b) uses profanity and obscene words, 
(c) leaves the room (or area), (d) strikes his 
opponent with his fist. 

Using a judging index made up of these two 
tests, which appear to test the ability to take- 
the-role-of-the-other, Cline compared the sub- 
jects in the top and bottom quartiles on a 
number of variables, Among these were scales 
of the MMPI, the California Public Opinion 
Survey, and the Henmon-Nelson intelligence 
scale, 

From our theoretical emphasis on the im- 
portance of early fixation, the prediction would 
follow that a person in whose cognitive organi- 
zation the somatic self (S,) is a dominant 
structure would be less skillful in role taking 
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(S5). Hypochondriasis is a syndrome in which 
the somatic self dominates all the other selves. 
In Cline's analysis, the hypochondriasis scale 
of the MMPI, at a level of confidence better 
than .01, differentiates those scoring high on 
the role-taking tests from those scoring low. 
The hypochondriacs get lower scores on the 
role-taking tests. 

If the assumption that certain kinds of psy- 
chopathology follow from fixation of the self 
at different maturational levels (Sarbin, 1952) 
is granted, then adult schizophrenia is seen to 
be related to the dominance of the receptor- 
effector self (S) and psychopathy to the 
primitive-construed self ($4). Persons scoring 
high on scales which tap schizophrenic or psy- 
chopathic behavior, then, should do poorly on 
role-taking tests which sample behaviors medi- 
ated by the social self (S;). Cline's results 
confirm the theoretical predictions; the schizo- 
phrenic scale and the psychopathic-deviate 
scale of the MMPI, at levels of confidence be- 
tween .01 and .05, differentiate the top and 
bottom quartiles on the role-taking tests in the 
predicted direction. 

Some other variables which differentiate the 
top and bottom quartiles consistently with the 
theory are: intellectual efficiency (MMPI), 
social status (MMPI), prejudice (MMPI), and 
intelligence. Ethnocentrism and Authoritarian- 
ism (E and F scales of the California Public 
Opinion Study) likewise show differences in 
the predicted direction. 

An experiment by Sarbin and Stephenson 
(1952) was also aimed at identifying the charac- 
teristics of good role takers. The hypothesis 
stated that persons characterized as rigid would 
be less able to adopt a role which was incon- 
gruent with the ego structure. Persons who 
were classified as flexible would be able to 
adopt a role not congruent with the ego struc- 
ture. 

On the basis of Rokeach's experiment (1948) 
rigidity and flexibility were defined as high and 
low scores on the ethnocentrism scale. The 
procedure was as follows: Two undergraduate 
classes in psychology provided the subjects (24 
men, 28 women). The University of California 
Public Opinion Survey was administered (this 
includes a political-economic conservatism 
scale, an ethnocentrism scale, and an authori- 
tarianism scale). One week later each member 
of the class filled out the authoritarianism scale 
“as you think a person named Mr. (Miss) Smith 
would take it," Smith is described as an in- 
tensely prejudiced person. Two weeks later the 
same procedure was followed except that the 
role instructions were now of Jones, an equali- 


tarian person. Our predictions were as follows: 
persons scoring high on ethnocentrism would 
more readily enact the congruent authoritarian 
role than the incongruent equalitarian role; 
persons scoring low on ethnocentrism would be 
able to enact both the incongruent authori- 
tarian and the congruent equalitarian roles. 
The analysis confirmed our predictions. Corre- 
lations between scores on ethnocentrism (with- 
out role instructions) and authoritarianism 
with the role instruction to play the part of 
Smith, the authoritarian person, were —.29 for 
males and —.41 for females. The flexible sub- 
jects (low ethnocentrism scorers) were better 
able to change their responses. The correlations 
between the same two scales when the role in- 
struction was to play the part of Jones (the 
equalitarian person) were .65 for males and .78 
for females. The rigid persons (high scorers 
on ethnocentrism) did not change their re- 
sponses under the role instructions in the 
direction of equalitarianism. Taking-the-role- 
of-the-other, an aptitude or skill, and equali- 
tarianism, a cluster of attitudes, appear to be 
related. 

This experiment aimed at ego-siructure 
variables yielded as a by-product a procedure 
for getting at the phenomenal self. All the sub- 
jects regarded the “equalitarian attitude” as a 
quality of the self. That is to say, despite their 
checking items characteristic of a fairly wide 
spread on the authoritarianism scale, few sub- 
jects checked different items when asked to take 
the role of the nonethnocentric person. For 
example, if a subject's score on the first test was 
high (ethnocentric) when taking the equali- 
tarian role, the score remained high. The 
inference from this incidental finding is that 
a discrepancy exists between conception of self 
as a phenomenal object and the ego (inferences 
made about the self by others). From this find- 
ing and from the usual definition of insight as 
consistency of self-evaluation with an outside 
criterion (usually the pooled judgments of 
others), the tentative statement follows that 
ethnocentric persons are deficient in insight 
(cf. Adorno, et al., 1950). If we had used a 
suggestion of Hastorf and Bender (1952) we 
would have found that the ethnocentrics were 
"projectors," and the equalitarians were "em- 
pathizers.” 

The general picture of the empathic person 
and the nonempathic person developed from 
‘these studies is in line with the tentative de- 
scriptions reported by Cottrell and Dymond 
(1949) . TAT protocols were analyzed for high 
or low scorers on the empathy test. Those high 
on empathy were judged as emotionally expres- 
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sive, insightful, outgoing, optimistic, warm, 
interested in others, flexible, with satisfactory 
family relations in childhood. Those who 
scored low on the empathy test were judged as 
rigid, introverted, noninsightful, inhibited 
emotionally, subject to emotional outbursts; 
these subjects experienced difficulty in inter- 
personal relations, mistrusted others, were less 
well integrated and had had unsatisfactory fam- 
ily relations in childhood. 

Another experiment which points to the 
pragmatic value of role-taking theory was con- 
ducted by Gough and Peterson (1952). The 
objective of their study was to develop a per- 
sonality test which would yield predictions of 
antisocial behavior. This objective was a sig- 
nificant one in view of the generally negative 
results of previous attempts to differentiate 
criminals from noncriminals by psychological 
test procedures (Schuessler and Cressey, 1950) . 
Among other things, the role-taking theory of 
psychopathy (Gough, 1948) suggested items 
whose content had no obvious relationship to 
criminal acts but which was associated with a 
self-organization characterized by insensitivity 
to the behavior of others, lack of concern over 
interpersonal relationships, shallow affect, and 
inability to take-the-role-of-the-other (syndrome 
of psychopathy), Like the MMPI, Gough's 
scale is made up predominantly of I-statements 
which enable the investigator to make infer- 
ences about the ego structure. 

The scale (64 items) has been administered 
to reformatory inmates, Army stockade prison- 
ers, behavior-problem adolescents and appropri- 
ate controls. Cross-validational procedures 
have demonstrated the effectiveness of the scale 
in separating known delinquents and criminals 
from normals. 


THE SELF AND THE PERFORMANCE OF 
COGNITIVE TASKS 


Using the same typology as in the study of 
annoyances cited before, Mathews, Hardyck, 
and Sarbin (1953) investigated the influence 
of self-organization in the performance of se- 
lected cognitive tasks, Predictions were made 
as follows: Persons about whom inferences 
could be made that the primitive-construed self 
was relatively dominant would take a set to 
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“act quickly” when presented with tasks which 
have both speed and accuracy instructions 
(Sg). Persons whose self-structures were classi- 
fied as predominantly introjecting-extrojecting 
would take the set "act cautiously” (S4). Per- 
sons whose self-structures were classified as so- 
cial would be more flexible and would 
approach each cognitive problem with a set 
pertinent to the task (S5) . 

The subjects were 31 volunteer students 
classified on the basis of MMPI profiles. If the 
psychopathic deviate and hypomanic scales were 
elevated more than one sigma from the mean, 
the subject was placed in the $5 category; if the 
depression and psychasthenia scales were ele- 
vated more than one sigma from the mean, he 
was placed in the 5, category; if all scores save 
one were within a sigma of the mean, the sub- 
ject was placed in the Sẹ (normal) category. 

Among the tests used was the Sears-Hovland 
motor-conflict apparatus, The reactions of the 
subjects on the conflict trials were recorded as 
(1) responding to both stimuli, a "double" re- 
sponse, (2) responding to one of the two 
conflicting stimuli, "single response," (3) "com- 
promise" between the two stimuli, and (4) 
"blocking" — no action. We predicted that 
the Sy group would be dominated by the action 
set “do something quickly"; the members of the 
S, group “wait, think it over, be cautious" and 
the members of the $5 group, variable. A chi- 
square test significant between .02 and .01 
indicated that our groups behaved differently 
on this test, The S's gave predominantly 
"double" responses (excessively motoric); the 
S,'s gave "block" responses (VTE's) most 
frequently, but they also gave a large number 
of "single" responses. The S,'s gave an ap- 
proximately equal number of "single," 
“double,” and “block” responses. In general, 
the predictions were verified: S's were exces- 
sively motoric, Sa excessively "bound," and 
Ss's variable. 

Thus, predictions of certain cognitive per- 
formances were made from the theory and 
verified by experiment. The implication of 
these findings for role theory is that the quality 
of role enactment, even in the role of a labora- 
tory subject, is a function of the self. 


INTERACTION OF SeLF AND ROLE 


In the first part of this chapter we dealt with 
concepts in which role was central. Insufficient 
to account for complex social psychological be- 
havior by itself, the conception of role was sup- 
plemented with the conception of self. These 


are coordinates, Implied throughout the de- 
scription of each of these ideas is the inter 
action of self and role. Despite the tyranny of 
literary exposition which forced us to consider 
but one notion at a time, we have incidentally 
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demonstrated certain of the interrelations of 
self and role. In this section we deal specifically 
with some of these interactions and their ef- 
fects. Of the many questions which could be 
asked, only the following will be treated. 

1. Since the self is a product of experience, 
what changes in the self follow from the pro- 
longed enactment of a specific role or role 
cluster? 

2. What variations in role enactment are 
attributable to variations in the self? 

3. What happens to the self and/or the role 
when the person is placed in a position in which 
the role expectations of others call for per- 
formances which are incongruent with the self 
(self-role conflicts)? A related question is: 
What happens when the person occupies two 
positions concurrently and the role expectations 
of each are incongruent one with the other 
(role-role conflicts) ? 

4. Can the independent effects of the self- 
and-role interaction be isolated? Or, stated in 
another way: Can self and role as independent 
concepts be demonstrated empirically? 


INFLUENCE OF PROLONGED ROLE 
ENACTMENT ON THE SELF. 


The emphasis in the preceding sections has 
been, perhaps unwittingly, on early develop- 
ment of self and role. Although the foundation 
for the self is laid down early in life, this does 
not mean that residues of experience resulting 
from adult role enactment have no effect on 
the self-organization. Notwithstanding, we 
maintain our historical emphasis: the early 
form of the structure has selective and direc- 
tive properties, The effect of prolonged role 
enactment on the self would be different for, 
say, the person fixated at the level of the primi- 
tive construed self ($4) , and the person whose 
central empirical self-structure was the social 
self (Ss). If we take the psychopath as repre- 
sentative of the S,-fixated person, we would 
predict little effect on the self-structure of his 
adult role enactments because of limited as if 
skills, : 

Merton (1940) has written a logical analysis 
of a problem which bears directly on the sub- 
ject under discussion. In analyzing the social 
structure of bureaucracy, he points up the ap- 
parent depersonalization which occurs as a 
result of prolonged occupancy of a bureaucratic 
position, This depersonalization probably re- 
sults from the necessity of regarding clients and 
personnel as objects. All must be ‘treated ac- 
cording to their classification in the codified 
rules and regulations; there is no flexibility 
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allowed for personal sentiments or extenuating 
circumstances. A quality of the self, rigidity, 
appears to develop out of the daily and exclu- 
sive practice in enacting roles which are clearly 
defined, regularized, and conforming. 

One of the hypotheses suggested by this dis- 
cussion may be stated for later testing: The 
effects on the self of prolonged occupation of 
a bureaucratic position are such that rigidity 
and depersonalization become qualities that 
can be inferred from behavior in other (non- 
occupational) roles. The same hypothesis, in- 
cidentally, may be stated about any other posi- 
tion in the social structure. Waller (1932) has 
advanced some hypotheses arising from his 
analysis of what “teaching does to the teacher." 
Such hypotheses can best be conceptualized as 
effects of role enactment on the self. 


VARIATIONS IN ROLE ENACTMENT AND THE SELF. 


An experiment on self-and-role interaction. 
The conduct of subjects in an age-regression 
experiment in hypnosis is plausibly explained 
by the use of this self-and-role formulation. In 
the usual experiment the hypnotized adult sub- 
ject is instructed to enact the role of a child. 
The performances of many such subjects are 
indeed convincing. The theoretical question 
becomes: Is hypnotic age regression an example 
of play acting (little organismic involvement) 
or genuine as if behavior? The answer, of 
course, depends on the kinds of tests used to 
determine the degree of organismic involve- 
ment in the as if behavior. A crucial test would 
be one in which the subject performed role 
behaviors about which he could have no prior 
knowledge. The Rorschach Inkblots fulfilled 
this requirement. 

The subject could not know from prior ex- 
perience the kind of responses characteristic of 
a three- or six-year-old child. Therefore, if the 
subject, when enacting the role of a three-year- 
old, reports perceptions which in fact are 
characteristic of three-year-olds, then we can 
say that the as if behavior is not of the play- 
acting variety. Sarbin and Farberow (1952) re- 
port an experiment in which adult subjects, 
able to engage in the as if behavior of hypnosis, 
were “regressed” to various childhood levels. 
The results of the experiment proper were 
equivocal; some of the subjects seemed to enact 
the role of the child with remarkable fidelity, 
‘as judged from test performance on the Draw- 
A-Person and from handwriting, speech, pos- 
ture, and other samples of expressive behavior. 
At the same time, the Rorschach performances 
were incompatible with the assigned role. Other 
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subjects did not exhibit the same proficiency 
in overt role enactment, yet their Rorschach 
performances were in the range of the assigned 
age role. The apparent lack of consistency 
could be explained most readily with the aid of 
the concepts of self-perception and role percep- 
tion as inferred from role enactment. 

The Rorschach test lends itself to the task 
of assessing self- and role perceptions. The 
more enduring ego (matrix of self-perceptions) 
is inferred by the experimenters from the struc- 
tural features of the protocols, such as kind and 
number of movement, color, shading, whole, 
detail, etc., responses. The variation in quan- 
tity and time of appearance of these com- 
ponents reflects the level of selfhood. These 
components are called “validating” or ego fac- 
tors. They are inferred by the experimenter 
from the reported object perceptions. 

Role perceptions, on the other hand, may be 
inferred from the role behaviors as seen in the 
content of the Rorschach, (e.g., “It looks like 
a bunny-rabbit") , from verbal and vocal charac- 
teristics, and from mannerisms and gestural 
responses. The Draw-A-Person test also serves 
as a test of role enactment. “These expressive 
features of behavior are the vehicles of commu- 
nication; it is by the use of such expressive 
behaviors that the person makes known to 
others his current role. We shall call these 
aspects of [role] behavior expressive or periph- 
eral factors” (pp. 119-120). 

The performances of two subjects were used 
as illustrations of the thesis of the interaction 
of self and role. Thus, one subject’s ego is rela- 
tively labile — the validating features of her 
Rorschach performances were actually like 
those of average three- and six-year-old chil- 
dren. We would infer, then, that her self- 
perceptions shifted with the requirements of 
each role. Her role enactments, however, were 
deficient as evidenced by the various expressive 
measures. Clinical history revealed a relatively 
immature person who had not yet solved the 
adolescent problem of independence from 
mother. “Her self-perceptions were those ap- 
propriate to à person occupying the position 
of a young child in our society. . . . She was 
deficient in enacting the role, . . . probably as 
a result of the fixation of the self at an imma- 
ture level, thus allowing for fewer opportunities 
for moving through the usual sequence of social 
positions with the accompanying role behavior. 
We can infer, then, that this subject's ego is 
relatively unstable (or labile) and her role 
perceptions are relatively invariant" (Sarbin 

and Farberow, pp. 123-124) . 

The second illustrative subject showed a dif- 
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ferent pattern of self- and role perceptions. 
The validating feature of the Rorschach shows 
a relatively stable self-concept, in spite of the 
different role behaviors performed. Unlike the 
other subject, she had solved the problem of 
autonomy. "She has been married and leads 
an active social life. Under the various role as- 
signments, the validating features changed but 
little. The expressive features, however, showed 
gross changes as she moved from one age role 
to another. The drawing test results, the con- 
tent analysis, the childlike postures, etc., re- 
vealed flexibility in role enactment" (p. 124). 

The data seem to make most sense when 
placed in a system of coordinates (Fig. 2). Role 
enactment is one axis, the termini of which are 
flexibility and rigidity. The self is the other 
axis, the termini of which are stability and 
lability. 

In the two illustrative cases, the first subject's 
conduct is seen as the result of the interaction 
of relatively rigid role enactments with a labile 
self-concept; the second subject's responses are 
the product of the interaction of flexible role 
enactments with a stable self-concept. The 
inference does not follow that these continua 
are reciprocal. Another subject (x) showed a 
labile self-concept and a marked flexibility of 
role enacument. (This subject was later diag- 
nosed as a psychopathic delinquent) The 
fourth quadrant (stable self, rigid role enact- 
ment) would contain such persons as compul- 
sive neurotics. 

Summarizing, the apparent inconsistencies in 
enacting experimentally devised roles are recon- 
ciled when the perceptual and the motoric 
behaviors are plotted on two coordinates: 
stability-lability of the self and rigidity-flexibil- 
ity of role enactment. The latter is obviously 
an aspect of the number-of-roles dimension; 
the former is related to, if not identical with, 
the differentiation dimension of cognition 
(Werner, 1948) as applied to the self. This 
study supports Murphy's apt statement: "Per. 
sonality is in considerable degree a matter of 
role behavior; even more, however, it is a mat- 
ter of role perception and of self-perception 
in the light of the role” (1947, 1. 559). 

The use of self-and-role in personality scal- 
ing. Even such a prosaic, although important, 
topic as the prediction of college achievement 
is subject to analysis by means of the concepts 
heretofore outlined. Included in the college 
student's role expectations are certain actions 
such as going to classes, listening to lectures, 
writing exams, organizing abstract material, 
using the library, etc., and certain qualities 
such as friendly, cooperative, good-natured, etc. 
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If the actions and qualities which comprise this 
role are congruent with the self-concept of a 
particular person, then there is a high proba- 
bility that he will perform according to the 
role expectations of the professor and other 
members of the college community. 

Gough (1952) has devised a personality 
scale which gets at certain aspects of the self- 
concept. From the score on this scale, one can 
make gross predictions about the role enact- 
ment of any particular student qua student. 

A logical analysis of the 36 items in the 
scale (most of which are L-statements) reveals 
traits of seriousness and self-confidence, con- 
sistent attitudes favoring intellectual events, 
and efficient scholastic habits. Since the self 
is a cognitive organization of traits, attitudes, 
and habits, it is appropriate to designate this 
scale as a part measure of the self. Since the 
expectations of the occupant of the position 
of student are known, we can predict the 
"goodness" or validity of the role enactment 
from this part knowledge of the self. 


CONFLICT 

The structure of conflict between self and 
incongruent role or between two incompatible 
roles is the ultimate source of literary pro- 
ductions, notably the tragedy form. In daily 


life such conflicts appear frequently but usually 
in less intense forms than those described by 
the novelist, We shall present here a brief social 
psychological analysis of the structure of con- 
flict, utilizing the concepts introduced before. 

Self-role conflict. 'To begin, we refer again 
to the principle of constancy, especially as it 
applies to cognition. A cognitive structure 
tends to maintain its organization despite forces 
directed toward changing it. The application 
of this idea to the self has already been made. 
The same principle applies to substructures — 
differentiations of the broader structure — in 
this case systems of role expectations. When 
the broader system — the qualities that make 
up the self — direct the organism to one set of 
role enactments, and the narrower system — 
the acts and qualities that make up the role 
expectations — are incongruent or disjunctive, 
then we have conflict. This is the quintessence 
of the moral dilemma. 

Coincident with conflict are perturbations 
within the organism, phenomenally experi- 
enced as anxiety. The stable organismic state 
is disrupted. To re-establish constancy the 
person behaves in ways which make for the 
least change in either structure. These con- 
stancy-maintaining behaviors have been con- 
ceptualized as ego-defense mechanisms (Anna 
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Freud, 1937) , techniques of adjustment (Cam: 
eron, 1947) , defenses against anxiety (Fenichel, 
1946), and psychodynamic mechanisms (Ro- 
senzweig, 1944). Repression, projection, 
negativism, isolation, wish-fulfilling fantasy, 
displacement, rationalization, compartmentali- 
zation, overcompensation, etc., are forms of 
behavior designed to maintain constancy of the 
self while at the same time allowing the ful- 
fillment of role expectations. Pertinent ex- 
amples may be cited from an exploratory study 
carried out by Burchard (1953). Interested in 
the institution of military chaplaincy, he in- 
terviewed a number of men engaged in per- 
forming the role of military chaplain. He 
discovered several such self-maintaining mech- 
anisms at work in the effort to reconcile the 
qualities and actions expected of the military 
officer and the qualities of the self — such 
qualities being brotherly love, abhorrence of 
violence, "turn the other cheek," and other 
traits defined by Christian codes. Among the 
mechanisms used to reconcile this conflict were: 


Rationalization: “Someone has to carry the 
gospel to these boys.” 
Gompartmentalization: "Render therefore 
unto Caesar the things which are Caesar's; 
and unto God the things that are God's." 
Repression: "I don't see any conflict." 
Negativism: "I'd rather not talk about it.” 


Another study which demonstrates the use 
of self-maintaining mechanisms in the enact- 
ment of an incongruent role was reported by 
Cousins (1951). A task involving as if partici- 
pation in a contrived situation around a moral 
dilemma was presented to the subjects. He 
used a questionnaire-type approach in which 
the subject was to take the role of a student 
monitor pledged to enforce an unpopular and 
overly rigorous curfew rule. The conflict oc- 
curs when the subject has to choose between 
two role enactments: to report or not to report 
two student violators to the authorities, After 
reaching a decision, the subjects were in- 
structed to select three reasons for their deci- 
sions from among a list of six reasons. The 
following is a generalized statement of the find- 
ings: Persons who did not fulfill the role ex- 
pectations as established by the hypothetical 
society used self-maintaining mechanisms more 
frequently in explaining or justifying their con- 
duct than did the subjects who enacted the 
prescribed role. That is, they were more likely 
to select ego-defensive items as reasons for their 
decision. The specific items under considera- 
tion were rationalization, displacement, and 
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wish-fulfilling fantasy. "Those who enacted the 
role according to, the prescribed role expecta- 
tions attributed the responsibility for the de- 
cision to the self; those who enacted the role 
contrary to the prescribed role expectations 
attributed the responsibility away from the 
self. 

Role-role conflict. In the two above-men- 
tioned illustrations the persons under con- 
sideration, chaplains and college students, 
presumably were characterized by self-structures 
which had enabled them to fulfill socially- 
approved roles. The speculation follows that 
the self-structures for most of these subjects 
were adequately maintained by such constancy 
mechanisms, These self-maintaining mechan- 
isms may be characterized as "preferred modes 
of cognizing.” 

Under certain conditions these preferred 
modes of cognizing may become functionally 
autonomous (Allport, 1937) and organized 
into the existing self-structure as trait systems. 
Neurotic behavior is often explained as the 
overdetermined utilization of such self-main- 
taining mechanisms. That is to say, all roles 
are enacted in such a manner that the "neu- 
rotic" qualities of the role enactment stand out. 

When the person has few or no self-maintain- 
ing mechanisms or if, when enacting roles in: 
congruent with the self, the use of such 
mechanisms fails to re-establish constancy, then 
we have the preconditions for psychosis. The 
self is ultimately maintained, but the extreme 
form that the constancy mechanism takes leads 
to autistic, invalid, and desocialized role enact- 
ments. 

"The second type of social psychological con- 
flict occurs when two substructures of the social 
self — two systems of role expectations — are 
incongruent. Warren (1949b) has written a 
series of propositions relating social disorgani- 
zation and roles. One covariant of disorganiza- 
tion is the proportion of role clusters that 
contains incompatible roles. Of course, in à 
perfectly organized society, such conflicts or 
incompatibilities would be impossible. Since 
societies are not perfectly organized, institu- 
tionalized procedures are developed for recon- 
ciling or resolving conflicting role enactments. 
The effect of such procedures is to maintain 
the constancy of the social structure. 

A role is a partstructure of the social self. 
Part-structures, like other structures, have in- 
ternal organization which is subject to the 
constancy principle. "Therefore, a role will 
tend to maintain itself in the face of events 
which threaten to disorganize it, Since the role 
enactment is public, the techniques by which 
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role constancy is maintained have social sanc- 
tion and are more directly taught and learned 
than is the case with the self-maintaining tech- 
niques described above. 

The procedures for handling such role-role 
conflicts are of two broad kinds (Toby, 1952). 
The first is by the use of a hierarchization prin- 
ciple; the second is by the use of a segregation 
principle. In the first, the person faced with 
the role-role conflict utilizes the widely shared 
fact that certain role obligations have priority 
over others. For example, reporting for Selec- 
tive Service has priority over reporting for 
work; attending grandmother's funeral has pri- 
ority over school attendance, etc. However, 
there are other conflicts in role expectations 
which are not so clearly hierarchized in the 
social system. To maintain the preferred of two 
roles, a person seeks an excuse. That is, he 
seeks others’ acceptance of his failure to dis- 
charge the obligations of a role by declaring 
that the competing role had a hierarchical pri- 
ority. The excuse (whether supported by con- 
firming evidence or not) is accepted as a way 
out of role-role conflicts and maintains the per- 
son's statuses. 

Rules of etiquette make use of the hierarchi- 
zation principle. Rituals are established for the 
orderly enactment of roles where the rights and 
obligations may be idiosyncratic. For example, 
the order of entering a doorway is prescribed 
by etiquette so that the woman goes first. Who 
is introduced to whom is likewise prescribed. 
Who has the right-of-way on the highway is 
codified, The occupant of an age-sex position, 
e.g, elderly lady, has priority over the occupant 
of another age-sex position, young man, in tak- 
ing a seat on a crowded street car. These are 
extensions of the use of the hierachization prin- 
ciple. A 

The principle of segregation for handling 
role conflicts follows from the widely shared 
knowledge that two incompatible roles, if seg- 
regated by a time interval, may be validly 
enacted (Toby, 1952). It is a mechanism for 
maintaining role constancy. A person may oc- 
cupy a position which embraces two conflicting 
sets of rights and obligations. The role of labor 
relations counselor is an example. He must 
serve two opposing sets of role expectations. 
A solution to the conflict is found by enacting 
the subroles separately, Such techniques as 
stalling, playing off one group against the 
other, repudiation of one subrole, leading a 
double life, etc., are cited by Toby as examples 
of the use of the segregation principle in resolv- 
ing role conflicts. 

The role-role conflict formulation is central 


to the study of conflicting social norms. Its 
applicability to the marginal man concept, for 
instance, is patent. An empirical method based 
on the theoretical system of Parsons (1949) has 
been contributed by Stouffer (1949) and ex- 
tended by Stouffer and Toby (1951) for the 
study of role-role conflicts. In the experiment 
reported by Stouffer and Toby, a forced-choice 
type of questionnaire was used to get at role 
expectations in a number of conflicting situa- 
tions. The subject had to choose between the 
role of friend and the role of member of the 
wider society. In the first instance, the orienta- 
tion is particularistic, in the second, univer- 
salistic. 

The results of this investigation point un- 
mistakably to a predisposition among subjects 
to choose one form of role enactment over the 
other. The predisposition to be particularistic 
rather than universalistic in the choice of role 
enactment can be considered a trait. As such, 
we are dealing with a quality of the self. This 
underscores the thesis that valid predictions of 
the conduct of others must take into account 
aspects of ego structure as well as knowledge of 
competing systems of role expectations. 


A METHOD FOR ASSESSING INDEPENDENT 
EFFECTS OF SELF AND ROLE 


To the observer or behavior analyst, a person 
is seen as a complex entity enacting a role 
(Parsons and Shils, 1951) . Phenomenologically, 
the person is a unity. If the concepts of self and 
role are to be justified, their experimental 
separation should be assayed. 

It is to this task, the experimental separation 
of self and role, that we addressed ourselves in 
a large-scale study (Sarbin and Kamiya, 1952) . 
We enlisted the services of 14 volunteers from 
an advanced class in psychology, 7 men and 7 
women. These persons will be referred to as 
social objects in the description of the experi- 
ment. Each of these social objects enacted three 
roles. None of the roles was contrived, but 
emerged from the interaction situation. In the 
first role, the social object interacted with a 
like-sex peer (hereinafter called the auxiliary 
ego, after Moreno). In the second role, the 
social object interacted with an opposite-sex 
peer. In the third role, the social object inter- 
acted with an older male authority figure. The 
auxiliary egos were standard; that is, the same 
fersons played the "other" roles vis-à-vis the 
social objects. The task was simply "to get ac- 
quainted." The interaction could be charac- 
terized as bland and nonthreatening. All the 
social objects were from the same psychology 
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course, so the task given them was not unrepre- 
sentative. The persons in the interaction situa- 
tions were observed and judged by college 
sophomores from a large lecture class in intro- 
ductory psychology, N = 526. 

Each group of subjects made judgments 
about one male and one female social object. 
The schedules were arranged so that no group 
of judges observed interaction situations in 
which the same auxiliary ego appeared. The 
instructions to the judges were simply to ob- 
serve the interaction and to record personality 
judgments characteristic of the social object 
on a Personality Word Card, a 200-item adjec- 
tive checklist. In this manner we obtained 
qualitative judgments of 14 persons in 3 roles. 
The design for males is given below. The design 
for the females was identical. In the interest 
of brevity, the descriptions of the experiment 
will be confined to the male objects. 


SociaL Osyects (Mates) 


FETT 
p| | 
a 


I. vis à vis male peer 


II. vis à vis female Peer 


Roles 


dE 
action 


Positions 


III. vis à vis older male |z 


Egos 


In each cell are listed the 15 adjectives most 
commonly checked by the judging group. In 
cell x, for example, are listed the adjectives 
most frequently checked by the judges observ- 
ing social object 4 enacting position I, male 
with male peer. In y are listed the 15 adjectives 
most frequently checked by an equated group 
of judges who observed the same social object A 
enacting position II, male with female peer. In 
cell z are listed 15 adjectives most frequently 
checked by an equated group of judges after 
observing social object 4 in position III, male 
with older male. Thus each cell contains the 
common adjective judgments. The horizontal 
dimension represents the effect of position, the 
vertical represents the effect of the self. 

The next step was to find adjectives common 
to each of the social objects; by counting adjec- 
tives which appeared in the enactment of the 
three roles (the vertical dimension), there 
emerged a qualitative description of person con- 
sistency. This is, in a limited way, a description 
of the ego. For example, social object B was 
seen most frequently by each of the three groups 

of judges as quiet, sincere, calm, modest, co- 
operative. Social object C was seen as good- 
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natured, cooperative, well-mannered, friendly, 
cheerful, informal, pleasant, masculine, sincere, 
natural. 

This distillation of the columns provided us 
with the construct, person consistency or ego. 
By counting the common adjectives on the 
horizontal axis, we constructed a qualitative 
description of the role. A number of analyses 
were made, a portion of which are herein de- 
scribed. All the social objects are seen with 
some consistency in the enactment of these 
limited roles. There is a dispersion around a 
mean. Some social objects have but 5 adjectives 
which are common to the three roles, others as 
many as 13. 

The role dimension provides some interesting 
findings. Even though verbalized requirements 
of the positions were identical, the qualitative 
role behaviors of the subjects were sufficiently 
different to show up in a frequency count of 
the rows. For at least 5 of the 7 male social 
objects, the following adjectives survived the 
subtractive analysis for each of three roles. 


Rore I Rote II Rote III 
INFORMAL WELL-MAN- AMBITIOUS 
HONEST NERED frank 
SINCERE INTELLIGENT cheerful 
natural sociable cooperative 
frank natural good-natured 
sociable friendly pleasant 
friendly cheerful 
cooperative pleasant 
good-natured good-natured 
cheerful cooperative 
pleasant 


Legend: lower case, adjectives common to three 
roles; italics, adjectives common to two roles; capi- 
tals, adjectives characteristic of one role. 


ee 


The common adjectives for the three roles are 
in lower case letters. As would be expected 
from the similarities in the requirements of 
each position, the role enactments are seen as a 
cluster of qualities which we call “good-na- 
tured.” This is one of the generalized role 
expectations in the middle-class student com- 
munity. 

The differences in the roles are interesting. 
When the male social objects were interacting 
with a likesex peer, the qualities informal, 
honest, and sincere were inferred by the judges 
from the role enactments of the 7 social ob- 
jects. When enacting the role visd-vis the 
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opposite-sex peer, the three qualities just listed 
do not appear regularly. Instead the qualities 
well-mannered and intelligent survive the 
counting process. In the third role, where the 
social object appears with an older male, only 
one adjective meets the criterion of communal- 
ity without appearing as a quality of the other 
roles, ambitious. In addition, sociable and 
natural are qualities common to the roles en- 
acted vis-à-vis peers, and frank to the roles en- 
acted vis-à-vis males. 

In this exercise we have shown how role and 
ego (here defined as observed person consis- 


tency) can be empirically determined. Even 
though the verbalized requirements of the posi- 
tion were identical, certain qualities emerged 
from the differential role enactment, probably 
due to age and sex of the other. 

Two further ideas emerged from this study 
besides those already mentioned. One is the 
utility of the adjective-checking method for 
assessing the qualitative aspects of role enact- 
ment, The second is the utility of the design 
which attempts to deal qualitatively with five 
variables simultaneously, persons (social ob- 
jects), positions, roles, egos, and interactions. 


FiNAL STATEMENT 


Persons occupy positions or statuses in inter- 
actional situations, Psychologically-considered, 
positions are cognitive systems of role expecta- 
tions, products of learning. Role expectations 
are bidimensional, for every role expectation 
of other there is a reciprocal role expectation of 
sell. The organized actions of the person, di- 
rected toward fulfilling these role expectations, 
comprise the role. Variation in role enactment 
is a function of at least three variables: 

l. the validity of role perception (this im- 
plies the concurrent or just-prior perception 
and locating of the position of other and re- 
ciprocally of self) ; 

2. skill in role enactment (related to practice 
in the use of as if behavior) ; 

3. the current organization of the self — a 
cognitive structure that exercises a selective and 
directive effect on role perception and role 
enactment. 


In the body of this chapter, we have tried to 
detail the main conceptions, wherever possible 
referring to empirical studies. As is usually the 
case with experimentation at a molar level, the 
rigor of the animal laboratory is absent. None- 
theless, these studies warrant inferences which 
give at least initial credence to predictions aris- 
ing from theory. A few years ago, only impres- 
sionistic and anecdotal materials were available 
to document the self-role-interaction scheme of 
human conduct. In fact, Kluckhohn and Mur- 
ray (1948, p. 268) were constrained to say: ". . . 
in spite of the obvious importance of role train- 
ing and role playing for personality formation, 
this subject has barely been opened up at an 
empirical level" 'The number of empirical 
investigations cited in this chapter indicate 
that the subject has "opened up," that since 
1948 social psychologists in increasing numbers 
have employed self, role, and interaction as key 
conceptions. 
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Research Methods 


CHAPTER 7 


Experiments: Their Planning and Execution 
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Psychologists do research because they seek 
answers to problems or because they wish to test 
answers that have been suggested. For ex- 
ample, suppose we pose this problem: Is it bet- 
ter to present one or both sides of a controver- 
sial issue, if we desire to shift attitudes in one 
direction only? A search through the available 
literature may reveal that (1) the question has 
not been considered before; that is, no answers 
have been suggested; (2) the question has been 
considered, but more than one answer has been 
suggested; (3) the question has been considered 
and there is agreement among authorities as 
to the answer. 

In case a problem has not been previously 
considered, research might be undertaken to de- 
termine an answer to the problem. If a prob- 
lem has been considered, but various answers 
have been suggested, then research might be un- 


dertaken to determine which of the various an- 
swers is most adequate under specified condi- 
tions. 

Even when authorities are agreed upon a sin- 
gle answer to a problem this does not necessarily 
mean that the answer is correct. Authorities 
are not infallible. How did they arrive at the 
answer they offer? If the answer is based upon 
speculation and theory it must be granted that 
these do not provide as sound a basis for an an- 
swer as might be desired. 

Speculation is valuable and may provide good 
hunches, but hunches are still hunches and 
must be regarded as tentative answers. The- 
ory is also valuable in raising hypotheses or 
tentative answers to problems. ‘Tentative an- 
swers can become definite answers, however, 
only when tested by empirical research. 


CLASSIFICATION OF PSYCHOLOGICAL RESEARCH 


Research is done to seek an answer to a ques- 
tion or to a problem, or to test answers which 
have been suggested. A convenient classifica- 
tion of psychological research has been provided 
by Edwards and Cronbach (1952) in terms of 
the kinds of problems which psychologists raise. 
These authors propose that psychological re- 
search may be classified as (1) survey research, 
(2) technique research, (3) applied research, 
and (4) critical research. 


SURVEY RESEARCH 


Survey research is concerned with the rather 
general problem of what variables will be rele- 
vant to or relate to another given variable. Sur- 
vey research is exploratory in nature and is 
most useful in the preliminary stages of investi- 
gation of a particular problem. This is the 
time at which the psychologist is relatively ig- 
norant of the variables that may be of signifi- 
cance, 

Suppose a psychologist is interested in deter- 
mining what variables are relevant to opinions 
held on a given issue. He does not know, for 


example, whether the reasons people give for 
holding the opinions they hold will be relevant 
or not. The same reasons may be given by those 
holding opposed opinions. If this is true, then 
the psychologist cannot very well say that the 
reasons given are relevant to the opinions held. 
On the other hand, if it is found that reasons 
given are in some way related to the opinions 
held, then this variable may be considered rele- 
vant to the problem. 

Similarly, survey research might investigate 
whether political party affiliation, socioeco- 
nomic status, educational level, sex, intellectual 
level, personality differences, and so forth, are 
in any way related to opinions held on a defined 
issue, Numerous examples of this kind of sur- 
vey research may be found in Cantril (1944) . 

The hunches or tentative answers guiding the 
psychologist in his selection of the variables to 
be investigated in survey research are unusually 
vague. The variables observed may be deter- 
mined to a large extent by the hope that if a 
sufficiently large number are investigated, some 
of them may prove to be relevant. 
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TECHNIQUE RESEARCH 


Technique research is concerned with the 
problem of the method or technique to be used 
in the making of observations. It may be de- 
sired, for example, to do research upon attitudes 
on some issue. The question arises as to the 
method by which attitudes are to be studied. If 
techniques or methods are not available for 
making observations of this variable, they must 
be developed before the research can go ahead. 
Examples of technique research may be found 
in the work of Thurstone and Chave (1929), 
Guttman (1947), Likert (1932), and Edwards 

and Kilpatrick (1948). These studies are all 
concerned with the development of methods of 
measuring social attitudes. 

When several methods or techniques of ob- 
serving a particular variable are available, tech- 
nique research may be concerned with a com- 
parison of these methods. Technique research, 
in this instance, may provide answers to such 
questions as: Which method has the greatest 
reliability? Which method is the simplest to 
use? To what extent do the various methods 
yield comparable results? The work of Edwards 
and Kenney (1946) involving a comparison of 
the Likert (1932) and Thurstone and Chave 
(1929) methods of attitude scale construction 
is an example of this kind of technique research. 
The comparisons of different psychological scal- 
ing methods by Hevner (1930), Saffr (1937), 
and Attneave (1949) are other examples of 
technique research. 


APPLIED RESEARCH 


Applied research is concerned with problems 
of a practical administrative nature. Suppose 
we are confronted with the problem of choos- 
ing among several methods for changing atti- 
tudes in a desired direction. Which method 
will best attain the objective? Research may 
provide us with an answer and thus guide our 
choice. 

Several methods may be proposed for the se- 
lection of leaders in a group situation, Which 
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of these methods gives the best results? Which 
method will select the largest number of good 
leaders and the smallest number of poor lead- 
ers? If research shows that the several methods 
are equally effective, then administrative action 
may be taken in terms of the relative costs of the 
methods, or in terms of the simplicity of the 
methods, 


CRITICAL RESEARCH 


Critical research is research designed to test a 
hypothesis stemming from theory. A certain 
theory suggests that the outcome of the research 
should be in a certain direction. We have, in 
other words, from prior considerations, some be- 
lief that if the research is done, it should yield 
a particular answer. The research is then done 
in order to determine whether the outcome is 
compatible with theoretical expectations. 


OVERLAPPING OF CLASSES 


Although psychological research may be clas- 
sified as survey, technique, applied, and critical 
research, it should be recognized that there may 
be a great deal of overlapping among the classes. 
Survey research may have practical or applied 
aspects, once it is completed, Examination of 
the results of survey research may provide a 
basis for administrative decisions, although this 
was not contemplated when the research was 
planned. Technique research may also have 
applied aspects, and applied research may have 
theoretical as well as practical implications. 

If we ask the question, however, as to why 
a particular piece of research is to be done, 
prior to the actual research itself, we may then 
find that the motivation for doing the research 
enables us to place it in one or another of the 
four classes described, It is being done to dis- 
cover relevant variables, or to investigate a 
method of making observations, or to determine 
what action should be taken, or to test some 


hypothesis suggested by theoretical considera- 
tions. 


EXPERIMENTS AND EXPERIMENTATION 


Experiments are particular ways of making 
observations. An experiment differs from other 
research methods in that the experimenter has 
some degree of control over the variables in- 
volved and the conditions under which the 
variables are observed. ‘ 

To determine whether socioeconomic status 
is related to opinion on a given issue does not 
involve experimentation, but the answer can 
be determined by survey research or the making 


e 


of pertinent observations. In this problem, the 
investigator is studying two variables, socio- 
economic status and opinions held, as they exist 
in the group under observation. The investi- 
gator has no control over the socioeconomic 
status of the subjects nor of their opinions. 
Socioeconomic status is already fixed for each 
subject; it cannot be manipulated or changed 
directly by the research worker. Similarly with 
respect to opinions. The opinions that are 
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held by the members of the group are the opin- 
ions that are held. The investigator can only 
determine what these opinions are and then 
see whether they are related to socioeconomic 
status. 

In an experiment, however, the experimenter 
has control over certain variables. These vari- 
ables are called the independent variables. In- 
dependent variables are variables which the ex- 
perimenter himself manipulates or changes. As 
the independent variables are changed, the ex- 
perimenter observes other variables to see 
whether they are related to the changes intro- 
duced. These variables are called the depend- 
ent variables. The problem of the experiment 
is to determine the relationship between the 
independent and dependent variables. 

An experiment might be conducted to deter- 
mine whether a certain skill, as measured by a 
psychological test, can be most effectively taught 
by means of a given film strip or by means of a 
specified lecture. The film strip or lecture would 
be the independent variable and the perform- 
ance of the subjects, as measured by the test, 
would be the dependent variable. The experi- 
ment is designed to determine if there is any re- 
lationship between the dependent variable 
(performance) and the independent variable 
(film strip versus lecture). The control exercised 
by the experimenter has to do with the inde- 
pendent variable. He can determine the condi- 
tions under which the independent variable is 
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to be introduced and he can control other fac- 
tors that might influence the dependent vari- 
able. 

The advantages of making observations un- 
der controlled conditions over observations 
made without such control have been pointed 
out by Woodworth (1938, p. 2): 


1. The experimenter makes the event happen 
at a certain time and place and so is fully pre- 
pared to make an accurate observation. 

2. Controlled conditions being known condi- 
tions, the experimenter can set up his experiment 
a second time and repeat the observation; and, 
what is very important in view of the social 
nature of scientific investigation, he can report 
his conditions so that another experimenter can 
duplicate them and check the data. 

$. The experimenter can systematically vary 
the conditions and note the concomitant varia- 
tion in the results. 


It was customary in the earlier days of experi- 
mentation to believe that the experimenter 
must hold all variables (independent) constant 
except one. The “rule of the single variable” 
was formulated before more efficient statistical 
methods and experimental designs had been de- 
veloped. In modern experiments, with the sta- 
tistical tools now available, it is possible to 
handle several independent variables in the 
same design and also to have as many dependent 
variables as seem necessary. Experimental de- 
signs of this sort will be discussed later. 


'Trsrs OF SIGNIFICANCE 


Since experiments are one of the methods of 
research, it should be clear that experiments are 
designed to yield answers to problems. The 
statement of objectives of an experiment is a 
statement of the question or problems which the 
experiment is planned to answer. 

It is sometimes said that experiments are 
planned in order to test hypotheses. This is true 
in the broad sense that a hypothesis is a state- 
ment of the possible nature of the relationship 
between two or more variables. It is a tentative 
answer to the question of what the relationship 
is. Now, if a hypothesis is regarded as a state- 
ment of the relationship between two or more 
variables, and if variables are defined as the 
things which the research worker observes, then 
it is perfectly obvious that the test of a hypothe- 
sis must involve the making of observations. In 
other words, we make observations of each of 
the variables involved in the hypothesis and 
then determine whether the relationship is of 
the nature stated by the hypothesis. 

The hypotheses or tentative answers concern- 
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ing relationships which the experimenter has 
prior to the making of observations under the 
experimental conditions can be expressed more 
precisely in statistical form, once the observa- 
tions have been made. Outcomes of experi- 
ments are usually expressed in terms of means, 
standard deviations, proportions, correlation co- 
efficients, and other statistics. "These statistics 
summarize the results of the experiment in a 
convenient form. P 
A test of significance can then be made in 
which the statistic expressing the results of the 
experiment is compared with a theoretical dis- 
tribution of results to be expected under a 
specified hypothesis. This hypothesis is often 
called a null hypothesis (Fisher, 1942) . Theo- 
retical distributions frequently used in tests of 
significance include the ¢ distribution, the F dis- 
tribution, and the y? distribution. On the 
basis of the observations made during the course 
of the experiment, an observed value of t, F, or 
y2 may be computed. This observed value is 
then compared with the available tables (Fisher 
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and Yates, 1949) of these values to determine 
the probability of obtaining the observed value 
under the hypothesis being tested. 
The test of significance thus results in a prob- 
ability value. When the probability given by 
the test of significance is small, say .05 or less, 
then we say that the obtained value is signifi- 
cant. When the probability given by the test 
of significance is large, say greater than .05, we 
say that the obtained value of t, F, or y? is not 
significant. The important point is that we not 
only need to know the outcome of the experi- 
ment, expressed in some statistic such as a mean, 
correlation coefficient, standard deviation, or 
variance, but that we often find it desirable to 
evaluate the outcome in terms of some test of 
significance, 


ONE- AND TWO-TAILED TESTS OF SIGNIFICANCE 


The nature of the test of significance will de- 
pend upon the hypothesis formulated by the 
experimenter prior to the actual experiment. 
We may distinguish between two major cases. 
If the experimenter's hypothesis concerns the 
direction of the relationship to be found, the 
results will be evaluated in terms of a one-tailed 
test of significance. The experimenter, in this 
instance, is interested only in determining 
whether the results are in the direction specified 
by the hypothesis. On the other hand, the ex- 
perimenter may be interested in determining 
whether any relationship exists, regardless of its 
direction. In this instance, the outcome of the 
experiment would be evaluated by means of a 
two-tailed test of significance. 

One- and two-tailed tests of significance are 
discussed in terms of psychological experiments 
by Jones (1952) and Marks (1951). The terms 
one- and two-tailed tests refer to whether one 
or both tails of the theoretical distributions, 
discussed previously, are to be used in arriving 
at the probability value. 

Consider the case, for example, where the re- 
sults of an experiment are expressed in terms of 
the correlation coefficient between a dependent 
and an independent variable. The correlation 
coefficient is a measure of the relationship be- 
tween two variables when they are linearly 
related. It can range in value from 1.00, indi- 
cating a perfect positive relationship, through 
zero, indicating no relationship, to -1.00, indi- 
cating a perfect negative relationship. Suppose 
the observed value of the correlation coefficient 
as determined by the observations made in the 
experiment is .48. 

If the experimenter had started out with the 
hypothesis that the relationship between the 
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two variables should be positive, the observed 
value of the correlation coefficient would be 
evaluated by means of a one-tailed test of sig- 
nificance. The experimenter, in this case, is 
interested in determining whether he may con- 
clude that the relationship is significantly 

eater than zero in a positive direction. On the 
other hand, if the experimenter merely wanted 
to determine whether any relationship existed 
between the two variables, i.e., whether a posi- 
tive or a negative relationship existed, the ob- 
served value of the correlation coefficient would 
be evaluated by means of a two-tailed test of sig- 
nificance. In essence, the experimenter is inter- 
ested in determining whether the correlation co- 
efficient differs significantly from zero in either 
a positive or a negative direction. 

Consider another example, in which one 
group of subjects is tested under experimental 
condition A and another group of subjects is 
tested under experimental condition B. The 
independent variable here is the experimental 
condition, and the dependent variable is a meas- 
ure of performance of the subjects under the ex- 
perimental conditions. The outcome of the 
experiment is summarized in terms of the means 
of the groups on the dependent variable along 
with a measure of the variation in performance, 
ie., the standard deviation or its square, the 
variance. 

If the experimenter, prior to the experiment, 
had formulated a hypothesis that the mean of, 
say, the A group should be higher than the 
mean of the B group, then the difference be- 
tween the means would be evaluated by means 
of a one-tailed test of significance. In essence, 
the experimenter is stating that the average per- 
formance should be positively related to the A 
condition. He is interested in determining 
whether the mean of the 4 group is signifi- 
cantly greater than the mean of the B group. 
On the other hand, if the experimenter had no 
prior hypothesis as to which group should be 
superior, the difference between the means 
would be evaluated in terms of a two-tailed test 
of significance, In essence, the experimenter is 
interested in determining whether the relation- 
ship is such that either the 4 group will be 
superior to the B group, or vice versa. 


TYPE I AND TYPE II ERRORS 


The hypothesis tested may be true in fact, but 
the test of significance may erroneously result 
in the rejection of the hypothesis. For example, 
the experimenter may be testing the null hypo- 
thesis that two means do not differ significantly, 
and he is prepared to reject this hypothesis on 
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the basis of a two-tailed test of significance. If 
it should actually be true that the two means 
do not differ, but the test of significance results 
in the rejection of the null hypothesis, this is 
called a type I error. On the other hand, the 
two means may actually differ, but the test of 
significance may fail to result in the rejection of 
the null hypothesis. Acceptance of the null 
hypothesis under this condition is called a type 
II error. 

It is obviously desirable that any given test of 
significance result in the rejection of very few 
true hypotheses and, at the same time, result in 
the rejection of as many false hypotheses as pos- 
sible. The probability of rejection of a true 
hypothesis can be set by the experimenter. For 
example, if the experimenter decides to view a 
probability value of .05 as a basis for rejecting 
the hypothesis tested, then the frequency of type 
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I errors will be 5 in 100. If the experimenter 
demands a probability of .01 before rejecting the 
hypothesis, then the frequency of type I errors 
will be 1 in 100. It should be clear that if the 
experimenter demands an extremely small prob- 
ability before rejecting a hypothesis, the result 
will be a decrease in the type I errors he makes. 
Yet, at the same time, he will increase the fre- 
quency of type II errors he makes. The proba- 
bility values that are commonly used in research 
work are those of .05 and .01. 

Type I and type II errors are discussed in 
greater detail from the point of view of statis- 
tical considerations by Hoel (1947), Mood 
(1950) , and Johnson (1949) . Less technical dis- 
cussions can be found in Walker (1943), 
Tippett (1941), Cochran and Cox (1950) , and 
Marks (1951). See also Operating Characteris- 
tics or Power Curves in Chapter 8. 


VARIABLES IN PSYCHOLOGICAL RESEARCH 


The variables which psychologists observe in 
research may be represented as response vari- 
ables, stimulus variables, and organismic vari- 
ables. These three terms are used in a broad 
sense. Response, for example, is used to desig- 
nate any kind of behavior or action of the or- 
ganism. Some responses may be very simple in 
nature, such as a finger flexion, and others may 
be highly complex, such as the behavior we call 
aggression. Stimulus variables also range in 
complexity. In some experiments the stimulus 
may be of a relatively simple kind, such as a 
flash of light. In other experiments the stimu- 
lus may represent a complex social situation. 

Any property or characteristic of the organ- 
ism may, in a given experiment, be called an 
organismic variable. A clear-cut example of 
an organismic variable would be height or 
weight, neither of which seems properly to fit 
within the class of stimulus or response vari- 
ables. Other organismic variables may be des- 
ignated upon the basis of prior experience of 
the individuals. Subjects in an experiment, for 
example, may be classified into groups in terms 
of their degree of prior education. Sometimes 
an experiment may incorporate into its design 
an organismic variable based upon some previ- 
ous observation of behavior of the subjects. For 
example, it may be useful in some experiments 
to classify subjects upon the basis of their pres- 
ent attitudes toward some issue, or on person- 
ality traits. This classification, in turn, might 
be based upon prior observation of a response 
variable, i.e, scores upon an attitude scale or 
a personality inventory. In all such cases as 
those described, the organismic variable is an 


independent variable. It is one of the factors 
introduced into the design by the selection or 
classification of subjects by the experimenter. 

In terms of these three kinds of variables, we 
may attempt a classification of the specific prob- 
lems of research which the psychologist under- 
takes. Introductory texts in general psychology 
usually state the principle that 


R = f (0, S), 


which means that response or behavior is be- 
lieved to be a function both of properties of 
the organism and of the stimulus situation. 
But not all research is concerned with the 
formulation R — f (O, E). Technique research, 
as described earlier, is often concerned with a 
single variable and the problem of recording or 
measuring that variable more precisely. It is 
conceivable that other psychological research 
problems may be concerned with any two or 
more of the three kinds of variables considered 


‘in all possible permutations (Edwards, 1950b) . 


As a single illustrative example, consider the 
case where the research interest is in relating 
socioeconomic status to educational level. This 
might be considered as an instance of an O-O 
research problem. That which differentiates 
the social psychologist from other psychologists 
is his interest in O, S, and R variables that are 
social in nature. 

One of the gravest tasks which confronts the 
experimenter is the defining of the specific vari- 
ables involved in a problem. For research pur- 
poses, variables must be defined in terms of 
observations or operations upon observations. 


* * 
* 
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Research and experimentation consist of at- 
tempting to relate two or more such sets of 
observations (variables). Consequently, vari- 
ables must be defined in terms of the observa- 
tions made and related. A variable can mean 
no more than the set of observations or opera- 
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tions upon the observations which define it. 
Hypotheses, it may be recalled, are tentative 
answers concerning the relationships between 
variables which are to be tested by research and 
experimentation. 


QUANTIFICATION OF VARIABLES 


The problem confronting the experimenter 
in the primary stages of planning his work is 
that of determining how and in what way the 
variables he intends to work with are to be 
observed and recorded. It is the aim of all 
branches of science to seek quantification of 
observations. Quantification of observations is 
a step toward quantification of the relationships 
between variables. As the formulation R — 
f(O,S) indicates, it is desired to express be- 
havior as some mathematical function of various 
organismic and stimulus variables. 

With some variables, the problem of quantifi- 
cation is relatively simple. Various techniques 
have been devised and constructed for the 
measurement of variables with which the psy- 
chologist deals. Suppose that in an experiment 
the stimulus variable consists of a set of weights 
which are to be lifted and judged in compari- 
son with some standard weight. The weights 
themselves can be measured on a physical con- 
tinuum, i.e., the quantification of the stimulus 
variable is a simple matter of determining the 
actual weight of each stimulus. But the prob- 
lem of quantification of the response variable, 
ie, the judgment made by the subject, is much 
more complex. Suppose the standard stimulus 
has been assigned a middle value in the series 
of weights and the subjects are asked to rate 
the degree of weight of the comparison stimuli 
in terms of a nine-point scale extending on 
both sides of the standard. No physical con- 
tinuum exists or is known for the series of 
judgments or responses obtained from the sub- 
jects. 

i In this instance, however, quantification can 
be obtained on a psychological continuum by 
means of various techniques of psychological 
scaling which have been developed (Guilford, 
1938; Attneave, 1949; Edwards and Thurstone, 
1952. See also Chapter 9). Psychological scal- 
ing methods have been given a great deal of 
attention by research workers and it is now 
possible to scale or measure judgments with a 
rather high degree of accuracy and reliability. 
These methods are of particular value when 
physical measurement of a variable is not 


possible. 


Psychological tests represent another valuable 
source of quantification for response and or- 
ganismic variables of various kinds. A vast 
number of psychological tests exist for the pur- 
pose of quantifying skills, abilities, personality 
attributes, and other variables. Psychological 
tests of this nature are described by Goode- 
nough (1949) and Cronbach (1949). The 
methodology of test construction, as described 
by Gulliksen (1950) , has been highly developed 
and can be applied to research problems for 
the construction of new devices or instruments 
for quantifying behavior where none already 
exist, 

Rating methods, described by Guilford 
(1936), are another valuable set of devices for 
the research worker. These methods may be 
used for a vast number of problems in which 
it is desired to obtain a numerical value for 
some complex behavior which can be judged 
by observers but not otherwise measured. Rank- 
ing methods may also be used for this purpose. 
Kendall (1948) has described statistical meth- 
ods appropriate for the analysis of ranked 
observations. For further discussion see Non- 
parametric Methods in Chapter 8. 

As important as the defining of variables is 
the problem of the reliability of the observa- 
tions themselves, By reliability is meant the 
extent to which we would obtain comparable 
observations in the same circumstances. If two 
equally trained observers cannot rate the be- 
havior of subjects in an experiment in such a 
way that their ratings agree, we say that the 
rating system is unreliable or has a low degree 
of reliability. Obviously, with unreliable ob- 
servations, we cannot hope to have much faith 
in any relationships established for this set of 
observations with some other variable. Experi- 
mentation and research demand that the 
observations made be reliable. 

Physical measurements possess a high de- 
gree of reliability, and psychological tests can 
usually be so constructed that the scores ob- 
tained from them are reliable. Various tech- 
niques have been devised for increasing the 
reliability of ratings (Guilford, 1936) . 
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SELECTION OF SUBJECTS 


In discussing selection of subjects to be used 
in an experiment, we shall consider two cases: 
one where the experimental design involves an 
organismic variable as an independent variable 
and the other where it does not. 

If we took a group of unselected college stu- 
dents and measured them with respect to almost 
any variable, we would find that they showed 
considerable individual differences or variation. 
They would differ with respect to height, 
weight, abilities, attitudes, skills, previous ex- 
perience, interests, socioeconomic status, traits, 
habits, and so on. A measure of this variation 
or of individual differences would be given by 
the standard deviation or by the square of the 
standard deviation, called the variance, of the 
distribution of observations. The arithmetic 
mean of the distribution would give a charac- 
teristic value which we assume represents the 
typical member of the group. 

Now if we took another group of unselected 
subjects from the population at large, we could 
also find the standard deviation of their distri- 
bution of observations which would be a meas- 
ure of their variation. And we could find the 
arithmetic mean of the distribution, which we 
again may assume would represent the typical 
member of this unselected group. In compar- 
ing the means and standard deviations obtained 
from our two groups, we should probably arrive 
at two conclusions, regardless of the nature of 
the particular observations made. The groups 
would differ with respect to their arithmetic 
means, but the direction of this difference 
would not necessarily be systematic. We might 
expect the college group: to have a higher mean 
than the unselected group from the general 
population with respect to abilities, intellectual 
level, socioeconomic status, and on any other 
variable that is positively related to educa- 
tional status. But on some of the variables 
which may be negatively related to educational 
status, we would find that the college group 
has a lower mean than the group from the 
general population. 

With respect to the standard deviations, how- 
ever, it would be safe to predict that, in gen- 
eral, the college group would show less varia- 
tion than the unselected group from the general 
population. For example, the college group 
would vary less in height and in age than the 
unselected group, and this would be true for 
almost any other variable that we might name. 

Now, when an experiment is done with a 
sample or group of subjects and certain rela- 
tionships are found between the variables 


investigated, it is usually the desire of the ex- 
perimenter to be able to generalize his results 
beyond the particular group which has been 
studied. If a certain result is found when male 
college students are subjects, to what extent may 
we expect this result to hold true with female 
subjects? With high-school subjects? With older 
subjects? Strictly speaking, it is possible to 
generalize beyond a sample only to other groups 
that are comparable to the group investigated. 
Arid comparable, in this instance, means with 
respect to the organismic variables that charac- 
terize the group investigated. 

If the group investigated consists of subjects 
who are above average or below average or all 
of approximately average level with respect to 
some organismic variable, then we cannot be 
sure, without further investigation, that the 
results obtained will hold for other groups of 
different levels of the organismic variable. The 
organismic variables may be important inde- 
pendent variables in the sense that they are 
related to or influence the results observed with 
respect to the dependent variables. 

The problem here is similar to that found 
in experimental work in other branches of 
science. A well-planned experiment in agri- 
culture, for example, may show that a specified 
amount of fertilizer applied to units of land in 
a section of Iowa results in greater production 
of corn than any other specified amount of the 
fertilizer tested in the experiment. The varying 
amounts of fertilizer, in this experiment, cor- 
respond to the independent variable, and the 
amount of corn yield is the dependent variable. 
The land units to which the fertilizer is applied 
correspond to subjects in a psychological ex- 
periment. If the land units are homogeneous 
and differ from the land units that might be 
found in Nebraska or in another section of 
Iowa, there is no way to generalize the results 
obtained from this experiment to what might 
be expected in these other areas. Soil dif- 
ferences may make up important independent 
variables in the sense that they are related to 
the results obtained from the applications of 
the different amounts of the fertilizer. A par- 
ticular amount of fertilizer may work very well 
with certain kinds of land and have little or no 
effect when applied to other kinds of land. 

The potential errors involved in generalizing 
béyond a sample must obviously be kept in 
mind when attempting to apply the results 
obtained in a socio-psychological experiment 
with subjects from one culture to subjects from 
another culture. 
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The physiological processes of the human or- 
ganism appear to be much the same regardless 
of whether the organism being studied is found 
among the Central Eskimo, the African Bantu, 
or the Australian Bushmen. An experiment 
which shows that a certain drug depresses or 
facilitates some physiological process in Ameri- 
can subjects may be expected to result in similar 
reaction for subjects from other cultures. Is it 
possible that psychological processes such as 
perception, reasoning, learning and remember- 
ing, and so on, are also comparable among 
members of different cultures? If that were 
true, then it would be possible to project the 
results obtained from psychological experi- 
ments dealing with these processes to the mem- 
bers of cultures different from the one in which 
the experiment was originally conducted. The 
evidence on this point is surely not complete 
or decisive at this stage of psychological de- 
velopment, for relatively little research has been 
done on the problem of comparing psychologi- 
cal processes between members of different cul- 
tures. 

The evidence available at the present time, 
however, does not indicate any reason to be- 
lieve that the learning process of children and 
adults in the Northern United States differs 
from the learning process of children and adults 
in other sections of the United States. And, in 
general, this is true of other psychological 
processes. Hence, we may tentatively suggest 
that results of experiments relating to psycho- 
logical processes will be applicable among 
all members of a given culture. But only addi- 
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tional experimentation can give a decisive 
answer to the problem. 

It is often possible to incorporate into the 
formal experimental design one or more po- 
tentially relevant organismic variables. Thus, 
if age is a variable that we have reason to be- 
lieve may be related to the observations of the 
dependent variable, the experimenter may take 
steps to see that subjects from different age 
levels are used in the experiment. If there is 
reason to believe that sex classification may be 
related to the outcomes, both men and women 
may be used as subjects. If sex is found not 
to be an important independent variable, then 
it is possible to generalize the results of the 
experiment for both sexes. On the other hand, 
if sex is found to relate to the outcomes of the 
experiment, we shall have two answers to the 
question investigated: one answer for males 
and another for females. 

The importance of including organismic 
variables in the experimental design is that if 
these variables are not found to be relevant, 
the results of the experiment can be generalized 
on a much broader basis. They will be more 
generally applicable than would otherwise be 
the case. If the organismic variables do turn 
out to be relevant, then the experimenter will 
be in a position to suggest different answers 
to the problem he set out to investigate, de- 
pending upon the nature of the organismic 
variable involved. Further consideration will 
be given to this problem in a later section on 
experimental design. 


NUMBER OF SUBJECTS 


Let us assume that an experiment has reached 
the planning stage where the experimenter 
must give consideration to the number of sub- 
jects to be investigated or tested. How many 
subjects shall he use? There is no simple answer 
to this question and only suggestive answers can 
be given in terms of the objective of the inves- 
tigation. If the objective of a given experiment 
is to be able to state rather precisely the degree 
of relationship between two variables, then it 
can be said that, in general, the experiment will 
require a larger number of subjects than would 
be the case if the objective were merely to de- 
termine whether any relationship at all exists 
between the two variables. 

In the first case we are interested in estimat- 
ing a parameter, the exact degree of relation- 
ship, within rather narrow limits. In the second 
case, we are interested in determining whether 
the parameter is greater than zero. In general, 


the amount of variability to be expected in 
estimates of parameters is a function of the size 
of the sample, i.e., the number of subjects in- 
vestigated. 

If the relationship that exists between the 
two variables is high, as measured, let us say, 
by a correlation coefficient, then it will require 
fewer observations to determine whether the 
relationship is significantly greater than zero 
than would be the case if the relationship is 
low or small. If the objective is merely to de- 
termine whether the relationship is significantly 
greater than zero, the number of subjects re- 
quired will depend upon the degree of relation- 
ship that does exist. On the other hand, if we 
want to estimate the degree of the relationship 
rather precisely, a larger number of subjects 
will be required, regardless of whether the 
exact relationship is high or low. 

In prediction studies, where the relationship 
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between one or more independent variables and 
a dependent variable is studied and the results 
are then to be used for predicting performance 
of subjects not yet tested, a fairly large number 
of subjects is required. The reason is that the 
relationship found in the original study is used 
in the form of an equation to make predictions. 
If the relationship is not estimated rather ex- 
actly, the resulting predictions will be in greater 
error than would be the case with a more exact 
estimate. By means of statistical methods it is 
possible to estimate the number of subjects 
required to estimate parameters with a given 
degree of accuracy. 

Consider another experiment, in which the 
objective is to determine whether the average 
measure on the dependent variable for one 
group is significantly higher than the average 
measure for another group on the dependent 
variable. In general, the number of subjects 
required in an experiment of this sort will de- 
pend upon the variability in performance of 
the subjects within each group and also upon 
the magnitude of the difference found between 
the two means. If we assume that the dif- 
ference between the means remains constant, 
then the greater the variability in performance 
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of the subjects within each group, the larger 
the number of subjects required in order to 
determine whether the difference between the 
two means is significant. If we can assume that 
the variability of performance remains con- 
stant within each group, then the greater the 
difference between the observed mean perform- 
ance the fewer the number of subjects that will 
be required to detect this difference. 

There are methods available for estimating 
the number of subjects required in an experi- 
ment of this sort. They require that we have 
available some estimates of the magnitude of 
the differences to be observed in mean perform- 
ance and also some estimates of the degree of 
variability in performance to be expected. It is 
possible, also, that we may not have any in- 
terest in a difference unless it is of some speci- 
fied magnitude. Given some estimate of 
variability of performance, it is then possible 
to estimate the number of subjects required in 
order to draw some conclusion concerning the 
value specified (Harris, Horvitz, and Mood, 
1948). For further discussion of sample size 
see Operating Characteristics, and the Sign 
Test in Chapter 8. 


ANALYSIS OF VARIANCE 


Consider the situation in which N subjects 
are divided at random into r groups of n sub- 
jects each. Assume that these subjects are all 
tested under exactly the same conditions, i.e., 
we have no independent variable operating, 
but only a measure of performance X. It is a 
typical finding that the observations made upon 
the subjects will vary. This variation can be 
measured by expressing each observation as a 
deviation from the mean m of all N observa- 
tions, squaring the resulting deviations, and 
summing for all subjects. We shall call this 
the total variation or total sum of squares. This 
sum of squares may be written 


I ve $ (X — m). (1) 


The total sum of squares can, in turn, be 
analyzed into two component parts: one part 
will be based upon the variation of the indi- 
vidual n observations within each set about the 
set mean X. This component of the total sum 
of squares measures the variation within groups 
about the group means and we may call it the 
sum of squares within groups. The second com- 
ponent of the total sum of squares will be based 
upon the variation of the r group means X 


about the mean m of all N observations. This 
component measures the variation in the group 
means and we shall call it the sum of squares 
between groups. The sum of squares within 
groups and between groups may be written 


Suse SS xx) (2) 
11 
T — 
Xx,2 = Xn (X — m)?. (3) 
1 
Then 
Ix? = Dx py? + Dxp2. (4) 


Dividing the sum of squares between groups 
and the sum of squares within groups by the 
appropriate number of degrees of freedom, 
* — l and nr — r, respectively, we obtain the 
mean squares, 5? and 5,2, shown in (5) and 


(6). 


Ex? 
West? e 
t 
Ex? 
So? = RESCH (6) 


The analysis of a total sum of squares into 
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a number of component parts, each with its 
own appropriate degrees of freedom, along with 
the tests of significance that may be made, is 
known as the analysis of variance. In the simple 
case described above, it can be shown that s? 
is an estimate of the variance of the individual 
measures, c?, plus n times the variance of the 
means, ngg?, whereas Ze? is an estimate of the 
variance of the individual measures only, c?. 
These two estimates may be written 
S? > o2+ noz? 


(7) 


2 —> g2, (8) 


Sw 


‘TEST OF SIGNIFICANCE 


In a typical experimental design, the r 
groups of subjects are tested under different 
experimental conditions constituting the inde- 
pendent variable. The experimenter's interest 
is in determining whether the experimental 
conditions relate to differences in mean per- 
formance of the subjects, ie, whether the 
means may be considered to differ significantly. 
The test of significance is accomplished by 
comparing 5,2, a measure of variation based 
upon the sum of squares between groups, i.e., 
variation based upon the average performance 
of groups of subjects treated differently, with 
Sw, a measure of variation based upon the sum 
of squares within groups, i.e., variation based 
upon the performance of groups of subjects 
treated alike. 

The logic of the test of significance is de- 
termined by the consideration that if subjects 
have been randomly assigned to each of the r 
groups, and if the subjects have all been tested 
under exactly the same conditions, then the 
variation in the r group means should be the 
result of the process of randomization or chance 
factors only. On the other hand, if the sub- 
jects have been randomly assigned to the r 
groups and the groups have been tested under 
different experimental conditions, and if these 
are related to mean performance, then the 


FACTORS CONTRIBUTING TO 


We shall now consider some additional prob- 
lems in terms of the factors that may contribute 
to the variation in performance of the subjects, 
and we shall see whether the influence of these 
factors is primarily upon the variation within 
groups (experimental error) or the variations 
between groups (experimental conditions). It 
should be obvious that if there are any factors 
contributing to the variation between groups, 


» 
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variation of the group means will be increased, 
ie, have a component which reflects the ex- 
perimental conditions as well as random or 
chance factors. This component is indicated 
by the second term on the right in (7). Under 
conditions of random sampling and with no 
systematic differences in experimental condi- 
tions operating to influence the group means, 
the second term on the right in (7) will tend 
toward zero. Thus, on the average, the ratio 
552/s,? should approach unity. 

Suppose, however, that one of the experi- 
mental conditions operates systematically in 
such a way as to increase average performance, 
and that another experimental condition oper- 
ates in such a way as to decrease average per- 
formance. Then, obviously, the various group 
means will differ more among themselves than 
would be the case if the experimental condi- 
tions contributed nothing to the mean per- 
formance. At the same time, if the influence 
of an experimental condition is systematic, i.e., 
if, in general, it tends either to increase per- 
formance or to decrease performance but not 
both, then the experimental condition will not 
tend to increase the variation within the group 
of subjects tested under this condition. Only 
in the rather atypical case where the experi- 
mental condition tends to increase the perform- 
ance of some subjects and to decrease the 
performance of other subjects would the ex- 
perimental condition contribute to the varia- 
tion within groups. (For further discussion see 
Transformations in Chapter 8.) 

In this typical experimental design, with n 
subjects assigned at random to r groups, the 
variation within groups provides a basis for 
evaluating the variation between groups. The 
variation within groups, Sw, is often called 
experimental error (Edwards, 1950b; Cochran 
and Cox, 1950; Fisher, 1942; Snedecor, 1946) . 
The ratio, 5,?/s,,2, is the familiar F ratio, tabled 
by Snedecor (1946). In general, the larger the 
value of F, the smaller the probability that it is 
the result of chance, under the hypothesis being 
tested. 


VARIATION BETWEEN GROUPS 


in addition to the experimental conditions 
themselves, then no clear interpretation of the 
significance of the experimental conditions 
will be possible. Any such factors will have 
to be controlled or randomized throughout all 
groups. In addition, it should be obvious that 
any factor which contributes to the variation 
within groups will make it more difficult to 
detect a significant variation in the group 
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means. The measure of variation within groups, 
it will be recalled, provides the basis for evalu- 
ating the significance of the variation between 
groups. On the other hand, if experimental 
error, variation within groups, can be de- 
creased, this, in turn, should add to the preci- 
sion of the experimental design. 


INDIVIDUAL DIFFERENCES 

A major contributing source to the total 
variation in performance of subjects in an ex- 
periment is that of individual differences. It is 
found, for example, that with almost any test 
or measure made upon human subjects, the 
resulting observations will vary. Psychologists 
speak of this variation in terms of individual 
differences, regardless of whether the differences 
are acquired or innate. We have referred to 
such differences, characteristic of individuals, 
as organismic variables. 

Consider an experimefit in which the experi- 
menter, without randomization, places subjects 
into one of r groups. For simplicity, let r be 
equal to two, and assume that one group is 
tested under one experimental condition 4 
and the other is tested under a different experi- 
mental condition B. The experimental condi- 
tions constitute the independent variable, and 
we shall assume that the dependent variable is 
some measure of learning. The experimenter 
is interested in determining whether learning, 
the dependent variable, is in any way related 
to the independent variable, the experimental 
condition. 

Suppose it is found that the 4 condition is 
superior to the B condition, as determined by 
the fact that the variation between the means 
of the two groups is significant when tested 
against the variation within groups, with the 4 
mean being higher than the B mean. The ex- 
perimenter draws the conclusion that the 
difference observed is a result of the indepen- 
dent variable, that is, the experimental condi- 
tions. However, let us assume that it is also 
found that the members of the 4 group have a 
significantly higher mean level of intellectual 
ability, as measured by some test of academic 
aptitude, than do the members of the B group. 
Since intellectual ability is probably positively 
related to learning ability, the experimenter has 
no way of knowing whether the superior per- 
formance of the A group under the experimental 
condition is the result of the experimental con- 
dition itself or is merely an artifact of the initial 
differences in the groups in terms of intellectual 
ability. 

If the members of the A group are, in general, 
superior in intellectual ability to the members 
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of the B group, this will operate as a systematic 
source of error in the measures of performance 
obtained under the experimental conditions. 
The mean of the A group, in other words, may 
be expected to be higher than the mean of the 
B group, regardless of the experimental condi- 
tions, Thus, if the means do differ on the de- 
pendent variable, this may be due either to an 
organismic variable, intellectual ability, or to 
the experimental variable, or both. The effect 
of the experimental condition is said to be con- 
founded with intellectual ability and no clear 
conclusion concerning the experimental variable 
is possible. 


RANDOMIZATION OF SUBJECTS 


When organismic variables are not incorpo- 
rated into the experimental design as inde- 
pendent variables whose influence is to be in- 
vestigated, it is important to randomize these 
variables so that they will not have any system- 
atic influence on the mean outcomes on the 
dependent variable. If subjects are randomly 
assigned to the different experimental condi- 
tions, then we may expect differences in organ- 
ismic variables to be randomly distributed 
among the various experimental groups. The 
means of the experimental groups may thus be 
expected to differ as a result of the organismic 
variables only within the limits of random sam- 
pling. 

Methods of randomly assigning subjects to 
experimental groups are described by Lind- 
quist (1940), Walker (1943), and Edwards 
(1950b). With random assignment of subjects 
to the experimental conditions, organismic var- 
iables may be assumed to contribute to the 
random variation in the means of the groups, 
but not to any systematic variation in the means. 

The necessity of randomization in the assign- 
ment of subjects may be made obvious if we 
consider an experiment in which 40 male and 
60 female subjects are available. If an equal 
number of subjects is to be assigned to each of 
two groups, the method by which the subjects 
are assigned to the groups might result in hav- 
ing a preponderance of females in one group, 
with only a relatively small number of males. 
If the two groups of subjects are tested under 
different experimental conditions, and it is 
found that one group has a significantly higher 
mean on the dependent variable, this difference 
between the means may be the result of the ex- 
perimental conditions or it may be the result of 
a systematic sex difference in performance, or 
both of these variables may contribute to the 


difference between the means. 
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In the case described, it would clearly be pos- 
sible to assign the subjects so that the same 
number of males and females are present in each 
group. Balancing of subjects in this way may 
often be possible when the organismic variable 
is as obvious as it is in this case. But there will 
still be any number of other organismic vari- 
ables operating, and on many of these the ex- 
perimenter may have no knowledge of possible 
systematic differences which may be operative 
in performance under the experimental condi- 
tions. In what way shall the 60 females be di- 
vided into two groups of 30 each for the two 
experimental conditions? In what way shall 
the 40 males be divided into two groups of 20 
each for the two experimental conditions? By 
randomly assigning subjects in each sex group 
to the experimental conditions, the experi- 
menter has some assurance that these other or- 
ganismic variables will be randomized and 
thus removed as a possible systematic source of 
error contributing to the variation in the means 
of the groups on the dependent variable. Fail- 
ure to randomize the assignment of subjects 
means that the outcome of the experiment is 
always potentially open to more than one inter- 
pretation. 


ARRANGEMENT OF CONDITIONS 


Another systematic source of variation may 
occur with respect to the arrangement of the 
experimental conditions themselves. If, for ex- 
ample, one group of subjects is tested under one 
experimental condition at a particular time, in 
the morning, let us say, and a second group of 
subjects is tested under another experimental 
condition at another time, in the afternoon, let 
us say, then time of testing may contribute to 
any observed difference between the means of 
the groups on the dependent variable. Any 
systematic influence resulting from the differ- 
ence in time of testing could, of course, be con- 
trolled by testing all subjects at the same time. 
Or the influence of time of testing could be 
randomized by the assignment of the experi- 
mental conditions in such a way that some sub- 
jects would be tested under each condition, in 
the morning and in the afternoon. 

Time of testing itself might be introduced 
into the design as another independent variable 
to be investigated. This could be done by plan- 
ning the experiment so that half of the subjects 
in each experimental condition are tested in the 
morning and half in the afternoon, Random- 
ization in the assignment of the subjects would 
still be involved, however, in this design. 

If prior evidence was available to indicate 
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that time of testing did not contribute in any 
significant way to performance under the ex- 
perimental conditions, this variable, of course, 
would not have to be considered in the plan of 
the experiment. 

Temperature differences, differences in illu- 
mination, or differences in other environmental 
variables may be considered in the same way as 
the illustrative example of time of testing. All 
of these may be potential sources of systematic 
variation, to be randomized, controlled, or in- 
vestigated. 


IMPORTANCE OF RANDOMIZATION 


The importance of randomization is stressed 
by Fisher in his discussion of the classical case 
of the lady tea-taster. This lady, he states, “de- 
clares that by tasting a cup of tea made with 
milk she can discriminate whether the milk or 
the tea infusion was first added to the cup” 
(Fisher, 1942, p. 11). The experimental de- 
sign selected to investigate this claim is a rela- 
tively simple one: Eight cups of tea are pre- 
pared, four mixed in one way and four in the 
other. These cups are presented to the lady in 
random order for judgment. She has been told 
that there are four cups of one kind and four 
of the other. She is asked to taste the cups and 
to divide them into two sets of four each, in 
terms of how she believes the mixture to have 
been prepared. She is also told that the cups 
will be presented to her in random order. 

The outcome of the experiment is evaluated 
in terms of the assumption that the lady does 
not have the ability to make the discrimination 
that she claims. If that is the case, the results 
of the experiment may be assumed to be gov- 
erned by chance. Any observed outcome can 
then be evaluated in terms of the frequency 
with which that outcome would be expected in 
terms of the chance distribution. It can be 
shown that if nothing but chance is operating to 
influence judgments, then a completely correct 
classification of the cups would occur with a 
theoretical relative frequency of 1 time in 70. 

If the lady makes a completely correct clas- 
sification, this outcome would have a probabil- 
ity value of .014, under the hypothesis specified, 
i.e., that the judgments are a matter of chance. 
Consequently, the experimenter would reject 
the chance hypothesis for this outcome in terms 
of the usually accepted significance standard. 
Having rejected the chance hypothesis, can the 
experimenter conclude that the lady is correct 
in her claim? This is possible only if he has as- 
surance that, lacking the discrimination, the 
completely correct classification actually does 
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have a probability of .014. Fisher's comments 
on the necessity of randomization to ensure that 
this is the case are worth quoting in some detail: 


If all those cups made with the milk first had 
sugar added, while those made with the tea first 
had none, a very obvious difference in flavour 
would have been introduced which might well 
ensure that all those made with sugar should be 
classed alike. These groups might either be 
classified all right or all wrong, but in such a 
case the frequency of the critical event in which 
all cups are classified correctly would not be 1 
in 70, but 35 in 70 trials, and the test of sig- 
nificance would be wholly vitiated. Errors equiva- 
lent in principle to this are very frequently 
incorporated in otherwise well-designed experi- 
ments. 

It is no sufficient remedy to insist that “all 
the cups must be exactly alike” in every respect 
except that to be tested. For this is a totally im- 
possible requirement in our example, and equally 
in all other forms of experimentation, In practice 
it is probable that the cups will differ perceptibly 
in the thickness or smoothness of their material, 
that the quantities of milk added to the dif- 
ferent cups will not be exactly equal, that the 
strength of the infusion of tea may change be- 
tween pouring the first and the last cup, and that 
the temperature also at which the tea is tasted 
will change during the course of the experiment. 
These are only examples of the differences prob- 
ably present; it would be impossible to present 
an exhaustive list of such possible differences 
appropriate to any one kind of experiment, be- 
cause the uncontrolled causes which may influ- 
ence the result are strictly innumerable. (Fisher, 
1942, pp. 17-18) 

The element in the experimental procedure 
which contains the essential safeguard is that 
the two modifications of the test beverage are to 
be prepared “in random order.” This, in fact, 
is the only point in the experimental procedure 
in which the laws of chance, which are to be in 
exclusive control of our frequency distribution, 
have been explicitly introduced. The phrase 
“random order” itself, however, must be regarded 
as an incomplete instruction, standing as a kind 
of shorthand symbol for the full procedure of 
randomisation, by which the validity of the test 
of significance may be guaranteed against corrup- 
tion by the causes of disturbance which have not 
been eliminated. To demonstrate that, with 
satisfactory randomisation, its validity is, indeed, 
wholly unimpaired, let us imagine all causes of 
disturbance — the strength of the infusion, the 
quantity of milk, the temperature at which it 
is tasted, etc. — to be predetermined for each 
cup; then since these, on the null hypothesis, 
are the only causes influencing classification, we 
may say that the probabilities of each of the 70 
possible choices or classifications which the sub- 
ject can make are also predetermined. If, now, 
after the disturbing causes are fixed, we assign, 
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strictly at random, 4 out of the 8 cups to each of 
our experimental treatments, then every set of 4, 
whatever its probability of being so classified, 
will certainly have a probability of exactly 1 in 
70 of being the 4, for example, to which the milk 
is added first. However important the cause of 
disturbance may be, even if they were to make it 
certain that one particular set of 4 should receive 
this classification, the probability that the 4 so 
classified and the 4 which ought to have been 
so classified should be the same, must be rigor- 
ously in accordance with our test of significance. 

It is apparent, therefore, that the random 
choice of the objects to be treated in different 
ways would be a complete guarantee of the valid- 
ity of the test of significance, if these treatments 
were the last in time of the stages in the physical 
history of the objects which might affect their 
experimental reaction. The circumstance that the 
experimental treatments cannot always be ap- 
plied last, and may come relatively early in their 
history, causes no practical inconvenience; for 
subsequent causes of differentiation, if under the 
experimenter's control as, for example, the 
choice of different pipettes to be used with dif- 
ferent flasks, can either be predetermined before 
the treatments have been randomised, or, if this 
has not been done, can be randomised on their 
own account; and other causes of differentiation 
will be either (a) consequences of differences 
already randomised, or (b) natural consequences 
of the difference in treatment to be tested, of 
which on the null hypothesis there will be none, 
by definition, or (c) effects supervening by 
chance independently from the treatments ap- 
plied. Apart, therefore, from the avoidable error 
of the experimenter himself introducing with his 
test treatments, or subsequently, other differences 
in treatment, the effects of which the experiment 
is not intended to study, it may be said that the 
simple precaution of randomisation will suffice 
to guarantee the validity of the test of signifi- 
cance, by which the result of the experiment is 
to be judged. (Fisher, 1942, pp. 19-21) 


SYSTEMATIC ERRORS OF RATINGS 


In some experiments different observers may 
be used to record or rate the behavior of sub- 
jects. Assume that two observers are available 
and that one observer has been assigned to each 
of two experimental groups whose performance 
is to be rated. If one of the observers tends to 
overrate systematically the performance of sub- 
jects in the group which he is observing, the 
mean performance score of this group will be 
higher than it would be otherwise, merely be- 
couse of the rating tendency of the observer. 

A comparison of the two means might reveal 
that one is significantly greater than the other. 
The experimenter may then conclude that the 
difference is the result of the independent vari- 
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able, the experimental conditions, whereas the 
difference may be primarily a function of the 
observer and not the independent variable at 
all. Systematic errors of observation or rating 
of this kind will contribute to the variation in 
the means of the groups but not to the vari- 
ability. 

A suggested solution to the problem described 
might be to have both raters observe both 
groups and to pool their ratings, under the as- 
sumption that any systematic errors of rating 
would tend to operate in the same way for each 
of the two groups. If this assumption is valid, 
then the variance between groups would not be 


contributed to significantly by the errors of 
rating. 

It is worth noting that systematic errors of 
rating will not be detected simply by correlating 
ratings of the same subjects obtained from two 
raters. Systematic errors of the sort described 
will have no influence upon the correlation co- 
efficient. Two observers may show equal vari- 
ability in their ratings of the same subjects and 
may also show perfect agreement in their rat- 
ings as measured by the correlation coefficient. 
Yet one observer may systematically over- or 
underestimate the performance of the individ- 
ual members of the groups. 


REDUCING EXPERIMENTAL ERROR 


So far we have been discussing the necessity 
of controlling by experimental techniques or of 
randomizing systematic sources of error which 
might contribute to the variation between 
groups in addition to whatever variation might 
be contributed by the differences in experi- 
mental conditions. We shall now look at some 
factors contributing primarily to the variation 
within groups, i.e., to the measure of experi- 
mental error, and discuss the possible methods 
and techniques for reducing this source of vari- 
ation. 


RANDOM ERRORS OF MEASUREMENT 


Let a group of N subjects be divided at ran- 
dom into r groups of n subjects each. Let the 
subjects in each group be tested under different 
experimental conditions and assume that the 
experimental conditions operate systematically 
to contribute to the variation in the means of 
the r groups on the dependent variable. We 
shall assume that no systematic sources of error 
are present, but that each single observation on 
the dependent variable is subject to random 
errors of measurement. By a random error of 
measurement we shall mean an error which 
shows no systematic tendency to result in over- 
or underestimates of the single observations. 
Stated in another way, we may say that an error 
of measurement is random if the probability 
that a single observation is overestimated by a 
given amount is equal to the probability that 
the observation is underestimated by the same 
amount. 

In general, random errors of measurement 
will contribute to the variation within groups in 
such a way that the value of sẹ? obtained wiil 
be larger than it would be if no random errors 
of measurement were present. As shown in (8), 
Sw? is an estimate of c? and the test of signifi- 
cance of the variation in the experimental 


means will be made by dividing the estimate of 
(7) by that of (8) to obtain the F ratio. Thus 


DE c? + noz? 
DI 9 


An examination of (9) will show that if 2? is 
very large compared with noz? the value of F 
will not be greatly in excess of 1.0. If, for ex- 
ample, nog? remains constant and g? is in- 
creased, the value of F approaches 1.0 as a limit. 
On the other hand, if the second term in the 
numerator of F does not vanish, then when c? 
is decreased, the value of F will become very 
large. 

Reduction of experimental error as measured 
by sw? will thus enable the experimenter to de- 
tect more readily any systematic differences in 
the means of the experimental groups as meas- 
ured by noz?. Such reduction in experimental 
error can often be accomplished by increasing 
the reliability of the observations of the de- 
pendent variable and thus decreasing random 
errors of measurement. 


F= 


Sw 


INCREASING THE NUMBER OF REPLICATIONS 


| Let the difference between two means X,— 
X, be D, let s,? represent the variability of the 
individual observations about the group means 
and let n be the number of observations in each 
group. Then a test of significance of the dif- 
ference, X, —X,, can be made in terms of the t 
distribution, where 
AUI 
t= EE A (10) 
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Sw? is as defined by (6), and the usual assump- 
tions of normality of distribution and homo- 
geneity of variance are involved. 
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The value of t obtained by (10) may be com- 
pared with the table of t (Fisher and Yates, 
1949) to determine its probability of occurrence 
under the hypothesis tested. In general, the 
larger the value of t, the smaller the probability 
that it is the result of chance or random vari- 
ation. 

It is apparent that if D is small relative to 
5,2, it will require a larger n to obtain a speci- 
fied value of t than would be the case if D were 
large relative to s,,2. In other words, the smaller 
the difference between two means with a con- 
stant value of 5,2, the larger the number of ob- 
servations that will be required to regard the 
difference as significant. 

This same notion can be generalized to the 
case of more than two means. For this case, the 
test of significance may be made by (9). It is 
apparent in (9) also that if the true variation 
in a group of means, gei, is small relative to the 
variation of the individual measures, c?, then a 
larger value of n will be required to obtain a 
specified value of F than would be the case if 
c;? were large relative to c?. In other words, in- 
creasing n in (9) will result in a larger value of 
F, other things being equal. 

In books on experimental design (Fisher, 
1942; Cochran and Cox, 1950; Edwards, 1950b) , 
increasing the number of observations in a 
given experimental plan is referred to as in- 
creasing the number of replications. Replica- 
tion, in a sense, refers to the trials of an experi- 
ment. For example, if the experiment con- 
sisted of two experimental conditions with sep- 
arate groups under each condition, one pair of 
subjects would provide one replication or trial 
of the experiment. Each additional pair of sub- 
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jects would provide another replication of the 
experiment. With four experimental condi- 
tions and with a separate group of subjects 
tested under each condition, four subjects 
would provide one replication. Each additional 
set of four subjects would provide one addi- 
tional replication. 

From the discussion above it can readily be 
seen that increasing the number of replications 
of an experiment, for any specified true dif- 
ference or variation in a group of means, will 
result in a larger value of t or of F than would 
be obtained with fewer replications. Since, in 
general, the larger the value of t or F observed, 
the smaller the probability that it results from 
random or chance factors under the hypothesis 
tested, it can be said that increasing the num- 
ber of replications will increase the likelihood 
of detecting a given difference in the means of 
the experimental groups at a defined signifi- 
cance value. 


SELECTION OF SUBJECTS 


Since perhaps the largest contribution to Sw? 
in experimental work in psychology is the in- 
dividual variation of the subjects, a reduction 
in 5,2? can often be accomplished by the selec- 
tion of subjects who are more homogeneous 
with respect to individual differences. Such 
selection may be based upon variables which 
the experimenter has reason to believe are cor- 
related with the dependent variable. 

Assume, for example, that an organismic var- 
iable is positively related to the dependent 
variable, as shown in Fig. l. An examination 
of Fig. 1 shows that the subjects of high, aver- 
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Fic. 1. The theoretical variation on a dependent variable of un- 
selected and selected subjects on an organismic variable that is 
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age, or low values of the organismic variable will 
vary less among themselves on the dependent 
variable than would be the case for unselected 
subjects covering the entire range of the organ- 
ismic variable. 

The experimenter may select subjects whose 
measures on the organismic variable place them 
in the average section. These subjects, in turn, 
may be assigned at random to the experimental 
conditions. If the experimental conditions con- 
tribute to the variation in the means of the 
groups, then this contribution will be more 
easily detected than would be the case with un- 
selected subjects. This is because of the reduc- 
tion in sẹ? obtained by the selection of subjects 
on the organismic variable (it is assumed—and 
this will usually be the case—that the experi- 
mental conditions do not contribute signifi- 
cantly to the variation within groups). 

The reduction in error, as measured by Zei 
is achieved, in this instance, at some loss in the 
generality of the results of the experiment, as- 
suming that the observed means of the experi- 
mental groups differ significantly. It would 
not be possible, for example, to generalize the 
results to subjects with either high or low levels 
of the organismic variables. "They are not rep- 
resented in the experimental groups and the re- 
sults obtained may or may not hold for these 
subjects. In other words, without further re- 
search, the conclusions apply only to the level 


of the organismic variable selected and investi- 
gated in the experiment. 

Experimental designs in which levels of an 
organismic variable can be used in such a way 
as to reduce the error variance without loss in 
generality of the conclusions drawn will be dis- 
cussed in a later section. 


COVARIANCE TECHNIQUES 


Covariance techniques may also be used to re- 
duce experimental error as measured by s,?. 
Covariance designs require that the experi- 
menter have available measures on one or more 
organismic variables. If subjects are assigned 
at random to experimental groups and if the 
means are approximately equal with respect to 
the organismic variable for which measures are 
available, as they might be expected to be with 
random assignment, and if, at the same time, 
there exists a relationship between the measure 
of the organismic variable and the dependent 
variable, then the error variance will be reduced 
by taking this relationship into account. ‘The 
amount of reduction in the error variance will 
depend upon the strength of the relationship 
that exists between the organismic variable and 
the dependent variable. Covariance analysis 
is described by McNemar (1949), Lindquist 
(1940), Cochran and Cox (1950), Johnson 
(1949), Edwards (1950b), and Fisher (1942). 


EXPERIMENTAL TECHNIQUE 


Experimental technique refers to the manner 
and conditions under which the experiment it- 
self is carried out or conducted. Cochran and 
Cox (1950, p. 30) have pointed out that the 
main objectives of a good technique are as 
follows: 


i. To secure uniformity in the application of 
the treatments. 

ii. To exercise sufficient control over external 
influences so that every treatment produces its 
effects under comparable and desired conditions. 

iii. To devise suitable unbiased measures of 
the effects of treatments. 

iv. To prevent gross errors, from which no 
type of experimentation seems to be entirely free. 


It is important to recall that the experimenter 
anticipates that a specified experimental condi- 
tion will either tend to raise or to depress the 
measures obtained in a systematic fashion, so 
that the mean will be either raised or lowered, 
with no significant increase in variation of the 
individual measures. If an experimental con- 
dition tends to increase the performance of 
some subjects and to depress the performance 


of others, it is of importance to determine the 
associated organismic variable or variables 
which would enable the experimenter to pre- 
dict which subjects will show improvement and 
which will not. The organismic variable would 
thus become another independent variable in 
the experiment and involve a change in the ex- 
perimental design. 

Now by “uniformity in the application of the 
treatments” is meant that the experiment is 
conducted in such a way that any specified ex- 
perimental condition will be essentially the 
same, insofar as possible, for any specified group 
of subjects. It is assumed that when this is the 
case, errors will not operate in such a way as 
to increase the variation in the measures ob- 
tained under the experimental condition. 

Suppose, for example, that subjects are tested 
individually under the experimental conditions 
and that the instructions to the subject are 
given by the experimenter. A difference in 
emphasis, stress upon one aspect, or other dif- 
ferences which might occur from subject to 
subject should be avoided, otherwise they may 


EXPERIMENTS: 


tend to provide "sets" for some subjects which 
will facilitate performance and "sets" for other 
subjects which will depress performance. 

In psychological research, uniformity of the 
experimental conditions is of particular impor- 
tance because of the subtleties of the interac- 
tions between experimenter and subjects. Sub- 
jects may react to the mood, appearance, and 
behavior of the experimenter as well as to the 
experimental conditions in which the experi- 
menter is primarily interested. A friendly ap- 
proach upon the part of the experimenter with 
some subjects and a more hostile or aggressive 
approach to others may introduce departures 
from uniformity which will influence individual 
performance. 

Controlling "external influences so that every 
treatment produces its effects under comparable 
and desired conditions" is another aspect of a 
good technique. It is obvious that if external 
factors are not controlled with respect to the 
various experimental conditions, these factors 
may tend to have systematic influences upon the 
means. An experiment may be designed to test 
performance under conditions A, B, and C. The 
setup of the experiment may require that each 
test condition be assigned to a different experi- 
mental room. One of the rooms may differ 
from the other two in that it is poorly lighted 
and ventilated. Subjects tested in this room 
may show performance that is inferior to the 
performance of subjects tested in the other 
rooms. The difference observed in the means, 
however, may be the result of differences in the 
rooms rather than the result of differences in 
the experimental conditions which the research 
worker believes he is investigating. 

Methods of recording and quantifying ob- 
servations have already been discussed in an- 
other connection. The experimenter will need 
to know something about the reliability of the 
measures he obtains, Some research may re- 
quire that new methods of measurement be de- 
veloped—a research problem in and of itself. 
In addition, the changes in performance pro- 
duced by different experimental conditions may 
be of such a subtle nature that the measuring 
instrument used is not sufficiently sensitive to 
reflect these changes. 

Gross errors may result from a variety of 
causes. The experimenter may misread a dial 
or may miscalculate a score on a test. Careful 
observation will prevent many such gross errors, 
and examination of the records obtained from 
individual subjects may assist in the detection 
of gross errors. Measures obtained from single 
subjects which are out of line with those ob- 
tained from other comparable subjects may be 
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the result of errors of observation, errors of 
recording, or errors of calculation. 


Faulty technique may increase the real experi- 
mental errors in two ways. It may introduce 
additional fluctuations of a more or less random 
nature, Such fluctuations, if they are substantial, 
should reveal themselves in the estimate of error 
as calculated in the analysis of variance . . . 
Where his estimated standard errors are consis- 
tently higher than those of other workers with 
similar material, the experimenter is advised to 
seek the reason, which may lie in differences in 
technique. In addition, faulty technique may 
result in measurements that are consistently 
biased. The estimate of error does not take ac- 
count of such biases, since it is derived from 
comparisons of the measurements with one an- 
other; in other words, it estimates precision rather 
than accuracy. The principal safeguards against 
such biases are care and skill in the construction 
and handling of measuring devices, plus the in- 
telligent use of randomization. (Cochran and 
Cox, 1950, pp. 30-31) 


TESTS OF TECHNIQUE 


"Tests of technique can often be incorporated 
into an experimental design. This is accom- 
plished by dividing the subjects at random so 
that each experimental group consists of a num- 
ber of smaller random subgroups. Each set of 
subgroups will then constitute one trial of the 
complete experiment. A comparison of the 
results, that is, differences in the means, and of 
the error variances from the various trials of the 
experiment should provide some information 
concerning the technique of the experiment. If 
the separate error variances themselves show 
considerable variation beyond chance expec- 
tancy, the technique is open to question, since 
a good technique should not show significant 
variations in error upon repetition with com- 
parable groups of subjects. A test of signifi- 
cance of variation in error variances can be 
made by means of Bartlett's (1937) test for 
homogeneity of variance. 

Lack of consistency in the differences among 
the experimental means and in their ordering in 
the trials of the experiment would provide evi- 
dence with respect to the repeatability of the 
results obtained. Obviously, there is something 
wrong with an experimental technique in which 
comparable results are not obtained with com- 
parable groups of subjects. On the other hand, 
if comparable results were obtained in the vari- 
ous trials of the experiment this would not nec- 
'essarily reflect upon the adequacy of the tech- 
nique and the absence of systematic sources of 
error. The same uncontrolled factors may be 
operating in each of the various trials, thus 
producing a spurious kind of consistency. 
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FUNCTIONS OF A CONTROL GROUP 


Many experiments in social psychology have 
been concerned with the influence of motion 
pictures, pamphlets, lectures, and other materi- 
als upon social attitudes. A typical experiment 
consists of giving a group of subjects an attitude 
test, usually called a pretest, then presenting the 
materials designed to influence the attitude, and 
finally giving the subjects a second attitude test, 
usually called a posttest. Solomon (1949) has 
called attention to the fact that many of the 
attitude studies summarized by Murphy, Mur- 
phy, and Newcomb (1937, pp. 946-979) have 
used this procedure. 

The difference between the mean score on the 
pretest and on the posttest is found and tested 
for significance. If the means differ signifi- 
cantly, the experimenter concludes that the 
change in attitude is the result of the materials 
presented. This may or may not be the case. 

It might be argued, for example, that the 
mere act of taking the first attitude test is suffi- 
cient to start subjects re-examining their atti- 
udes and that this is the major factor producing 
the change. Any possible influence that the 
materials might have, in other words, is com- 
pletely confounded with the potential influence 
of taking the pretest. The experimental de- 
sign, in this instance, does not permit a clear 
interpretation of the results. 

A more adequate experimental design in- 
volves a control group. The term control group 
in psychological research refers to a group of 
subjects whose performance on a dependent 
variable is to be used as a basis for evaluating 
the performance of one or more experimental 
groups on the same dependent variable. For 
the problem described, a group of N subjects 
could be divided at random into two groups of 
r subjects each. One of these groups would be 
designated the control group and the other the 
experimental group. Both groups would be 
given the pretest of attitudes. The experimental 
group would then be presented with the mate- 
rials designed to influence their attitudes, but 
the control group would not. Both groups 
would then be retested. The experimental de- 

sign may be summarized, following Stouffer 
(1950) , as shown below: 


SEQUENCE EXPERIMENTAL CONTROL 
Group Group 
1. Pretest of attitude p X, 
2. Materials presented yes no 
3. Posttest of attitude Ze Bee 


Difference X; E d == D X. =X,’ =D 


The difference D between the mean of the ex- 
perimental group on the pretest X, and on the 
posttest X, is regarded as the change result- 
ing from the taking of the pretest and the ma- 
terials presented. If the amount of change 
shown by the control group is now subtracted 
from the change D shown by the experimental 
group, we have available a measure of the in- 
fluence of the materials alone. 


EXTENSION OF THE CONTROL GROUP DESIGN 


In his discussion of the use of control groups 
in psychological research, Solomon (1949) has 
expressed his conviction that the design de- 
scribed above is still inadequate for a clear in- 
terpretation of the results. It is his contention 
that the pretest may interact with the experi- 
mental condition. Taking the pretest, he points 
out, “may conceivably change the ‘set’ or atten- 
tional factors important to the effectiveness of 
training” (p. 141) or materials presented. Or 
the pretest may influence the manner in which 
the materials are perceived by the experimental 
group. 

To estimate the influence of possible interac- 
tion effects, Solomon proposes an extension of 
the control group design, consisting of the addi- 
tion of a second control group. This second 
control group would differ from the first in that 
it would not be given the pretest, but would 
be presented with the experimental materials. 
The experimental design would be as shown 
below: 


EXPERIMENTAL CoNTROL CONTROL 


Group Grour | Group 2 
1. Pretest of attitude X, X^ ? 
2. Materials presented Yes No Yes 
3. Posttest of attitude X, at Xy" 
Difference D D’ D" 


Since the second control group has not been 
given a pretest, the mean score for this group on 
the pretest must be estimated from the mean 
scores, X, and X,’, obtained from the experi- 
mental group and the first control group. The 
difference D” between this estimated mean and 
the mean on the posttest for the second control 
group is regarded as a measure of the influence 
of the experimental condition alone. The dif- 
ference D' between the means of the first con- 
trol group on the pre- and posttests is a measure 
of the influence of taking the pretest alone. The 
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difference D between the means of the experi- 
mental group is a measure of the influence of 
taking the pretest, the experimental condition, 
and any interaction effects that may exist be- 
tween the pretest and experimental conditions. 
An estimate of the interaction, designated by 
Solomon as J, is given by 


1=D— (D 4D’). 


There are many ways to interpret an obtained 
interaction. For example, if an interaction term 
is relatively large compared with the improve- 
ment of the experimental group, and if it is 
positive in sign, we might assume that the pre- 
test somehow facilitated the effective use of train- 
ing procedures, We then might start on a search 
for the kinds of attitudinal and perceptual 
changes that the pretest had produced, since such 
changes might have a generality of application 
for many kinds of educational procedures. In 
attitude testing, a negative J term would lead 
one to suspect that the pretest developed some 
kind of resistance to the propaganda material. 
This problem then could be approached by new 
experiments. It is possible that rules for building 
pretests which make subjects receptive to educa- 
tion might come out of such a program. In other 
words, the magnitude and direction of I ought to 
provide some hints to the experimenter. (Solo- 
mon, 1949, p. 143) 


Additional extensions of the control group 
design, involving three control groups, are also 
discussed by Solomon (1949). 

The use of control groups in psychological re- 
search will not always involve both a pretest and 
a posttest. A group of N subjects, for example, 
may be divided at random into two groups of r 
subjects each. One of these groups is designated 
as the experimental group and the other as the 
control group. Members of the experimental 
group experience the experimental condition 
(independent variable) and members of the 
control group do not. Both groups are then 
tested on the dependent variable. This design 
assumes that nothing occurs in the time interval 
between the division of the subjects at random 
into the two groups and the time of testing on 
the dependent variable that would differentially 
influence performance of the two groups on the 
dependent variable. Nothing, that is, except 
in the case of the experimental group, which 
is, in the interval, put through the experi- 
mental condition. 


Designs involving both a pre- and posttest of 
the experimental and control groups have been 
called by Stouffer (1950) and Hovland, Lums- 
daine, and Sheffield (1949) before-after designs. 
When the experimental procedure involves only 
the posttest, this has been called an after-only 
design. The relative merits of these two kinds 
of designs are discussed in some detail from the 
point of view of statistical and sampling consid- 
eration, the kinds of analyses of the data pos- 
sible, and the possible effects of the measuring 
process on the measurements observed, by Hov- 
land, Lumsdaine, and Sheffield (1949, pp. 308- 
$28). In addition, these authors provide many 
examples of applications of both types of de- 
sign in research on problems of mass communi- 
cation. 


NUMBER OF SUBJECTS TO BE ASSIGNED TO 
CONTROL AND EXPERIMENTAL GROUPS 


If subjects have been divided at random and 
we may assume that the variation within the 
two groups will be approximately the same on 
the dependent variable, then it can be shown 
that the most efficient division of the subjects 
is such that we have an equal number in each 
group. In general, "we may denote as an effi- 
cient design that distribution of the total popu- 
lation among the treatment groups which will 
yield maximally significant differences when the 
comparisons are made between each experi- 
mental group and the appropriate control 
group” (Kurtz, Jaspen, and Ash, 1951, p. 580) . 

If two experimental groups are used along 
with one control group, the most efficient divi- 
sion of the subjects is such that each experi- 
mental group has 29.29 percent of the available 
subjects, with 41.42 percent in the control 
group. Tables have been prepared by Kurtz, 
Jaspen, and Ash (1951) showing the most effi- 
cient division of the subjects for one to four 
contro] groups and from one to ten experi- 
mental groups. The tables are so constructed 
as to show the division of subjects when the 
assumption of homogeneity of variance is not 
tenable, as well as for the case where this as- 
sumption is tenable, These tables are designed 
for use when the comparisons to be made are 
between the mean of the control group and each 
of the respective means of the experimental 
groups. 


RANDOMIZED GROUPS DESIGN 


One of the most frequently used and simplest 
experimental designs is called the randomized 
groups design. In this design N subjects are 
divided at random into r groups of n subjects 


each. It is not necessary that the groups have 
equal ais, although the analysis of the data is 
somewhat simpler when this is the case. One 
of the groups may be a control group and the 
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other groups may be experimental groups. For 
each subject we have available a measure X on 
the dependent variable. The independent var- 
iable consists of the different experimental 
conditions. 

The randomized groups design is of value in 
problems where the differences among a set of 
experimental means are to be evaluated or 
where comparisons are to be made between a 
control group and each of the experimental 
groups. 

A randomized groups design, for example, 
might be used to investigate: 


. . . the influence of different periods of food 
deprivation upon learning; the influence of dif- 
ferent numbers of reinforcements upon condi- 
tioning and extinction; the influence of different 
methods of instruction upon achievement; the 
influence of different sets of verbal instructions 
upon problem solving; the influence of different 
sensory cues upon maze learning; the influence 
of different kinds of motivation upon perform- 
ance; the influence of different periods of rest 


RANDOMIZED 


In randomized blocks design subjects are di- 
vided into a number of blocks or groups, each 
of which, the experimenter has reason to believe, 
will be more homogeneous in performance on 
the dependent variable than unselected subjects. 

The term “randomized blocks” comes from 
agricultural experiments in which the experi- 
mental unit is a plot of land, and a block cor- 
responds to a section of land consisting of adja- 
cent or neighboring plots. There is reason to 
believe that plots of land which are near to one 
another are more alike with respect to fertility 
and general soil condition than plots which are 
more remote. In psychological research, the 
experimental unit is the subject instead of the 
plot, the relatively homogeneous group corre- 
sponds to a block, and individual differences 
correspond to fertility and general soil con- 
dition. Each block will, in general, consist of 7 
subjects, where r is the number of experimental 
conditions, With n such blocks available, the 
experiment will be replicated n times. 

On what basis does the experimenter make 
up the blocks or groups of subjects? This will 
depend a great deal upon the insight of the ex- 
perimenter and upon his knowledge of the de- 
pendent variable and related organismic vari- 
ables. If he knows, for example, that a particu- 
lar organismic variable correlates substantially 
with the dependent variable, the organismic 
variable can be made the basis of the subdivi- 
sion of the subjects into blocks. An examina- 


upon fatigue; the influence of different kinds of 
interpolated activities upon learning; the influ- 
ence of different kinds of work situations upon 
production and fatigue; the influence of different 
periods of practice upon learning. (Edwards, 
1950b, p. 185) 


The randomized groups design has the prop- 
erty of having the error variance based upon a 
larger number of degrees of freedom than any 
other design using the same number of sub- 
jects. This is likely to be of value in experi- 
ments with a small number of observations, 
where a possible reduction in the error variance 
by an alternative design would not be offset by 
a corresponding loss in the degrees of freedom 
for the error variance. At the same time, the 
error variance will include all of the variation 
attributable to individual differences. When 
subjects are fairly heterogeneous in performance 
on the dependent variable, the design may re- 
quire an excessive number of subjects in order 
to detect significant differences in the means of 
the experimental groups. 


BLocks DESIGN 


tion of Fig. 1 will show that if vertical slices are 
made through the correlation diagram, the sub- 
jects thus selected within each slice will be 
more alike with respect to the organismic vari- 
able and that they will also be more alike with 
respect to performance on the dependent vari- 
able. 

Assume that r experimental conditions are 
to be tested and that the experimenter has meas- 
ured N subjects, where N is some multiple of r, 
with respect to some organismic variable that 
he believes is positively related to performance 
on the dependent variable. 1f the subjects are 
arranged in order of magnitude of their values 
on the organismic variable, then the first 7 sub- 
jects will make up the first block or group, the 
next r subjects the second block or group, and 
so on, until n such blocks or groups have been 
selected. All that now remains is for the experi- 
menter to assign the subjects within each block 
at random to the various experimental condi- 
tions, with one subject in a block to each con- 
dition. 

Since, in psychological research, the subjects 
will be matched with respect to an organismic 
variable within each block, the randomized 
blocks design has also been called a matched 
groups design (Edwards, 1950b, pp. 264-283) . 
Each group or block of subjects may be consid- 
ered as representing a level of the organismic 
variable. If the matching has been successful, 
in the sense that the levels of organismic vari- 
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able used for matching the subjects are posi- 
tively and substantially correlated with per- 
formance of the subjects on the dependent vari- 
able under the experimental conditions, then, 
in general, the error variance for the matched 
groups design will be less than the correspond- 
ing error variance for the randomized groups 
design, The reason is that some of the variation 
included in the error variance of the random- 
ized groups design will now be attributable to 
the variation in the means of the replications 
or blocks. 

The randomized blocks design is also ap- 
plicable in the case of two groups, where one 
group serves as a control group and the other as 
an experimental group. 

In the matched groups design discussed, it 
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has been assumed that each of the r groups will 
be representative of a single common popula- 
tion described by the matching variable. That 
is, the means and variances within the several 
groups should be approximately equal with re- 
spect to the matching variable. The procedure 
of taking a block of homogeneous subjects and 
then assigning one member from the block to 
each of the experimental conditions at random, 
pretty much ensures that the groups will be 
comparable within the limits of random sam- 
pling errors. The independent variable in this 
design will be the differences in the experi- 
mental conditions to which the subjects are 
assigned and the dependent variable will be 
the performance measures of the subjects ob- 
tained under the experimental conditions. 


REGRESSION EFFECTS IN MATCHED GROUPS DESIGNS 


There are other cases of experimental designs 
using matched groups (Thorndike, 1942) which 
involve additional considerations. These con- 
siderations concern the phenomenon of re- 
gression. 

Assume that the distribution of measure- 
ments on an X variable is as shown in Fig. 2. We 
assume that this variable is positively related to 
another variable Y, but that the relationship is 
not perfect. If we select a group of subjects who 
are all one standard deviation above the mean 
on the X variable and another group of subjects 
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who are all one standard deviation below the 
mean on this variable, the difference between 
the means of these two groups, in standard devi- 
ation units, will be 2.0. If it is known that the 
correlation between the X variable and the Y 
variable is .80, then it can be predicted that if 
the two groups are tested on the Y variable, 
without any intervening experimental condi- 
tions, the difference between the means on this 
variable will be less than 2.0 standard deviation 
units or, more precisely, that the difference will 
be 1.6 standard deviation units. 


[^] zl 


Fic. 2. Normal distribution with mean equal to zero and stand- 


ard deviation equal to 1.0. 


This prediction is based upon the phenome- 
non of regression. The scores of both groups 
of subjects will regress toward the common pop- 
ulation mean, which, for scores in standard devi- 


ation units, is equal to 0. Thus the mean score 
of the group which was 1.0 standard deviation 
units above the mean on the X variable, may be 
predicted to be only .8 standard deviation units 
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above the mean on the Y variable. The mean 
score of the group which was 1.0 standard devia- 
tion units below the mean on the X variable 
may be predicted to be only .8 standard devia- 
tion units below the mean on the Y variable. 
The scores for each group will regress, move 
back toward, the population mean. 

The degree of regression that is present on the 
Y variable can be shown to be inversely related 
to the coefficient of correlation between the X 
and Y measures and directly related to the 
amount of deviation of the X measures from 
the X mean. 

Now, assume that the X variable is a variable 
upon which subjects are to be matched and that 
the Y variable is a dependent variable. It should 
be clear that the procedures described for 
matching in the case of a randomized blocks 
design will, in general, tend to assign subjects 
to the experimental groups in such a way that 
the' members of each group will show compar- 
able deviations from the X mean. For each 
subject who is a given distance above or below 
the mean on the X variable in any one of the 
groups, there will be a corresponding subject in 
each of the other experimental groups. Conse- 
quently, in this situation, regression should in- 
fluence each of the groups in much the same 
way. In the randomized blocks design the pri- 
mary purpose of matching of subjects has to do 
with achieving a reduction in the error variance 
of the dependent variable. 

We now consider a case, described by Thorn- 
dike (1942), in which the primary purpose of 
matching subjects on a variable is to eliminate 
this variable as a potential factor in the explan- 
ation of the results observed on a dependent 
variable. Assume, for example, that an experi- 
menter wished to determine whether subjects 
taking a course in Latin will show greater im- 
provement in English vocabulary than students 
who do not take such a course. It would not 
do just to compare the mean vocabulary test 
scores of a group taking a course in Latin and 
a group not taking this course, for it might 
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well be argued that the two groups differed ini- 
tially with respect to their vocabulary test 
scores as well as possible other variables related 
to academic performance. 

Assume that it is true that students who pro- 
pose or plan to take a course in Latin consti- 
tute one population with respect to vocabulary 
test scores and let the mean of this population 
be designated as my. Assume also that students 
who do not propose to take a course in Latin 
constitute another population with respect to 
vocabulary test scores and let the mean of this 
population be designated as m,. We shall as- 
sume that the distributions of vocabulary test 
scores for a large number of students in each 
population are as shown in Fig. 3. 

Obviously, if students taking a course in 
Latin are tested for vocabulary only at the con- 
clusion of the course and their mean is com- 
pared with the mean vocabulary test score of 
students not taking a course in Latin, we may 
expect the two means to differ on this variable. 
The course in Latin may contribute little or 
nothing to the observed mean difference, which 
may merely be the result of initial differences in 
vocabulary of the two groups. To control ini- 
tial differences in vocabulary, the experimenter 
decides to match two groups upon the basis of 
their initial vocabulary test scores. 

Examination of Fig. 3 will indicate that if 
individuals in the non-Latin group are to be 
matched with individuals in the Latin group, 
most of the successful matchings will occur with 
respect to those subjects in each group whose 
scores fall between m, and ms. In general, 
then, the subjects in the non-Latin group will 
be those subjects whose test scores are above the 
population mean m, for non-Latin students. 
And the subjects in the Latin group will be 
those subjects whose test scores are below the 
population mean my of Latin students. With 
successful matching of pairs of subjects, the 
two groups will have equal means on the first 
or initial test, and the groups might be said to 
be equivalent with respect to this variable. We 
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Fic. $. "Theoretical distributions of vocabulary test scores for students 
planning to take a course in Latin and for students not planning to take a 


course in Latin. 
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shall indicate this common initial mean as m. 

But consider now the influence of regression 
in the absence of any experimental condition. 
If the two groups are retested on another form 
of the vocabulary test Y, which is to be used as 
a measure of the dependent variable, what may 
we predict? Even without the introduction of 
any experimental condition, we may predict 
that the mean score for the Latin group on the 
second vocabulary test will be higher than that 
of the non-Latin group. The reason for this is 
that since the correlation between X and Y may 
be high, but not perfect, the Latin students will 
show regression in their test scores on the second 
test toward the mean of the population from 
which they were drawn, Le, mg. In other 
words, on the second test the mean should move 
away from m and toward m» for the Latin stu- 
dents. And for the non-Latin group, we may 
predict that the mean m will be displaced to- 
ward m, the mean of the population from 
which they were drawn. It is important to 
note that the regression of scores is, as Thorn- 
dike (1942, p. 94) points out, “toward the mean 
of the particular population to which they be- 
long." 

The differential influence of regression, in 
this instance, might account for the difference 
observed between the means of the two groups 
of subjects on the second test, and the Latin 
course itself may contribute little or nothing 
to this observed difference. The problem is to 
obtain two groups who may be expected to be 
equivalent on the second test, apart from any 
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difference attributable to the course in Latin, 
not two groups who are equivalent on the first 
test. Thorndike suggests the following solution: 


We should match members from our two 
groups in terms of predicted final test score. 
Again the regression equation for predicting final 
test score will be different for each population 
and each final test score must be predicted in 
terms of the regression equation for that popula- 
tion. The regression equation for each popula- 
tion must be the one which holds when the spe- 
cial experimental treatment is not applied, or 
else the effect of the experimental treatment will 
be absorbed into our regression equation. 
(Thorndike, 1942, p. 101) 


Using the basis for matching suggested above, 
the observed difference between the means of 
the two groups on the second test may be as- 
sumed to be independent of any differential 
regression influence. 

Comparable problems would arise if subjects 
high and subjects low on a measure of intelli- 
gence were matched with respect to another 
measure of ability, say verbal ability. If one 
of these two groups was then given some kind 
of experimental treatment and retested with a 
second test of the ability measure, we would 
undoubtedly find that the two means now dif- 
fered on the matching variable. This observed 
difference also would be an instance of the re- 
gression phenomenon, and not necessarily the 
result of any differential influence of the experi- 
mental condition upon the verbal ability of 
bright and dull subjects. 


Ex Posr Facto DESIGN 


Another kind of matched groups design which 
has had frequent use in social psychology has 
been described in detail by Chapin (1947). 
Chapin calls the design an “ex post facto" 
study. The design may be illustrated by Chris- 
tiansen's (1947) study of the economic adjust- 
ment of high school graduates and non-high 
school subjects. 

We may designate the high school graduates 
as the experimental group and the non-high 
school graduates as the control group. The fac- 
tor of interest was the economic adjustment 
(dependent variable) of these two groups as re- 
lated to educational status (independent vari- 
able). Since differences between the two groups 
on variables other than education might be ex- 
pected to relate to economic adjustment, an at- 
tempt was made to control these variables by 
matching the members of the experimental and 
control groups on as many of the variables as 


possible. 


When the two groups were matched by equat- 
ing means and variances on five factors, the ini- 
tial 1130 members of the high school groups 
were reduced to 200. Matching on a sixth fac- 
tor reduced the number of subjects from 200 
to 145. When individuals were paired in the 
matching, rather than simply striving for equal 
means and variances on each of the matching 
factors, the six factors resulted in only 23 suc- 
cessful matches. 


"The expectation, of course, in the ex post facto 
design is that factors which are equated for the 
two groups will not influence the comparison on 
the variable of investigative interest. That this 
may not be the case when a large number ot 
subjects are eliminated from the experimental 
group because successful matches cannot be made, 
is demonstrated by a study of Freedman and 
Hawley (1949). Freedman (1950) reports, for 
example, that in comparing educational differ- 
ences between matched groups of migrants and 
nonmigrants, a significant difference was observed 


282 


when 58 percent of the original group of mi- 
grants was matched with a control group of 
nonmigrants, but the difference was not signifi- 
cant when 89 percent of the original group was 
matched. In other words, a difference which may 
be present between subjects easily or readily 
matched may not be the same as the difference 
observed between subjects who are more difficult 
to match. (Edwards, 1951a, p. 336) 


In addition to the problem mentioned above, 
the ex post facto study design is subject to re- 
gression complexities described earlier. It 
seems likely, for example, that the high school 
subjects in Christiansen’s study constitute one 
population with respect to economic adjust- 
ment and the non-high school subjects consti- 
tute another population. 

In a study such as Christiansen’s, the research 
interest is in the relationship of the group clas- 
sification and a dependent variable, when sub- 
jects are truly equivalent on some matching 
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variable or background trait. No experimental 
conditions are involved or implied in the'com- 
parison of the groups on the dependent vari- 
able. Thorndike (1942, pp. 100-101) points 
out that in this instance: 


If the groups are to be equivalent in true score 
on the background trait, they must be matched 
on the basis of predicted true score — i.e., score 
predicted by the regression equation between 
original test on the background trait and a retest 
at the time of the experimental comparison. 
Since the regression equations for the different 
populations will not be the same in the case 
which we are now considering, the predicted true 
scores for each individual must be determined 
from the regression equation for his own popula- 
tion. Groups matched in this way will be truly 
equivalent upon the background trait, and dif- 
ferences between them in the experimental trait 
must be due to some factor other than back- 
ground trait differences. 


FACTORIAL DESIGN 


In some experiments, we may have two or 
more independent variables, each of which is 
varied in two or more ways, sometimes called 
levels. If we make all possible combinations of 
these variables and incorporate each combina- 
tion as a separate experimental condition, we 
have what has come to be known as a factorial 
design. 

Consider the case of two variables, designated 
by the letters 4 and B. Let A be varied in two 
ways which we may designate A, and Ay. For 
simplicity, we shall let B be varied in only two 
ways also, B, and By. In general, if we let the 
subscripts i and j stand for the number of ways 
in which each of the variables is varied, we shall 
have i X j — r experimental combinations, each 
combination making up one of the experimental 
conditions under which subjects are to be 
tested. For the example described, we have 
2 x 2 — 4 different combinations. These com- 
binations are shown below: 


A,B, 
EP 


A,B, 
A;By 


One replication of the experiment described 
will require r equals 4 subjects. If we have N 
subjects available, where N is some multiple of 
r, then the subjects may be divided at random 
into r groups of n subjects each. With 80 sub 
jects, for example, we would have 4 groups of 
20 subjects each, This factorial design would 
permit a test of significance for the main effects, 
ie, the differences between the A means and 


the differences between the B means, and also 
for the interaction that may be present between 
A and B. 

The concept of an interaction was introduced 
earlier in connection with control groups. To 
see more precisely what an interaction meas- 
ures in the present case, assume that the A vari- 
able represents the number of presentations of 
a transcription of a short lecture, with A, cor- 
responding to one presentation and A, to two 
presentations. Let the B variable represent 
whether the transcription is read by a male, B4, 
or a female, Bj. We shall assume that 80 male 
subjects have been divided at random into 4 
groups of 20 subjects each. One group is then 
assigned at random to each of the 4 experi- 
mental conditions. The dependent variable is 
a measure of learning determined by a test 
covering the material presented in the transcrip- 
tion. The mean scores on the dependent vari- 
able of the four experimental groups are as 
shown below: 

Group: A,B, A,B, 


A,B, A,B, 


Mean: 58.0 34.0 72.0 64.0 


The following questions may be raised: Are 
two presentations superior to one? Is the male 
speaker superior to the female? Is there any 
interaction between the number of presenta- 
tions and the sex of the speaker? The four 
means may be arranged in a 2 x 2 table as fol- 
lows: 


EXPERIMENTS; THEIR PLANNING AND EXECUTION 


B, B, 
Male Female Mean B,—B, 


A, Two presen- 


tations 720 64.0 68.0 8.0 
A, One presen- 
tation 58.0 340 46.0 24.0 
Mean 65.0 49.0 57.0 
4,-4, 140 30.0 


To determine whether the number of presen- 
tations is a significant variable, we would test 
for the difference between the means 68.0 and 
46.0. To determine whether the sex of the 
speaker is a significant variable, we would test 
for the difference between the means 65.0 and 
49.0. Now, if there is no interaction between 
the two variables, then we should expect the 
difference in the amount learned under a male 
and female speaker to be the same, within ran- 
dom errors, regardless of the number of presen- 
tations. In this instance, the difference between 
the means of the male and female speakers for 
one presentation, 24.0, and the difference be- 
tween the means of the male and female speak- 
ers for the two presentations, 8.0, should esti- 
mate the same quantity, i.e., the difference due 
to the sex of the speaker. The average of these 
two means would be (8.0 + 24.0) /2 = 16.0 
and represents the main effect of the B variable. 
It can be observed that this value is the same 
as the difference between the two means for the 
male and female speakers referred to earlier, 
i.e., 65.0 — 49.0 equals 16.0. 

On the basis of the same argument, assuming 
no interaction, the difference between one and 
two presentations for the male speaker, 14.0, 
and the difference between one and two presen- 
tations for the female speaker, 30.0, should esti- 
mate the same quantity, i.e., the difference at- 
tributable to the number of presentations. The 
average of these two values would be (14.0 + 
30.0) /2 — 22.0 and it can be observed that this 
is the difference between the means for one and 
two presentations referred to earlier, i.e., 68.0 — 
46.0 equals 22.0. 

It seems apparent in the present instance, 
however, that an interaction effect is present. 
The basis for this observation is the discrepancy 
between the values 8.0 and 24.0, which suppos- 
edly estimate the same quantity, i.e. the differ- 
ence attributable to the sex of the speaker. It 
seems obvious that the superiority of the male 
speaker is much less when two presentations 
of the transcript are used than when only one 
is used. The degree of superiority of the male 
speaker over the female speaker, in other words, 
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is influenced by the variable “number of presen- 
tations," and we call this an interaction effect. 
A measure of the interaction effect is given by 
the absolute difference between 8.0 and 24.0, 
which equals 16.0. 

Now examine the two values 30.0 and 14.0, 
which we previously assumed to be estimates of 
the same quantity, the influence of the variable 
“number of presentations.” It is apparent from 
the discrepancy between these two values that 
the superiority of two presentations over one 
presentation is much greater in the case of the 
female speaker than in the case of the male 
speaker. In other words, two presentations seem 
to have a much greater influence in improving 
learning over one presentation in the case of 
the female speaker, and to contribute much less 
in the case of the male speaker. Again, we call 
this an interaction effect between the two vari- 
ables, and this interaction is measured by the 
absolute difference between 14.0 and 30.0, 
which is equal to 16.0. This is the same inter- 
action that we discussed in the previous para- 
graph. We have simply gone at it from the 
point of view of the A variable rather than of 
the B variable. 

It is important to note that in the present 
experiment it is impossible to generalize be- 
yond the particular male and female speakers 
who made the transcript. It is not, in other 
words, possible to conclude that male speakers, 
in general, will be superior to female. No ran- 
dom sample of speakers of the two sexes has 
been incorporated into the design. Similarly, 
we cannot generalize beyond the one and two 
presentations tested in the experiment. And it 
is clear, if the interaction effect is significant, 
that any generalization concerning two presen- 
tations versus one will be qualified in terms of 
the interaction between number of presenta- 
tions and sex of speaker. It is equally true that 
the generalization concerning sex of the speaker 
will be qualified in terms of the interaction of 
this variable with the number of presentations. 

The advantage of the factorial design in re- 
search is that it enables the experimenter to in- 
vestigate several variables in all possible com- 
binations. If only the main effects or a portion 
of them turn out to be significant, the conclu- 
sions drawn will be established on a broader 
base than if the variables were studied one at 
a time in isolation from all other variables. At 
the same time, information is gained with re- 


‘spect to several variables rather than with only 


one variable and this is accomplished with an 
economy of observations. 

If interaction effects are significant, informa- 
tion is available about the nature of the inter- 
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actions of the variables, as is clear from the illus- 
trative example. For these reasons, the factorial 
design is apt to be of great value in the explora- 
tory stages of research, where little is known 


about the possible ways in which several inde- 
pendent variables may interact to produce 
changes in the dependent variable. 


LATIN SQUARE DESIGN 


If r experimental conditions are to be investi- 
gated, these can be arranged in the form of an 
r X r square in such a way that each experi- 
mental condition appears once and only once 
in each row and once and only once in each 
column. If we let the letters, 4, B, C, D, and E 
correspond to the different experimental condi- 
tions, then an example of a 5 X 5 Latin square 
is shown below: 


BEDGA 
CABED 
DBCGCAE 
ECADB 
ADEBC 


The assignment of the letters to the cells is 
at random, subject only to the restriction that 
each letter appears once and only once in each 
row and each column. 

In the Latin square design, as used in psycho- 
logical research, the rows usually correspond 
to subjects. Each subject, for example, is tested 
under all of the experimental conditions but in 
a different sequence or order. The columns of 
the square show which particular experimental 
condition is to be applied to a given subject at 
a given time or period. It will be noted that 
every row (subject) represents a complete rep- 
lication of the experiment and that every col- 
umn (trial or period) also represents a com- 
plete replication of the experiment. The pur- 
pose of this double grouping is to eliminate 
from the total variance the variation attribut- 
able to differences among the row means and 
differences among the column means The 
remaining variation can then be analyzed into 
two component parts: that attributable to dif- 
ferences among the means of the experimental 
conditions and a residual component called 
experimental error. 

For psychological problems, the Latin square 
design should be most useful when the number 
of experimental conditions ranges between 4 
and 9. With fewer than 4 experimental condi- 
tions, the reduction in the degrees of freedom 
available for the error variance may offset any 
gain resulting from the control of the row vari- 
ation and column variation. With more than 


9 experimental conditions the problems in- 
volved in repeated testings of the same subject 
may prove to be difficult. 

More complex designs involving the Latin 
square can be arranged. It is possible, for ex- 
ample, to replicate the same Latin square with 
an additional set of r subjects. The analysis of 
the separate Latin squares will then provide a 
test of technique as described earlier, as well as 
a test of the particular orders or sequences of 
the experimental conditions involved. It is 
also possible to provide replications of the com- 
plete experiment through the use of additional 
sets of r subjects for which the arrangement of 
the experimental conditions is determined by 
other randomized Latin squares which differ 
from the original square. Procedures for pool- 
ing and analyzing the observations obtained 
from these designs are described by Edwards 
(19502) and Grant (1948). Other variations 
in the Latin square design are discussed by 
Cochran and Cox (1950). 

Some disagreement has centered about the 
use of the Latin square design in psychological 
research. The tests of significance involve the 
assumption that interaction effects are not pres- 
ent, and McNemar (1951) has stated his belief 
that this assumption is not tenable. The pos- 
sible interactions are those involving subjects, 
trials or test periods, and experimental condi- 
tions. The Latin square design, however, has 
been extensively applied in biological research 
and in other fields and it is somewhat difficult 
to believe that the conditions of application in 
psychology are vastly different from those in 
biology. 

From the experimenter's point of view, the 
major question would seem to be this; If one 
particular experimental design is used and cer- 
tain conclusions are drawn concerning the dif- 
ferences observed among the experimental 
means, would these conclusions be changed 
upon repetition of the experiment using an- 
other experimental design? It is undoubtedly 
true that some differences will be observed with 
respect to the probabilities attached to the tests 
of significance, yet, at the same time, it seems 
unlikely that with a good experimental tech- 
nique the conclusions drawn from two different 
designs would be reversed. This is a matter, of 
course, that could be put to empirical test. 


EXPERIMENTS: THEIR PLANNING AND EXECUTION 
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BALANCED LATIN SQUARE DESIGN 


In some experiments in which a Latin square 
design is to be used, it may be desirable to ar- 
range the square so that the experimental con- 
ditions are balanced with respect to the imme- 
diately preceding and following conditions. 
Bugelski (1949) cites some cases where this 
may be so. In a balanced Latin square, each 
experimental condition is preceded and fol- 
lowed by every other experimental condition 
equally often. The following is an example of 
a 4 X 4 balanced Latin square. 


ABCD 
BDAC 
CADB 
DCBA 
Taking successive pairs of experimental con- 
ditions, we have: AB, BD, CA, DC, BC, DA, 
AD, CB, CD, AC, DB, and BA. Rearranging 
these pairs of experimental conditions gives 
AB AC AD BC BD CD 
BA CA DA CB DB DC 


and it may be noted that each condition pre- 
cedes and follows every other condition but 


once. This design would tend to minimize any 
systematic tendency for one condition to in- 
crease or depress performance upon a subse- 
quent condition. It is possible to construct a 
balanced Latin square of the sort described for 
any situation where the number of experi- 
mental conditions is an even number. 

When the number of experimental conditions 
is an odd number, a balanced design may be 
arranged with two Latin squares. For this bal- 
anced design, each condition will precede and 
follow every other condition twice. An ex- 
ample involving 5 experimental conditions is 
shown below: 


ABECD DCEBA 
BCADE EDACB 
CDBEA AEBDC 
DEGCAB BAGED 
EADBC CBDAE 


Methods of constructing balanced Latin 
square designs for both an even and an odd 
number of experimental conditions are de- 
scribed by Edwards (1951b). 


CONCLUSION 


This chapter has touched briefly upon the 
problems of experimental design: the planning 
and execution of experiments. The principle 
of randomization, the nature of replication, 
methods of reducing experimental error, ex- 
perimental technique, systematic errors, tests of 
significance, selection of subjects, and the gen- 
eral problem of the design and analysis of ex- 
periments are treated in greater detail in Fisher 
(1942), McNemar (1949), Snedecor (1946) , 
Edwards (1950b), Tippett (1941), Goulden 
(1939), Cochran and Cox (1950), Dixon and 
Massey (1951), Mather | (1947), Johnson 
(1949) , and Kempthorne (1952). 

Although general principles of experimental 
design can be discussed in books, books them- 
selves cannot provide the answers to, or even 
make one aware of, the specific problems of re- 
search that one encounters anew each time a 
research problem is undertaken. This ability 
to deal with specific problems is something that 
perhaps can only be learned by doing, i.e., 
through an intensive period of on-the-job train- 
ing, at first under the supervision of a skilled 


research worker, and then, later, through years 
of individual research. 

The beginning research worker in psychology 
can bring to his first research project a thorough 
knowledge of statistical analysis and the prin- 
ciples of experimental design. He should also 
possess a good background in the techniques 
and methods previously developed for the mak- 
ing and recording of observations. He also 
needs a sound basis in the factual material of 
his subject matter specialty, and some degree 
of theoretical sophistication is also an asset. 
And finally, he should come with a thorough 
knowledge of the organisms he plans to investi- 
gate, whether they be mentally deficient or nor- 
mal children, normal adults or abnormal pa- 
tients, white rats, or cocker spaniels. But all of 
this knowledge cannot be a substitute for actual 
experience in research, It is only by way of 

“preparation for research. 

That on-the-job training for research is de- 
sirable is recognized by most major departments 
of psychology in universities which specialize 
in the training of psychologists. One of the 
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primary purposes of the master’s thesis is to pro- 
vide a research problem that is ordinarily car- 
ried out under the supervision of a senior 
adviser. In those relatively few departments 
which do not require a master’s thesis, on-the- 
job training in research may be provided for 
through minor research projects which are un- 
dertaken as part of the traditional academic 
course work. 

After three to five years of graduate academic 
training, the future research psychologist usu- 
ally undertakes his first major individual re- 
search project, his doctoral dissertation. His 
first concern at this time is the selection of the 
problem to be investigated. Psychology is a 
relatively young science, and the problems for 
which we require answers are still more numer- 
ous than the problems for which answers are 
already available. Yet, the selection of a re- 
search problem is a task that has caused many 
graduate students a prolonged period of anxiety 
and concern. 

As has been pointed out elsewhere (Edwards, 
1946) an obvious and excellent source of re- 
search problems can be found in the theories 
advanced by various writers in a given field. In 
the psychology of learning, for example, the 
theories of Hull (1943) , Tolman (1932), Guth- 
rie (1952), Skinner (1938), and Miller and 
Dollard (1941), have been the starting point of 
many experiments. Sears (1943) has summar- 
ized many experiments which had their origins 
in psychoanalytic theory. The exposition of 
Gestalt theory by Koffka (1935) and the vari- 
ations of Gestalt theory proposed by Lewin 
(1935, 1936) have resulted in many important 
research projects. The theoretical writings of 
Allport (1937) and Murray (1938) in the field 
of personality are still other examples of a rich 
source of research problems, 


Students who have been indoctrinated with 
critical attitudes and who read challengingly the 
literature in their field of specialization will not 
fail to find innumerable projects demanding in- 
vestigation. Students who read with blind ac- 
ceptance of authority and the printed word, on 
the other hand, are apt to overlook assertions and 
statements made by writers which are not based 
upon research evidence. Such assertions and 
statements, when questioned, may very well be- 
come the basis of research problems for the criti- 
cal reader. The proverb “The grass looks greener 
on the other side of the fence” and the many 
variations of this proverb, when questioned, be- 
came the basis of a series of ingenious experi- 
ments for Wright (1937). And Irwin's (1943) 
questioning attitude applied to certain aspects 
of Wright's research led to additional experimen- 
tation. (Edwards, 1946, pp. 278-279) 


As has been emphasized in the introductory 
sections of this chapter and again in the above 
discussion, the posing of a question is the first 
step in research, A question, properly phrased, 
becomes a hypothesis which can then be sub- 
jected to empirical test. After a question has 
been formulated, the next step involves the 
planning of the research, This planning con- 
sists of determining what data will need to be 
collected, the manner in which they will be col- 
lected, and the methods by which they will be 
analyzed, Many hours go into the planning 
stage of an experiment, but these hours are well 
spent. It is at this time that consideration is 
given to the controls that will be necessary, the 
construction of the apparatus that may be 
needed for the experiment, the technique and 
method by which the observations are to be 
recorded, and the actual conduct of the experi- 
ment itself. 

Once the experimental procedure has been 
determined, there is another long period that 
the research worker spends in collecting his 
data. The research may require many hours in 
testing subjects and putting them through the 
experimental conditions. It is not unusual that 
this stage of the research may require anywhere 
from several months to a year to complete. 

After the data have been collected, there is 
still the problem of analyzing the data. Even 
with modern calculating machines, many hours 
may be required before the statistical analysis 
is complete. *With the completion of the sta- 
tistical analysis, the good research worker will 
still spend a good many additional hours on 
the interpretation of his results and in specula- 
tion about new problems which his results raise. 

At each of the various stages of a research 
project the research worker encounters problems 
that are unique to his particular research, prob- 
lems that require solutions and decisions be- 
fore he can move on to the next Stage. It is in 
the doing of research, in the finding of solutions, 
in the making of decisions, that one learns to 
appreciate the complexities and difficulties of 
research, as well as the satisfactions that come 
from having planned and executed a well- 
designed experiment. This is a difficult feeling 
to convey in words, but those who have experi- 
enced the feeling of accomplishment which 
comes from the completion of even a minor 
but successful research problem will recognize 
what is meant. 

Writing a good research report and making 
the results of the research available to other in- 
vestigators is itself an arduous task. The article 
must be in keeping with scientific brevity, yet it 
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should be sufficiently complete so that the reader 
will understand in detail the experimental 
procedure as well as the major findings. The 
article which eventually appears in one of the 
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psychological journals may take only ten or fif- 
teen minutes to read, but it is a tribute to the 
patience, persistence, and ingenuity of the 
research worker. 
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CHAPTER 8 


Selected Quantitative Techniques 


FREDERICK MOSTELLER AND Rosert R. Busu* 
Harvard University 


In the preceding chapter, Edwards has given 
a general introduction to the problem of the 
design and execution of experiments. That 
chapter included a general background of com- 
monly used statistical methods. In this chap- 
ter we follow with certain additional quantita- 
tive techniques that should be of value in the 
design of experiments or the analysis of their 
results. Three main considerations have guided 
us in choosing this set of techniques. First, we 
have for the most part avoided presenting again 
materials available in the several excellent ele- 
mentary texts on statistics widely available to 


"psychologists; indeed, we assume our readers 


have already digested one such text. We must 
emphasize therefore that the present chapter is 
not designed for the beginner but is an ad- 
vanced chapter for a person concerned with a 
particular research problem or with a general 
interest in the analysis of data. It is ironic that 
an advanced chapter contains some of the very 
easiest techniques to apply. Second, we have 
tried to present topics that students and col- 
leagues frequently encounter and inquire 
about, but for which it is not usually easy to 
provide a reference. Third, we have tried to 
deal with some problems which are not so often 
raised, but which we believe deserve much more 
attention than they ordinarily get. Wherever 
possible, we have tried to make the discussion 
self-contained, but it will be understood that 
some extensive tables and expositions could not 
be reproduced here. On the other hand, except 
for common tables of the normal, t, y?, F, 
squares and logarithms, and the not-so-common 
inverse sine tables, we have provided necessary 
tables or charts to accompany any method we 
recommend, 

The material in this chapter, as the title im- 
plies, represents a rather arbitrary selection of 
techniques. Therefore, the reader should not 
expect much continuity between the main sec- 
tions. But all the techniques have one com- 
mon bond: each has been useful in helping 
social psychologists solve some of their quanti- 
tative problems. 


Instead of a simple table of contents, we give 
the titles of the sections, together with a few 
remarks about the techniques discussed. 

1. Operating Characteristics or Power Curves 
for the Common Statistical Tests of Signifi- 
cance. How large a sample do I need? Can I 
detect a real difference between experimental 
conditions? The purpose of this section is to 
discuss such questions, insofar as they can be 
discussed in the light of considerations prelim- 
inary to the experiment. This section is more 
difficult to read than the remainder of the 
chapter. 


“We are indebted to the Laboratory of Social 
Relations for facilitating this work. Doris Entwisle 
assisted us in all but the final stages of preparing 
the manuscript, Cleo Youtz assisted in preparing the 
manuscript for publication, and Lotte Bailyn, 
Dwight Bartlett, and Thurlow Wilson assisted in 
preparing certain tables. We are also indebted to 
various persons and organizations for permission to 
reproduce tables or materials. Any errors are our 
responsibility, of course. The material on operating 
characteristics is largely drawn from Ferris, Grubbs, 
and Weaver (1946); permission for reproduction 
of the figures was given by these authors and the 
Institute of Mathematical Statistics. Tables for the 
short-cut method of allowances were made available 
by R. F. Link and D. L. Wallace, who produced 
them in connection with research sponsored by the 
Office of Naval Research. We also had advice from 
J. W. Tukey on the allowances section. L. E. Moses, 
the editors of the Psychological Bulletin, and the 
American Psychological Association gave permission 
to draw heavily on Moses (1952) for the section on 
nonparametric methods. Permission to reprint 
Dixon and Mood’s (1946, pp. 560-562) sign test 
tables (VI and VII) was given by those authors and 
the American Statistical Association. Table VIII 
for Wilcoxon’s (1945) signed rank test is reproduced 
by permission of its author and the Editor of Bio- 
metrics. Kruskal and Wallis and the American Sta- 


‘tistical Association gave permission to reproduce the 


significance levels for the ranked one-way analysis of 
variance (Table XIII). Tables XIV and XV are 
derived from more extensive tables by Swed and 


Eisenhart (1943). 
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2. Short-cut Allowances in the Analysis of 
Variance. When a number of groups of sub- 
jects are studied under differing experimental 
or field conditions, it is desirable to be able to 
recognize differences between specific pairs of 
group outcomes. The classical analysis of vari- 
ance does not do this. A very easy method for 
making such comparisons is provided. 

3. The Matching Problem. One extremely 
common form of experimental design in social 
psychology involves subjects or judges who try 
to place pictures, statements, protocols, or other 
similar stimuli into categories. These subjects 
are given scores equal to the number of times 
they place objects in the correct category. Or 
pairs of judges get scores equal to the number 
of objects they place in the same category—for 
example, in reliability studies. Although this 
technique is frequently used, students are usu- 
ally at a loss to compare the data with chance 
outcomes because the appropriate discussion 
does not usually appear in an elementary text, 

4. Nonparametric Methods. These tech- 
niques are designed to replace the usual statis- 
tical tests in order to avoid stringent assump- 
tions. Usually nonparametric methods are easy 
to apply, and they have an intuitive appeal for 
many research workers. 

4a. The Sign Test. When matched pairs of 
subjects are studied under two conditions, the 
better condition is the one scoring higher in 
most pairs. A table of sample sizes necessary to 
detect given differences is provided. 

4b. The Median Test. A simple method for 
comparing two samples. It is one replacement 
for the t-test in unmatched groups. 

4c. Wilcoxon's Matched Pairs Test Using 
Signed Ranks. Like the sign test and the me- 
dian test, this is a replacement for the t-test, and 
it seems to be the best nonparametric replace- 
ment known for matched pairs. 

4d. Mann-Whitney Test. Ranking methods 
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are used for testing differences betweeri two un- 
matched groups. 

4e. Whitney's Extension of the U-Statistic. 
A device for testing whether one of three groups 
performed best or, alternatively, a device for 
testing whether performance of the three groups 
was in a predicted order. 

4f. The Problem of m Rankings. Do judges 
generally agree about the ranking of a set of 
objects? A method is given for testing the con- 
cordance of several rankings. 

4g. The Complete Ranked One-Way Analysis 
of Variance. The usual analysis of variance is 
replaced by one using ranks. This is the natural 
generalization of the Mann-Whitney test to 
more than two groups of subjects. 

4h. Wald-Wolfowitz Run Test. Do high and 
low observations alternate too frequently? Or 
do certain kinds of observations seem to be 
bunched together in time, space, or size instead 
of being evenly spread? The run test is a 
method of studying over- or underclustering of 
sequences of observations. 

5. Quick Methods. Devices for using the 
range as a measure of variation in small samples 
for those who are weary of squaring numbers, 
but are willing to lose a little information in 
the analysis of data. 

6. Transformations. Data that arise from 
counting things (successes, acts, choices, etc.) 
sometimes contain extra information about va- 
riability that can be obtained by adjusting or 
transforming the original observations. If 
there are important uncontrolled variables in 
the study, analysis of transformed data may re- 
veal this fact. 

7. Combining Tests of Significance. On a 
given psychological problem some studies have 
shown significant results, while others have not. 
How can we combine all these results to make 
one grand summary? Pitfalls and pros and cons 
of three techniques are described, 


OPERATING CHARACTERISTICS OR POWER CURVES FOR 
THE Common STATISTICAL TESTS OF SIGNIFICANCE 


A question of general interest to an investiga- 
tor planning an experiment is whether his de- 
sign, including sample size, has a good chance 
of detecting differences among conditions, meth- 
ods, or ethnic, age, or other population groups. 
The primary value of a study of group differ- 
ences is lost if the investigation is so insensitive 
that important differences are obscured by sam: 
pling variation. We seldom enter an investiga- 
tion completely in the dark about the magni- 
tudes of differences we expect to observe (al- 
though there seems to be great cultural value 


in pretending no one has ever done anything 
like our experiment before). This general in- 
formation about magnitudes of effects and the 
variation ordinarily observed can be very use- 
ful in planning a study. Of course, one does 
not know exactly what to expect, but it is 
obvious, for example, that a person comparing 
groups on a 6-point rating scale is unlikely to 
find average group differences as large as 4 
points. Many paper and pencil tests have been 
standardized on known populations, so some 
notion of the means and standard deviations is 
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available for them. Similarly, various experi- 
mental devices have been used in many differ- 
ent situations and for them various kinds of 
baseline data are available to the investigator 
willing to take the trouble to look. These data 
can be used in a formal way in the design of an 
experiment to see what size of difference the 
experiment is likely to detect, or to help in 
revising the experiment if it is too insensitive 
to possible differences. To illustrate the use of 
such information it will be necessary for us to 
go into more detail on the matter of Type II 
errors than did Edwards at the close of his sec- 
tion on Tests of Significance in Chapter 7. The 
reader may wish to review that section before 
proceeding. 

If we state a null hypothesis, draw a sample, 
and make a statistical test such as the t-test or 
the F-test, we may make a mistake in our final 
decision. These mistakes or errors have been 
classified by Neyman and Pearson as: 

Type I error: rejecting the null hypothesis 
when it is true, 

Type II error: accepting the null hypothesis 
when it is false. 

The chosen level of significance, for ex- 
ample, 0.05, controls the probability of mak- 
ing a type I error, and this probability does 
not change. This is the error that has cus- 
tomarily received the lion's share of experi- 
menters' attention. The probability of making 
a type II error is not constant, rather it depends 
on the wrongness of the null hypothesis, the 
sample size, the significance level chosen, and 
the particular method of testing significance. 
Everyone agrees that it is important in experi- 
mental or survey design to give consideration to 
the type II error. The experimenter wants to 
be able to detect a difference when there is a 
fairly substantial difference — otherwise he 
would not be bothering to do experiments. In 
the social sciences attention to type II errors has 
been scanty, although people quite often ap- 
proach the problem obliquely by wondering 
how many observations they should take. One 
reason for this neglect of the type II error may 
be that materials for studying the type II error 
are not readily available. However, Ferris, 
Grubbs, and Weaver (1946) present curves 
showing the probability of making type II er- 
rors for some common statistical tests. "These 
curves are called "operating characteristics." 
The power of a test is the probability of not 
making a type II error. 

Strictly speaking, the operating characteristic for 
a test gives the probability of accepting the null 
hypothesis for each possible situation, and the power 
curve gives the probability of rejecting the null 
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hypothesis for each possible situation. When the 
definition is given in terms of type II errors a slight 
confusion can arise about the existence of a type II 
error when the null hypothesis is, in fact, true. 
Nevertheless, it seems worth while to emphasize the 
notion of type II errors in the text. 

The term "operating characteristic" is such a good 
one that it is used for more than one thing. In 
Chapter 9, the term is used to refer to the behavior 
of an item on a test. The similarity in idea is not 
as remote as it would seem at first glance. 


It follows that the power of a test and its oper- 
ating characteristic are two ways of giving the 
same information. There are cases, particu- 
larly for nonparametric tests, where it is not 
easy (in fact, it is often impossible) to write 
down the power of the test. The reason is that 
nonparametric tests offer the possibility of test- 
ing a great variety of hypotheses all at once. 
We shall see later that in order to know how 
strong or powerful a test is we need to know the 
specific distribution from which the sample is 
drawn, exactly what the null hypothesis is and, 
even more, exactly what the alternative hypoth- 
esis is. This section will discuss cases where we 
do know distributions and hypotheses explicitly. 

We begin by describing the operating char- 
acteristic for a binomial situation, where com- 
putation is easy. Suppose we have a factory 
making "fair" coins, coins that have a proba- 
bility of 14 of coming up heads each time they 
are spun. The inspection procedure for each 
coin consists in spinning the coin 5 times and 
observing the outcomes. The null hypothesis is 
that the coin has a probability 14 of coming up 
heads. 'The alternative hypothesis is that the 
probability is not 14. Coins with better than 
a 50-50 chance of coming up heads will produce 
more heads than fair coins. Our acceptance 
and rejection procedure is taken arbitrarily to 
be 

(1) accept the coin as "fair" if 1, 2, 3, or 4 

of the 5 coins fall heads, 

(2) reject the coin as “unfair” if 0 heads or 

5 heads appear. 
In the language of testing hypothesis, the null 
hypothesis Hy and the alternative hypothesis 
H, are 


Hy: p = 

Hy p 7# Ves 
where p is the probability of a head on a single 
spin. 


The significance level æ is the sum of the 
probabilities associated with rejection when the 
coin is, in fact, fair. Since the probability of 5 
heads is 345 — Déi . Déi - (34) - 04) - Déi, 
and that for 0 heads is similarly 345, the value 
of a — Lia — 0.0625. This is also the proba- 
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bility of a type I error. The probability of not 
making a type I error is 1 — o = 0.9375. 

When the actual value of p is different from 
0.50, the probability of rejection is p> + 
(1. — p) 5, the two terms corresponding to the 
probabilities of 5 heads or 5 tails in 5 throws. 
The significance level, of course, remains 0.0625. 
In fact, the formula just given represents the 
power of the test, for by letting p take on values 
different from 14 we can compute the probabili- 
ties of rejecting the null hypothesis for every 
value of p. For the operating characteristic, 
we have 


probability of accepting Hyp = 1 — p5 — 
(1— p) 5. 


A table of this probability of accepting Ho 
against p is the operating characteristic. This 
is given under Test ] in Table I. We see that 
when the coin has a bias of 0.1 in favor of heads 
or tails (ie, p — 0.6 or 0.4), the probability 
of accepting it as fair is still 0.91, scarcely differ- 
ent from the 0.9375 that goes with fair coins. 
We accept coins with p = 0.8 (these coins come 
up heads 4 times out of 5 on the average) about 
two-thirds of the time. How bad must coins be 
before we will be almost certain (say 95% sure) 
of rejecting them with our test? The coin has 
to be so defective that it throws heads (tails) 
99 times out of 100, What has the study of this 
operating characteristic told us? If we are the 
"fair coin" factory owner we would feel that 
such a test would scarcely be worth using be- 
cause we would not be able to give much of a 
guarantee about single coins. We have discov- 
ered that the particular test is rather ineffective 
for detecting differences from the null hypothe- 
sis p = Lë. How can it be improved? We could 
change the significance level. We don't like to 
increase it because that would increase the 
probability of type I errors as well as decrease 
the probability of type II errors. Increasing 
the significance level lowers the entire curve but 
we have to pay for this by scrapping more per- 
fect coins. We are already scrapping about 
7% of the perfect coins if there are any—why 
scrap more? We can't really find a more sensi- 
tive statistical test based on the same number of 
observations. (We could, of course, take di- 
mensional measurements—these would take us 
away from this introductory discussion—but we 
would again wish to study the operating char- 
acteristic of new test statistics and similar prob- 
lems would crop up.) The only other solution, 
if we wish to improve the inspection procedure, 
is to increase the number of trials per coin and 
adjust the rejection numbers. 


TABLE I 


OPERATING CHARACTERISTIC FOR BINOMIAL 
TESTS OF p = 0.5. 


In Test 1, the sample size is 5, and we reject 
if 0 or 5 successes appear. 

In Test 2, the sample size is 19, and we reject 
if 0-5 or 14-19 successes appear. 


Test 1 Test 2 
Significance level Significance level 
0.0625 0.0636 
P (accepting P (accepting 

True p p = 0.5) p = 0.5) 
1.0 0.0000 0.0000 
0.9 0.4095 0.0086 
0.8 0.6720 0.1631 
0.7 0.8295 0.5260 
0.6 0.9120 0.8340 
0.5 0.9375 0.9364 
0.4 0.9120 0.8340 
0.3 0.8295 0.5260 
0.2 0.6720 0.1631 
0.1 0.4095 0.0086 
0.0 0.0000 0.0000 


Before discussing this point, let us relate this 
same operating characteristic to a social science 
investigation, Instead of “fair coin” producers, 
we produce some social psychology hypotheses 
for test. Scientists have fertile minds and pro- 
duce hypotheses in quantity and, of course, 
there are many of them producing these ideas. 
Many of the hypotheses are of a tendential sort 
—groups of individuals of type A differ from 
those of type B on such and such a measure. 
Suppose for each such hypothesis we get 5 
matched pairs of individuals, and whenever all 
5 type A members score higher (lower) we re- 
ject the hypothesis of equality of the two kinds 
of individuals on the measure. The operating 
characteristic for Test 1 describes the perform- 
ance of this new test. The hypothesis that a 
difference exists corresponds to the alternative 
hypothesis in statistical language: p — 1⁄4. The 
null hypothesis is that no difference exists: 
p — VY. Now only half the hypotheses in which 
87% of the type A individuals in the popula- 
tion have larger values on the measure than 
their type B mates will be "accepted" or con- 
firmed on the basis of this test. The shape and 
properties of this particular operating curve 
turn statisticians prematurely gray when a small 
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pilot study is proposed to see if there is "any- 
thing in" a particular hypothesis. It may be a 
wonderful hypothesis, but we are not likely to 
get much support for it from this test. How- 
ever, a small pilot study may be very useful 
from the standpoint of planning and design. 
An intelligent examination of an operating 
characteristic before and during the design of 
an experiment can suggest not only the order 
of magnitude of the number of observations 
that may be required, but it can also suggest 
that some experiments will almost certainly 
yield equivocal results because of lack of facili- 
ties, or inadequate measuring instruments, or 
lack of time, and therefore that the experiment 
should either not be undertaken or its plan 
revised. 

What would it be worth in the social-science- 
hypothesis experiment to use a sample of about 
20? (We use 19 so that we can make the sig- 
nificance levels of this test and the test with 5 
observations almost identical.) The test now 
consists of rejecting the null hypothesis if 0, 1, 
2, 3, 4, 5, or 14, 15, 16, 17, 18, or 19 outcomes 
of the same kind are observed. The operating 
characteristic is that of Test 2in Table I. With 
n = 5 and p = 0.8, the probability of accepting 
the null hypothesis was about 0.67, but with 
n — 19, the probability of accepting the null 
hypothesis is only 0.16. How big a p do we 
need to be almost certain (0.95) to reject the 
null hypothesis? A p of about 0.85 will be re- 
jected this often. With n — 5, a p of 0.99 was 
required. So the increase from 5 observations 
to 19 observations does make a substantial dif- 
ference in ability to discriminate between hy- 
potheses, as Table I shows. 

What about a man who says no null hypothe- 
sis is ever correct? 1f we had 200,000 flips of a 
coin, for example, almost certainly we would dis- 
cover that it or the method of spinning was im- 
perfect. The point seems to us to go something 
like this. It is unlikely that any mathematical 
model for a situation is perfect, and therefore 
we agree that it is unlikely that any null hy- 
pothesis is really true. We are interested in 
detecting departures from the model, and there- 
fore from the null hypothesis, that are “substan- 
tial,” i.e., that are consequential as far as deci- 
sions are concerned. Reasonable size samples 
will not discriminate well between measures 
that are very close, say p 0.500 and p — 0.501. 
Such close results are not of great consequence 
in most behavioral situations. Consequently, 
from the investigator's viewpoint, there is really 
a "don't care" band in the neighborhood of the 
value of the null hypothesis, and as we depart 
further and further from the null hypothesis 
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value, the investigator takes a more interested 
attitude toward the difference. The operating 
characteristic behaves in an analogous manner. 
In the vicinity of the null hypothesis value, the 
null hypothesis is rarely rejected, and values 
nearby are not often rejected either. As the 
departure from the null hypothesis increases, 
an operating characteristic shows that we are 
more and more likely to detect the departure. 
We do not wish to confuse the issues of practical 
importance with "statistical significance" On 
the other hand, when an investigator chooses a 
test, a significance level, and a sample size, he 
has almost automatically implied that his atti- 
tude toward differences (operationally) is con- 
sistent with the shape of the operating charac- 
teristic. 

Suppose the investigator says: “I don't like 
making all these mistakes—I want to reject for 
sure if the true value is outside 0.45 to 0.55 and 
forget about the matter otherwise." This state- 
ment specifies the operating characteristic pre- 
cisely and, incidentally, the sample size is un- 
wittingly determined too. The sample size is 
infinite, so our investigator will not get enough 
data in his lifetime to settle the matter as far 
as he is concerned. Once this point is firmly 
established in an investigator's mind, an inter- 
est in operating characteristics is usually appar- 
ent. For further discussion, see Number of Sub- 
jects, in Chapter 7. 

For purposes of illustration we have discussed 
a binomial example. We can apply the same 
ideas to the usual statistical tests, and we shall 
explain how this is done, but we shall not derive 
the operating characteristics. Instead, we pro- 
vide charts for some commonly used tests, to- 
gether with illustrations of their use. In the 
operating characteristic figures the significance 
level chosen is 0.05. The appropriate tables: 
x2, F, t, and the normal, will be found in many 
elementary texts. 

We shall discuss two applications of opera- 
ing characteristics to chi-square tests. These 
chi-square tests are used to compare observed 
sampling variation with the variation expected 
from a standard population. Thus we want to 
test whether the scores of our sample are more 
homogeneous or more heterogeneous than those 
of a large known population tested on some 
other occasion. (There may be confusion when 
we call these “chi-square tests" because social 
scientists usually employ this name exclusively 
or contingency tables or for goodness-of-fit tests. 
"These latter tests are not being discussed.) 

Power function of the chi-square test. The 
present chi-square test is used to accept or reject 
the hypothesis that the true standard deviation 
e 
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v, of a normal population equals a specified 
value c. 


The hypotheses are 
Ho: 
Hi: 


kt —0, 
Gye Ao Ua 


Thus, we are making a one-sided test (for the 
distinction between one- and two-sided tests, 
see Tests of Significance, in Chapter 7) , because 
we are asking whether or not our standard devi- 
ation is larger than the standard (Ge, À larger 
than unity). Let us suppose that we have a 
set of observations drawn from the normal, and 
that their values are x1, X9,..., Xn where n is 
the sample size. The test statistic is 


(n — 1) 5? 
Lien eir Em 


n 
p d 
i=l 
ak NE 
(Hereafter if we drop the limits on the summa- 
tion, it is to be understood that summations run 
over all the observations of the sample for 
which they are appropriate.) 

The test consists of rejecting the null hypothe- 
sis if y? > y ¢» and accepting otherwise. Here 
xd is the a percentage point of the chi-square 
distribution as it is commonly tabulated, i.e., 
large chi-square values go with small values of 
«. The number of degrees of freedom is one 
less than the number of observations. 

Example of chi-square test. A standardized 
paper and pencil test is to be given to hospital 
patients. The social science hypothesis is that 
owing to the disease they suffer from, the pa- 
tients will have more variation in their scores 
than will the group who served to standardize 
the test. The standard deviation of the test 
for normal groups is ¢ — 10. The investigator 
guesses that the hospitalized group will have 
a standard deviation of about c, — 20. Thus 
A is conjectured to be about 20/10 — 3. If the 
value of 3 is this large, and if the investigator 
wants to be at least 90% sure of rejecting the 
null hypothesis that c, — 10, how big a sample 
does he need when using the 5% level of sig- 
nificance? On the horizontal axis of Fig. 1 go 
to à — 2.0, then read up until a curve is found 
crossing the ordinate at a probability of accept- 
ing of 0.1. A sample size of 10 meets the re- 
quirement nicely. Because of the facilities 
available, it will be easy to get 15 observations 
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and so the chance of rejecting is now up to 0.96, 
and even if ) is only 1.7 the chance is about 0.9 
of rejecting the null hypothesis with n — 15. 

Another use of the chi-square test. Fig. 2 is 
similar to Fig. 1. The same test statistic is used; 
the hypotheses are 


Ho: 
H;: (kel); 
and we reject if y2 < x2 — , accept otherwise. 


01 =C, 


oy = AS, 


The test is again one-sided; we are testing 
whether the standard deviation of the special 
normal population under study is smaller than 
the standard value c. 

Power of the F-test. Yn simplest terms, the 
F-test is a comparison of independent sample 
estimates of two normal population variances. 
The one-sided tests inquire whether the popu- 
lation variance from which sample | is drawn 
is larger than the population variance from 
which sample 2 is drawn, We no longer have 
a known standard c, as we did earlier. 

The hypotheses are 


Ho: 0j = 05, 
Hi: 91 = Aë, A> 1). 
The test statistic is 
5? 
$92 s 
where 
52 Zh Sai 2 X, — xa) ? 
1 n, —1 A. n; — 1 


If the observed value of F exceeds the tabled 
value Fy, we reject the null hypothesis of equal- 
ity of variances, otherwise we accept, Note that 
there is no question of putting the larger mean 
square over the smaller mean square. We are 
clear that ¢, is to be larger than, or the same as 
75; therefore we divide 5,2 by s3? to get F and 
enter anybody's F-table with n} — 1 and n5 — 1 
degrees of freedom respectively, Of course, if 
5,? is less than 552, there is no need to consult 
the F-table at all—we accept the null hypothesis. 

Example of the F-test. We have the hypothe- 
sis that men in a certain population are more 
variable in their attitudes on a certain issue 
than are women. We measure the variability 
by the sample variance of a battery of questions 
with scores roughly normally distributed. The 
test statistic F is the ratio of the estimate of 
variance for men to the estimate of variance for 
women on the basis of the observed sample. 1f 
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LO, 


LET O, AND Ki BE THE STANDARD DEVIATIONS 
OF THE TWO NORMAL UNIVERSES FROM WHICH 
SAMPLES OF D AND n, ITEMS ARE DRAWN. 
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the sample sizes for men and for women are 
both 10, how much larger does the standard 
deviation for men have to be for us to be 80% 
sure of rejecting (20% sure of accepting) the 
null hypothesis? Reading from the curves for 
n — 10 from Fig. 3 we see that it needs to be 
2.4 times as large. Suppose the standard devi- 
ation for men is twice that for women; how 
large a sample do we need to be sure of rejecting 
90% of the time? We read the answer as about 
20 in each group. To take another example, 
suppose we have a ratio of standard deviations 
of 2 (men to women) and we have the choice 
of a sample of 40 men and 20 women or a sam- 
ple of 30 men and 30 women—which is prefer- 
able? From Fig. 5 we find that with 40 men 
and 20 women we accept the null hypothesis 

(wrongly) about 595 of the time, with 30 men 
and 30 women about 4% of the time, so the 
equal split is more desirable. 

In principle, these curves can be used for 
the study of the power of single classification 
analysis of variance, with n} = number of 
groups, n9 — 1 — number of degrees of free- 
dom forerror. But the charts do not give suffi- 
ciently disparate numbers for n, and no to 
make them very useful for this purpose. See 
Ferris, et al. (1946, p. 188) for discussion. 

Power of the normal test, known standard de- 
viation, two-sided. 'The problem is to test on 
the basis.of a sample whether the mean of a 
normal population departs significantly from a 
standard. Often the standard is the value zero 
(when we are testing differences), or the mean 
of a census population, or a wellestablished 
standard published with an intelligence or other 
test based on a large number of cases Hom a 
specified population. 

The null hypothesis and its alternative are 


Ho: y —a, 
Hy: | y —a |, =n, (0). 


Here y. is the true population mean, a is the 
standard, c, is the known standard deviation of 
the normal population, and 3 is a measure of 
the departure of the alternative hypothesis from 
the null hypothesis. The vertical bars in H4 
mean absolute value (magnitude of the differ- 
ence without regard to sign) and this implies 
that the test is two-sided. We are just as inter- 
ested whether y. is smaller than a or bigger than 
a. The test function is the critical ratio 


vn (x —a) 


oy 


u = 


where n is the number of cases in the sample, 
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and x is the sample mean. If | u | > 1.96 we 
reject the null hypothesis at the 5% level. This 
is the classical critical ratio test, properly used 
because c, is known. Figure 6 gives the oper- 
ating characteristic for this test. 

Example of normal test, known standard de- 
viation, two-sided. For sixth grade school chil- 
dren the mean and the standard deviation of 
scores on a personality test are 100 and 10, 
respectively. How likely are we to detect a 
departure of 15 from this mean in a certain 
special class of schools if we have 50 children 
drawn at random from their sixth grade classes? 
Here A — 15/10 — 1.5. The curve for n = 50 
is kissing the zero line for accepting the null 
hypothesis, so we seem practically certain to 
reject. Indeed, practical certainty extends down 
to } — 0.7, or a departure of 7 points in true 
special population mean from grand mean of 
100. 


Actually, not all these curves are needed. 
They are all related in this way: if n^ and 17 are 
a sample size and a departure index we are 
interested in, then some other reference curve 
—that for n—can be used to determine the 
probability of rejection by using the new n with 
a new) — X y n'/n. For example, if n^ = 12, 
Al = | be the pair of interest, then we need not 
interpolate between curves, for we can use 
n = 3, à = 1l V 12/3 = 2 to get  — 2, and 
read 0.07 from the curve. This transformation 
device greatly increases the utility of the curves, 
provided that we always choose an n for which 
the curve is given. 

This trick is especially useful for the two- 
sample test, where we have samples drawn from 
two normal populations with unknown means 
vo and po and known standard deviations 94 
and o». The sample means are x, and x5, and 
sample sizes are n, and nọ. We want to test 
whether the population means are equal or not 
(two-sided test). The test function is 


DS X1 — X2 
0,2 052 
ni fi» 


Again the hypotheses are 
Ho: 
Hi: 


Wa = p2, 


| p — ue | =No; (Vv > 0). 


We take o/c; — k. Then we introduce the 
concept of effective sample size n’. Its formula 
is 

nus 


N’ -————* 
ken, + ng 
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We use n’ as if we had a single sample instead 
of two samples. Suppose we guess p, — po is 
about 10, and know 2; — 5, ën — 10; then 2’ 
= 2 and k — 2. If we have sample sizes n, — 
5, na — 20, then 


5x 20 Sc 


4x5 +20 


D 


Sadly enough, there is no curve for n' — 2.5, 
but we can use the transformation trick. If we 
take n — 1, then 


Lee UE Ty C KT 
n 1 


This value of 3 gives a reading for the curve 
n — 1 of about 0.11 as the probability of ac- 
cepting. 

Power of the t-test, two-sided, This is the 
classic test used to accept or reject the hypoth- 
sis that the true mean y. of a normal distribu- 
tion equals a standard a. 


The hypotheses are 


Ho: ya, 


Hy: |y —a| = s, DD. 


The test function is 


sth (%;— x) 2 
n—1 


Pu vin (x—a). 
D 


We reject the null hypothesis if t > tg, where 
iy is the two-sided percentage point of the or- 
dinary Student t-table (example: (9,95 = 2.086 
with 20 d.f., i.e., n = 21). 

Example of the t-test, two-sided. Edwards 
illustrates the t-test with a comparison of per- 
formance of matched pairs working under con- 
ditions of noise, where 4 members have the 
attitude that noise increases output, while B 
members have the contrary attitude (Edwards, 
1946, p. 173). Let us take his observed meas- 
ures of mean and errors as true and ask how 
large a sample of matched pairs we need to be 
90% sure of getting significance. He has 


sı = 0.63, | y—a | —1. 


Then à = 1/0.63 = 1.6, and from Fig. 7 we see 
that we need about 6 matched pairs. 

Power of the t-test, two-sided, for testing dif- 
ference between two sample means. We can 
also use Fig. 7 for the operating characteristic 


of the test of significance of differences between 
means of two independent groups when the 
true standard deviations are equal. The test 
function is 
M a TUE 
1 1 
512 y m 


ni Mg 


S12 = VEE 
i n; dna —2 


where t has ny + n; — 2 degrees of freedom. 
Here X, and x, are the sample means and the 
sums of squares are taken for each sample sep- 
arately. 


Our hypotheses are 


Ho: p = po 
Ay: dpa — u | =o, Q>9), 
where ¢, is the unknown value of the standard 
deviation common to populations | and 2; the 
ratio of the standard deviations is unity 
(k = D 

As before, we can take the effective sample 
size as 
flent U D 

ny + Ng 


but for entering the figure we use 


ae 
n 


where n — n; + n; — 1. Note that we no 
longer have an arbitrary choice for n. 
Example of two-sided t-test, for two sample 
means. If an attitude battery has standard de- 
viation c, roughly equal to 10, and the true 
difference between two ethnic group means is 
5, how likely are we to accept the null hypothe- 
sis of no difference with samples of 25 and 50? 


5 25 x 50 
(get e LD Pee ee 
Mer guit Bn Et 


but we do not enter Fig. 7 with these; we still 
need 


€ — 
MN os 16.7 "0.238, n 74, 
n 74 


and we read the probability of accepting the 
null hypothesis as abput 0.45. 


€ 
"T c 
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SHORT-CUT ALLOWANCES IN 


In classical analysis of variance, we use the 
significance test (F-test) to decide whether or 
not all group means come from the same popu- 
lation. At best this is a trivial kind of question. 
Once we decide the groups are not identical, we 
want to know whether some specific groups have 
scored higher than others, and about this the 
classical analysis is strangely silent. In this sec- 
tion we present an easy way of making such 
comparisons. For general background, see 
Analysis of Variance, in Chapter 7. 

The obvious way to compare groups in a 
one-way analysis of variance design is to make 
t-tests between pairs of groups, and this tech- 
nique has often been used in the psychological 
literature. The shortcoming is that we may be 
fooling ourselves about the confidence level 
being used. Suppose we have 10 groups, all 
from the same population, then on the average 
it turns out that the largest observed mean and 
the smallest observed mean fall 3.1 standard 
errors (Gin apart. A test of the difference 
of these extreme means gives quite a good 
chance of rejecting the equality of population 
means. If we use Fig. 6, we can regard each 
observation on a mean as one observation, and 
the standard deviations of these means can be 
regarded as 1. The effective sample size for the 
two means is 


mr wl Ege teen 


1 
1+1 
Choose n = 1, then } = 3.1 y 0.5/1 = 2.2, 
and we read that the probability of accepting 
Hp is about 0.4, which is a far cry from the 
desired value of 0.95. This difficulty has pre- 
vented some authors from using all possible 
t-tests, while others have gone ahead, no doubt 
rightly feeling that there was little else they 

could do. 

In the past few years several authors. (Dun- 
can, 1951, 1952; 'Tukey, 1949, 1951; Moshman, 
1952; Link and Wallace. unpublished: Scheffé, 
1953) have given attention to various rami- 
fications of this problem. In 1953, we are a 
long way from seeing this problem wrapped 
up, but in this section we give one easy ap- 
proach that may meet with favor, Following 
Tukey's lead, Link and Wallace developed 
some methods of setting allowances on differ- 
ences between group means. These allowances 
are similar to confidence limits. 


ONE-WAY CLASSIFICATION 


The method of Link and Wallace can be used 
on one-way analysis of variance, and two-way 


HANDBOOK OF SOCIAL PSYCHOLOGY 


THE ANALYSIS OF VARIANCE 


analysis with one observation per cell. It uses 
ranges as a measure of variation rather than 
sums of squares. (A range is the largest minus 
the smallest observation.) 


It seems not to be well known among social scien- 
tists that the range is quite a good measure of varia- 
tion as long as sample sizes are reasonably small. 
As sample sizes get large the statistical efficiency of 
the range does decrease relative to the sample stand- 
ard deviation. Nevertheless, it would be instructive 
to compare the accuracy of estimates of standard de- 
viations computed from ranges and from sample 
standard deviations by a representative group of 
scientists. The thought is that the mistakes in add- 
ing, squaring, subtracting, dividing, and square 
rooting required by the sample standard deviation 
may actually contribute more serious discrepancies 
in routine calculations than is generally realized. 
Estimation from ranges seems to require fewer opera- 
tions. Or do we all check our calculations so care- 
fully that this thought can be put aside as unworthy? 


More complicated designs require other ap- 
proaches which we shall not attempt to discuss 
here. We illustrate with an example from Ed- 
wards (1950, p. 298-299) on maze learning 
under different instructions. In Table II we 
present the data and computations useful in the 
analysis. Note that there are three groups of 
seven members each. 


"TABLE II 


ERRORS ON FIFTH TRIAL OF STYLUS MAZE FOR 
THREE GROUPS OF SEVEN SUBJECTS, EACH GROUP 
HAVING DIFFERENT INSTRUCTIONS 


Instructions 

(1) (2) (3) 
“Intelligence “Average "Few errors 
test" 20 errors" as possible" 

20 20 29 

22 17 22 

26 26 23 

20 23 21 

26 18 32 

26 21 34 

21 22 24 

Sum 161 147 185 

Range 6 9 18 


Sum of ranges 28 


We enter Table HI with number of groups 
(—3) and number in group (—7) and observe 
the critical factor is 1.17, Multiplying the sum 
of the ranges by 1.17, we get 32.8. The impor- 
tance of the number 32.8 is: 
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(i) If the totals of any two groups differ by 
more than 32.8 we "recognize" the ex- 
istence of a real difference between the 
pair of groups. 

'The true difference between any two 
group totals is "recognized" as lying 
within + 32.8 of that observed. 

If we "recognize" as described we shall 
make one or more false “recognitions” 
in approximately 5 experiments out of 
100. 


(ii) 


(iii) 


For our example we "recognize" that the “Aver- 
age 20 errors” instruction is different from the 
“Few errors as possible” instruction, but we 
do not "recognize" any other real differences. 
We can still set 95% allowances on the differ- 
ences as follows: 
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The last column is obtained from the next to 
last by dividing by the number in a group. 
These limits are like confidence limits, in that 
we "recognize" that the true differences lie be- 
tween these limits. At least one of these “recog- 
nitions" will be wrong in five experiments per 
100, but all "recognitions" will be correct in 
95 experiments per 100. 


TWO-WAY CLASSIFICATION 


Five judges rated affective value of 10 differ- 
ent colors on a nine-point rating scale (Guil- 
ford, 1936, p. 281). The data are presented in 
Table IV along with computations to assist in 
the allowance analysis. 

In Table IV row and column totals have been 
computed as usual. The numbers in parenthe- 
ses are the differences between adjacent col- 


Diff. of allow- Limits on Limits umns, the last of which is the difference between 

Comparison totals ance totals on means the E and A columns or, in general, between 
Instr. (1) — the last and first columns. The range of the 
Instr. (2) 14 +33 —19to47 —27to 6.7 differences is just the range of the numbers ap- 
Instr. (3) — pearing in parentheses in the same column. We 
Instr. (1) 24 +33 —9to57 —13to 8.1 could just as well have taken the differences 
Instr. (3) — the other way, between adjacent rows instead 
Instr. (2) 38 £33 Soit +0.7to10.1 of between adjacent columns, but if one does 

TABLE DI 


CRITICAL FACTORS FOR ALLOWANCES FOR ONE-WAY (BALANCED) DIVISION INTO GROUPS 


5% RISK 


Number of groups = number of ranges 


2 3 4 5 6 T5 E 


2 343 2.7 1.78 140 1.16 1.00 .87 .78 
Se 3 1.91 144 118 .94 .80 .70 62 Ap 
S 4 Le 125 101 .84 .72 68 57 A 
g 5 159 119 96 a 70 61 .55 .50 
g 6 150 118 95 .80 .69 61 55 49 
$ 7 149 117 95 80.69 61 .55 .50 
3 8 149 117 96 81 .70 .62 55 .50 
Z g 150 118 97 .82 .71 62 56 A 
lio 1.52 1.20 98 .839 .72 68 .57 .52 
Su 154 121 99 .84 78 64 58 52 
&12 1.56 1.28 1.00 .85 .74 65 .59 .53 
£13 158 125 102 .86 .75 66 .59 .54 
E 14  L60 126 1038 .87 .76 .67 .60 .55 
£15 1.62 128 105 .89 .77 .68 .61 .56 
Z16 164 130 106 .90 .78 .69 .62 .56 

17 166 131 1.08 a .79 20 A8 .57 

18 168 1.3 109 .92 .80 21 64 .58 

19 170 134 110 mm .81 72 .65 59 

20 172 186 111 .95 .82 78 .65 .59 


46 .42 .89 .97 85 A8 .81 .29 .28 27 25 
47 A8 40 37 35 A8 .91 80 .28 .27 26 
47 A44 Al 38 35 A4 82 20 .29 27 26 
48 .44 41 38 36 A4 28 A .29 .28 27 


54 .50 47 44 41 89 .97 


Entries are to be multiplied by the sum of the ranges within groups to obtain allowances for group totals. 


D 
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Taste IV both he should average the allowances obtained 
by the two approaches, rather than take the one 
with the smaller allowances. 
Judge We enter Table V with number of ranges = 
Color A B [e D E Totals number of judges (—5) and number per range 
= number of colors (—10) and find the two 
(8) 4(2) 6 (4) 2(-8) 5(-2) 24 entries 0.59, 0.49. 
(4) 4(-8) 7 (5) 2(-5 7 (-1) 28 The upper entry corresponds to the column 
(4) 4 (-4) 8(2 6 (1) 5(-8) 81 (judge) totals, while the lower entry corre- 
(2) 5 (-2) 7 4) 3 (-1) 4(-8) 36  spondstothe row (color) totals. We multiply 
2 (-5) 7 (4) 3 CH 4(-1) 21 the two numbers by the sum of the ranges and 
5 1 is 6 © 3 Ga éi e get 
) 8(-3) 6 (3) 3 (3) ) 2 
(0) 3(-9) 6() 5 () 4 () 21 nie 25 — 14.8 — allowance for column 
16 24 Ue a ges 3 P T Ze 1 d e 0.49 X 25 — 12.3 — allowance for row totals. 
tr Een FEN sj uec DEE — Justas in the one-way classification, we can 
Totals 55 57 68 87 50 247 make comparisons of column totals and of row 
Range of totals. We "recognize," for example, that Judge 
differences 6 4 4 6 5 C differs from B, D, and E, but not from A; and 
we “recognize” that Color 6 differs from colors 
Sum of ranges 25 2, 3, 9, and 10, but we “recognize” no other dif- 


RATINGS OF 10 COLORS Bv 5 JUDGES 


Q o - 0 Oo Nx 
C^ G6 A Dër si © D A 
= 
[7] 
= 


TABLE V 
CRITICAL FACTORS FOR ALLOWANCES FOR TWO-WAY (BALANCED) DESIGN 
Method—Successive Differences 
5% RISK 


Top entries lead to allowances for Variable A Totals 
Bottom entries lead to allowances for Variable B Totals 


Number of ranges = Number of Variable 4 totals 


Sens Ree eo G7). as go qo Ste SPI. 1718. 19 20 


6.86 2.17 1.53 1.17 96 .80 68 62 54 49 45 41 38 ae 4.81 80 28 .27 
6.36 1.95 1.40 1.12 .96 .84 .76 .70 .65 .61 .58 .55 52 .50 48 46 45 A4 .42 


1.95 1.17 .90 78 ee .54 48 .49 .88 .95 .92 .80 28 .26 .25 .28 .22 .21 .20 
2.7 1.17 .91 .77 .68 61 56 .52 .49 46 .44 .49 40 .38 An .36 .35 .34 .33 


7 145 mp .76 63 .54 47 42 .88 .84 .32 29 an 25 Sé 28 21 .20 .19 .18 
1.56 .94 .76 op .58 58 .49 45 A8 An .39 97 Ap .84 .38 .92 ai .30 .29 


5 127 .90 .72 60 .51 .45 .40 .86 .83 .30 .28 26 .25 .28 .92 .21 .20 .19 .18 
69 60 .53 .49 45 42 40 .38 .36 .34 .33 32 A1 .30 .29 .28 .27 


6 118 .87 .70 58 .50 A .39 ap .83 .30 .98 oe 24.29 .29 21 20 .19 .18 
114 .78 o .56 .50 .46 43 An .88 .96 A4 38 31 .80 .29 ap 27 .97 26 


7 Lë .85 .69 .58 .50 44 mm .36 .33 .30 .28 .26 24 93 22 1212590. .19 ..18 
L07 .74 .61 54 48 A A .88 .36 .34 .88 .31 .80 .29 28 27 .26 .26 325 


1.15 .85 .69 .58 .50 A 40 $6 .33 .30 .98 .96 25 .28 .22 a .20 .19 .18 
1.00 .71 .59 .52 47 .49 40 .87 Ap .33 .82 .30 29..28 .27 .26 26 25 24 


Ll4 .85 .69 .58 .50 .45 40 36 A8 .91 .28 .97 25 .24 .22 21 .20 .19 .18 
‘96 68 .57 .50 .45 42 39 ap .34 A8 .31 .80 29 .28 .27 26 ap 24 24 


1.15 .86 .70 .59 51 45 40 87 A4 A1 29 o 25 .24 .28 .21 20 .19 .19 
93  .66 56 49 44 41 38 .36 .34 .32 a .29 28 .27 .26 25 .25 24 23 


TO 


Number of Variable B totals 
oo 


io 
© 
Ze 
= 


Number of differences per range 
© e 


= 
e 


continued 
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ferences in color. We can also set limits on dif- 
ferences, as in the one-way analysis. All state- 
ments are made with a 5% error rate per group 
of comparisons, i.e., per experiment. Among all 
row (column) comparisons, we will be wrong 
at least once in five experiments per 100. 
Although we cannot devote more space to 
this problem here, we remind the reader that 


much work is being done, and that further re- 
sults should be made available to the psychologi- 
cal literature. Tables exist for 1% error rate, 
and methods exist for using classical analysis of 
variance computations. But the speed and 
simplicity of the present method go far to rec- 
ommend it. 


THE MATCHING PROBLEM 


The matching problem occurs frequently in 
psychological investigations, but its existence is 
not widely recognized among social psycholo- 
gists because it is not treated in many elemen- 
tary texts and because it arises in subtle and 
unexpected places. For example, an experi- 


menter presents pictures of faces and requests 
the subject to classify the pictures into their 
correct religious categories—Catholic, Jew, or 
Protestant. When the data are in, it is desired 
to test whether the number of correct responses 
hovers around that expected from guessing or 


Taste V (continued) 


CRITICAL FACTORS FOR ALLOWANCES FOR TWO-WAY (BALANCED) DESIGN 


Method—Successive Differences 


5% RISK 


Top entries lead to allowances for Variable A Totals 
Bottom entries lead to allowances for Variable B Totals 


Number of ranges = Number of Variable A totals 


2 3 4 Ne CAN Ce, sent 


10 11 19 31$ 14 15. 1G e 


LS ` SN 71 ..60 .52 6 41 97 


4 91 65 .55 A8 .49 40 A0 28 
Zu Ll6 .88 2 60 .52 46 A .38 
s 88 o A A .48 .89 .37 A8 
Sag 1 89 72 61 58 47 42 38 
3 86 .62 .58 47 42 .89 36 34 
$ j4 Lë .90 .73 .62 .54 48 48 9 
4 185 262, :52) 46) 42, 1.597.560) 54 
Z jg Lä 91 .74 68 55 48 48 .39 


Ce 
-- 
a 
or 
N 
»* i 
a 
A 
= 
Qo 
o 
bo 
a 
bo 
E 


1296  .92 $149 67 ;.58. 51 46. 42 
79 58 50 .44 .40 .97 34 .82 


Number of differences per range 
Ze 


34° Al .29 .27 26 24 .99 122 (2) 20S 
33.31.80. 29 28 27 .26 .25 24 24 28 


Ap .82 .80 .28 .26 ap 28 .22 .21 .20 .19 
A8 .81,.80 .28 .27 .26 :25 .95' 26 29 28 


35 .82 .80 .28 .26 .25 24 .22 .21 20 .19 
32 A1 .29 .28 27 .26 .95 24 24 23 .22 


486. 693) 23]) (29.27 BE 1240.28) EA 
A8 .30 .29 .28 27 .26 .25 .24 23 23 ZS 


36 .38 .91 29 27 .26 24 .29 .22 21 20 
32) 30 -29 .28° 27 26 251 24 1.23) 1251022 


37 A4 .81 .29 .28 .26 .25 29 22 .21 .20 
51 .80 .29 .27 .26 .25 .25 .24 23 25, 22 


.97 A4 .82 ap 28 .26 .25 .94 .28 22 .21 
$1 .90 .28 27 26 ap .94 .94 .23 22 22 


38 .85 .92 30 28 27 .25 24 .23 22 21 
A1 20 28 21 26 25 24 24 23 22 22 


38 Ap 38 31 29 27 26 .24 .23 22 21 
A1 20 28 .27 26 .25 .94 .28 23 .22 .22 


29 .36 .33 .31 .29 27 .26 .25 23 22 .21 
.31 28 .28 .27 .26 .25 


34 .93 28 22 221 


Entries are to be multiplied by the sum of ranges of differences between adjacent measurements to obtain 
allowances for Variable A totals (top entries) and Variable B totals (bottom entries) . 
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whether the subject does get cues about religious 
affiliation from physical characteristics shown 
in pictures. Further examples are the follow- 
ing: (i) Two judges try to do the same content 
analysis on a set of stories, classifying each story 
by its main theme. Before we go further with 
their training, we stop to see whether there is 
any agreement at all between them. The num- 
ber of stories they have called identically is the 
number of their matches. (ii) Two experimen- 
tally independent criteria are used to classify 
subjects (a group judgment on the basis of pro- 
tocols and a projective test not given to the 
group), and it is desired to measure agreement 
using the number of identical classifications. 
(iii) Perhaps the most classical use of the match- 
ing problem is in extrasensory perception ex- 
periments, where the subject tries to name the 
face of a card not seen by him. Again the num- 
ber of correct calls is a measure of the agree- 
ment. In all such procedures the preliminary 
question is whether the agreement is better than 
that expected from random guessing. If it is 
better, further questions of serious interest 
usually enter the discussion, such as exploration 
of factors that tend to increase the number of 
matches, We supply some results to assist in 
making the base-line computations (Battin, 
1942; Wilks, 1943) . 

To facilitate discussion of this problem we 
shall call the stimulus materials the target deck 
of cards, The categories will be called suits. 
"The number of separate stimuli in a particular 
category will be called the number of cards in 
the suit. The series of calls or sortings into 
categories by the subject will be called the 
matching deck. It too will have suits (usually 
the same as the target deck) and the numbers 
of cards in its suits will depend on the fre- 
quency with which each category is chosen. 
This implies that the subject is informed in ad- 
vance of the number and nature of the suits. 

The distribution of the number of correct 
calls in such experiments under a chance hy- 
pothesis can be obtained by counting as shown 
in the following example. 

Example: Six handwriting specimens are 
available, three from convicted criminals whose 
crimes are against persons, and three from con- 
victed criminals whose crimes are against prop- 
erty. A subject attempts to classify the hand- 
writing into the two categories—three specimens 
to each. What is the chance distribution of 
the number of correct classifications? The first 
step is to lay out the target deck, and beneath 
it to write out all possible arrangements of the 
matching deck. We shall let P stand for prop- 
erty, / for persons. (individuals) . 
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No. of correct 


Target PPPIII matches 
Arrangements 1. PPPIII 6 
ken bh sd aS A OE 4 
deck LEE DOE 4 

SEANCE q 
eee LE 4 
GP TPO Poy 4 
TAE PAESE E 4 
BOPITIPPI 2 
VERII POP 2 
IQ RIL PSP 2 
ILI PPP Ty 4 
EC E ATP I 4 
HOEP P I IP 4 
TEROP E P P I 2 
IERP L P LP 2 
PAR ek P P 2 
IP EI PIP PI 2 
ISIEIPPIP 2 
EE EE E 2 
20) DLE RPP 0 


If we take all matching arrangements as 
equally likely, then there is only one chance in 
20 of getting all six correct. The mean and 
variance of the chance distribution of the num- 
ber of correct matches can be obtained from the 
frequency distribution given below. 


No. of matches: x Frequency: f fx fx? 
6 1 6 36 
4 9 36 144 
2 9 18 36 
0 1 0 0 
Totals ER E oe 
mean = mi — 9 3, 


Us z = 
s= Noa = Vid = 1.54. 


Thus if 10 subjects average 4.0 correct matches 
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we can test the chance hypothesis with the nor- 
mal deviate 


Consulting normal tables, we find this beyond 
the 5% level whether we are using a one-sided 
or a two-sided test. Actually, we might use a 
correction term here because subjects can get 
only an even number of correct matches. To 
do this properly we set up z based on the total 
number of correct matches: 


| 40—30 | — 1 9 
z= cots 


ewe =2.12, 
Bet 4.24 


which is still significant at the 596 level, whether 
one- or two-sided. The correction | is used only 
when there are exactly two suits. Otherwise 
the correction is Lë. 


We use the correction for continuity liberally in 
this chapter, so a word of explanation may be in 
order. Discrete distributions have their probability 
concentrated at points, and in most problems we 
discuss these points are integers: 0, 1, 2, . . When 
we try to approximate such a discrete distribution 
by a continuous distribution such as the normal, the 
discrete density at a point such as 3 is usually 
thought of as being approximated by the area under 
the normal curve between 2.5 and 3.5. So if we 
want the probability of 3 or more heads in 4 tosses 
of a coin we would sum the probabilities of 3 and 
4 heads from the discrete distribution to get 5/16 = 
0.3125. The expected number is 4 X V4 = 2 and the 


standard deviation is \/ 4 X Ve X Y = |. So the 
normal approximation is given by the area to the 
right of (3— 15) — 2 = 14. This approximation is 
0.3085. Had we not used the correction, the approxi- 
mation would have been 0.1587, not nearly as close. 
In the present example the normal approximation 
is being made to a distribution that takes only even 
values, so the probability at 40 matches is repre- 
sented by the area of the normal from 39 to 41, so 
in this case the correction term is 1. It might 
be remarked that the correction is being made on 
the normal deviate to adjust for the discontinuity 
of the discrete distribution. Such a correction could 
just as well have been called a correction for dis- 
continuity. 


We will not usually have the exact distribu- 
tion of the number of matches at our disposal. 
(It is easy to see that computations of the exact 
distribution will become extremely laborious 
as the number of cards and suits gets large.) 
Consequently, we shall have to rely on approxi- 
mate normality, and compute means and stand- 
ard deviations for approximate t-tests. 


Identical target and matching decks, same 
number of cards in each suit. Suppose there are 
s suits, with c cards per suit in both the target 
and the matching deck. The total number of 
cards per deck is n — sc. Then the average and 
variance of the number of matches under 
chance arrangement of the matching deck are 


where p — 1 » q(q—1—fp 


neq (GZ) 


Here p is the probability that the first card is 
put in the correct suit. Because there are s 
suits of equal size, the chance is 1/s. This ele- 
gant way of writing the mean and variance in 
terms of n, p, and q was pointed out by C. Hor- 
ace Hamilton. It shows us that, properly inter- 
preted, the mean and variance are approxi- 
mately those of a binomial. 'The mean is exactly 
in binomial form, while the binomial variance 
npq is multiplied by a factor n/ (n — 1) which 
is very close to unity when n is large. 

If we apply these results to the handwriting 
example, we have 


m-—c-np, 


Mns Ga Ts, 
m (n — 1) 


c? 


s=2, c = 9, 


m c = 3, 


n 


gen nig (A) (9) (49) (6/5) 
BS: — 9/5 — 1.8 


o — V1.8 — 1.34. 


These results agree with our more detailed 
calculations. : 

Identical target and matching decks, differing 
numbers of cards per suit. By design we may 
not have the same number of stimulus materials 
in each category. Things become more com- 
plicated. Suppose there are s suits with c, cards 
in the ith suit, 


el a 
n—1 i=l 
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Example: The handwriting expert has three 
classes of criminals to classify. The numbers 
actually in the three classes are 5, 3, and 2. He 
is told the number in each class and asked to 
put these numbers into the classes. Then 


n—5--3-- 2 = 10, 
—_ +944 _ 88 


10 mos 
c2 — 16 [3.8 (3.8 + 10) — (2/10) (125 + 27 + 8) 
20.44 

— 9 AE 2.27. 

If the expert gets 6 correct our normal deviate 
is 

Za eg EN 1.18. 
V2.27 1.51 


The 0.5 is the continuity correction. A devia- 
tion of 1.13 is not significant at the 0.05 level 
(one-sided) . 

The actual table of data might have been 
shown in a 3 X 8 table, as we illustrate. 


3 X 3 TABLE FOR HANDWRITING EXAMPLE 


Target deck 
Categories 


The number of matches is the sum of the diag- 
onal cells: 3 -+ 1 + 2 — 6. 

Nonidentical target and matching decks, tar- 
get deck having suits of same size. Sometimes, 
unfortunately, the number of times a subject 
calls each suit cannot be controlled. In such 
situations the computations are still fairly sim- 
ple if each suit of the target deck has the same 
number of cards. 

Let s be the number of suits, and c the num- 
ber of cards per suit of the target deck. The 
matching deck (the calls) has c,, c5, .. ., c, cards 
in the suits, Zc, — n. Then 


m = C, 


a c[n? — Seil 


n (n — 1) 
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Example: We have three categories for hand- 
writing, with 5 specimens of each. The subject 
puts 7 in category 1, 5 in category 2, and 3 in 
category 3. Then 


Cy = 1, Co = 5, Cg = 3, n = 15. 


m=5, 
qa 5 (225 — [49 + 25 +9]) 
15 (14) 
e yw 
3 (14) 


Had the subject put the same number in each 
category, the mean would still be 5, but the 
variance would be 


225 (14) (3%) _ 50 
E AH 


instead of 3.38. In general, the use of unequal 
numbers of calls per category when the target 
deck has equal numbers will reduce the vari- 
ance. This means that in testing significance 
for the mean scores of several subjects the use 
of the equal-category variance will give a con- 
servative test in cases where the experimenter is 
unwilling to compute the variance for each sub- 
ject separately. 

Here the subject has probably called freely. 
We have tested his accuracy, given the particu- 
lar number of calls in each category he has 
made. The question is sometimes raised 
whether the subject should not get credit for 
accurately reproducing the correct number of 
calls in each category. It is difficult to conceive 
of a probability model that would accurately re- 
produce the physical situation. We cannot tell 
what suspicions the subject has about our cate- 
gory sizes, or from what kind of distribution he 
chooses his calls. It seems difficult therefore to 
do much about this aspect of the problem. 

Different decks. In special cases the matching 
problem will occur with target and matching 
decks of quite different compositions. Let the 
number of cards in each suit of the target deck 
be ti, to, . . , t, and, as before, the number of 
cards in the suits of the matching deck be cu. 
Cas» Cy Then n — Dc, — Xt. The mean 
number of correct matches is 


EN Dic 
n 


Pm p y (Ort elte 


4- n? Del, 
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Example: In an open-end pilot study of clas- 
sification of emotions from photographs, a stu- 
dent has 20 photographs: 5 expressing “anger,” 
7 expressing “joy,” 8 expressing “sorrow.” The 
first subject gives 3 responses of "joy," 4 “‘sor- 
row,” 6 "fear," 2 “tranquility,” 5 "I don't 
know." The total number of categories is ob- 
tained by counting the number of differently 
named categories that are used in one deck, the 
other, or both. There are 6. The set-up shown 
below for "Computations for nonidentical 
decks" will facilitate the calculations. 

Suppose the subject gets 5 correct matches, is 
he doing better than chance at approximately 
the 5% level, considering the categories he used? 
We should probably use the continuity correc- 
tion of Lé for his score, because he can get only 
an integral number of matches. Then 


PX (5 Ys) — 265 m 185 m 10477 
V/1.596 1.26 


Because the normal table gives 1.65 for 5% one- 
tailed test we would not reject the hypothesis of 
chance sorting on the basis of these data. This 
subject could have gotten at most seven correct 
matches. Although the sensitivity (or power) 
of the matching test has not been worked out, it 
is clear that a situation that requires a subject 
to get 6 out of 7 agreements for significance does 
not give the subject much chance to display his 
ability if he has only a little. This is why open- 
end responses in matching situations are usu- 
ally not desirable; they cut down the possibility 
of a significant result for a single subject by 
decreasing the number of possible matches. Fur- 
thermore, each subject (especially when the 
target suits have unequal sizes) requires a spe- 
cial computation for his mean and variance. 
Suppose we have several subjects in the experi- 
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ment just described with the following data: 


Subject — No. of correct matches m c? 
1 5 2.65 1.60 
2 4 2.00 1.45 
3 6 2.34 1.36 
4 7 6.51 1.00 
Totals 22 13.50 5.41 
The significance test would be 
im eee 
v5.41 2.33 
significant well beyond the 5% level. Note: 


(1) the 14 correction is made on the total num- 
ber of hits, (2) the hypothetical mean is the 
sum of the means, (3) the hypothetical vari- 
ance is the sum of the variances. 

From the discussion the reader will note that 
the application of the test of significance for 
matching is the same whether the experimenter 
informs his subjects of the number of suits and 
number within a suit, or whether he tells his 
subjects nothing about the target deck. It is 
usually preferable for the experimenter to give 
his subjects complete information about the 
composition of the target deck because, as can 
be seen from the examples, this gives greater 
chance for subjects to display their ability at 
matching if they have any. A possible excep- 
tion to this general rule is the case where an 
experimenter is conducting a pilot study, not 
only to get some idea about matching ability, 
but also some information about what "nat- 
ural" categories might exist for his particular 
matching problem. 


COMPUTATIONS FOR NONIDENTICAL DECKS 


Anger Joy Sorrow Fear Tranquility DK Totals 
Photographs t, 5 7 8 0 0 0 20 
Responses c, 0 3 4 6 2 5 20 
Se E dam CR ae o rs os 
ue 5 10 12 6 2 5 
tie, 0 21 32 0 0 53 
te, (ty + 6) 0 210 984 0 0 0 594 
— 53° 
-—À— = 2.65, 
dpt s 
! 2.2 2 - 12129 — 1.596. 
g2 == 719) [ (53) 20 (594) -++ 202 (53) ] “7,600 


1 © 
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NONPARAMETRIC METHODS 


The following discussion of nonparametric 
methods is drawn almost entirely from Moses 
(1952) , with the exception of the discussion of 
Whitney's extension of the U-test, the analysis 
of variance for ranks, the tables, and some opin- 
ions of our own. Moses states (pp. 123-124) 
the advantages and disadvantages of nonpara- 
metric methods as follows: 


Advantages of nonparametric methods: 

1. Whatever may be the form of the distribu- 
tion from which the sample has been drawn, a 
nonparametric test of a specified significance 
level actually has that significance level (provided 
that the sample has been drawn at random; in 
certain cases, as will be noted, it is also necessary 
to assume that the distribution is continuous). 

2. If samples are very small, e.g., six, there is 
in effect no alternative to a nonparametric test 
(unless the parent distribution really is known) . 

3. If the sample consists of observations from 
several different populations there may be a suit- 
able nonparametric treatment. 

4. The methods are usually easier to apply 
than the classical techniques. 

5. If the data are inherently of the nature of 
ranks, not measurements, they can be treated di- 
rectly by nonparametric methods without pre- 
cariously assuming some special form for the 
underlying distribution. 

6. In certain cases data can only be taken as 
“better” or “worse,” that is, an observation can 
only be characterized as a plus or minus. Obvi- 
ously, the classical tests are not directly applicable 
to such data. 


Disadvantages of nonparametric methods: 

1. If nonparametric tests rather than normal- 
theory tests are applied to normal data then they 
are wasteful of data. The degree of wastefulness 
is measured by the “efficiency” of the nonpara- 
metric test. If, for example, a test has 80 percent 
efficiency, this means that where the data are from 
a normal distribution, the appropriate classical 
test would be just as effective with a sample of 20 
percent smaller size. The efficiency thus expresses 
the relative merits of the nonparametric test and 
the classical test under the conditions where the 
normal test is correct, but does not tell us how 
the tests will compare on non-normal data. 

2. The nonparametric tests and tables of sig- 
nificance values are widely scattered in the peri- 
odical literature. 

3. For large samples some of the nonparametric 
methods require a great amount of labor, unless 
approximations are employed. 


It might help to translate disadvantage 1 
into the same terms we used to discuss the power 
of some of the common statistical tests. If we 
think of how likely we are to detect a true dif- 
ference when it exists, to say the nonparametric 
method is less powerful than the test based on 


normal theory is to say that we will be less 
likely to detect a true difference of a given size. 
On the other hand, this is a two-edged sword, 
for if observations are cheap and computation 
using normal theory is expensive, it may be 
efficient in the ordinary sense of the word rather 
than the statistical sense to use nonparametric 
methods anyway. 

The sign test. This test gets its name from 
the fact that we replace attribute or measure- 
ment data with plus and minus signs, and use 
the binomial distribution with p = q = 14 as 
the null hypothesis. Suppose we have Treat- 
ment A and Treatment B; we are interested in 
seeing whether these two treatments differ. 
Values observed under Treatment B are sub- 
tracted from values observed under 4, and then, 
depending on whether this difference is positive 
or negative, we write down a plus sign or a 
minus sign. If there are ties, throw away all 
tied pairs and work with the reduced sample 
size, [Although this is not the usual recom- 
mendation, it seems appropriate from an exam- 
ination of Tocher's handling of Fisher's Flower 
Problem (1950).] The assumptions underly- 
ing this test are (1) the variable under consid- 
eration has a continuous distribution, (2) that 
both members of a pair are treated similarly ex- 
cept for getting Treatment A or Treatment B. 
Notice that assumption (2) does not imply 
similar treatment of various pairs. The hypoth- 
esis being tested is that the population medians 
are equal. Under the null hypothesis, we ex- 
pect roughly equal numbers of plus and minus 
signs, and the probability level of a particular 
result can be assessed using the binomial dis- 
tribution with n — number of pairs, and p = d 
= Lë, There are tables of the binomial dis- 
tribution for n up to 49 (Tables, 1950) and, of 
course, even without the tables we can use the 
normal approximation to the binomial (with 
the correction) : 


ue Riesa [om | x — Yn | — 14 d 
vapa yy n 


where x is the observed number of plus (minus) 
signs, n is the number of pairs, and 1/ is the 
continuity correction. his test can easily be 
extended to cases where the question is slightly 
reformulated: Is A better than B by 5 points, 
or is A better than B by 10%? In order to do 
this, we merely add 5 points or 10% to the 
values of the B's before proceeding with taking 
differences. It is easy to see that we can manip- 
ulate the formula to suit a large variety of 
hypotheses. 
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To facilitate the use of the sign test we in- 
clude a table of critical values of x for the sig- 
nificance levels 1%, 5%, 10%, 25%. The tabu- then we merely look in Table VI to see whether 
lation is planned for use in a two-sided test. If the observed r is less than or equal to the tabu- 


r — smaller of x and n — x, 


Taste VI 
TABLE OF CRITICAL VALUES OF f FOR THE SIGN TEST 
Percent level of significance Percent level of significance 

n 1 5 10 25 n 1 5 10 25 

1 - - - - 51 15 18 19 20 
2 = - - - 52 16 18 19 21 
3 = — = 0 53 16 18 20 21 
4 - -— - 0 54 17 19 20 22 
5 - 0 0 55 17 19 20 22 
6 = 0 0 1 56 17 20 21 23 
7 = 0 0 1 57 18 20 21 23 
8 0 0 1 1 58 18 21 22 24 
9 0 1 1 2 59 19 21 22 24 
10 0 1 1 2 60 19 21 28 25 
11 0 1 2 H 61 20 22 23 25 
12 1 2 2 3 62 20 22 24 25 
13 1 2 3 3 63 20 23 24 26 
14 1 2 3 4 64 21 23 24 26 
15 2 3 3 4 65 21 24 25 27 
16 2 3 4 5 66 22 24 25 27 
17 2 4 4 5 67 22 25 26 28 
18 3 4 5 6 68 22 25 26 28 
19 3 4 5 6 69 23 25 27 29 
20 3 5 5 6 70 23 26 27 29 
21 4 5 6 7 71 24 26 28 30 
22 4 5 6 7 72 24 27 28 30 
23 4 6 7 8 78 25 27 28 31 
24 5 6 7 8 74 25 28 29 31 
25 5 7 7 9 75 25 28 29 32 
26 6 7 8 9 76 26 28 30 32 
27 6 7 8 10 77 26 29 30 32 
28 6 8 9 10 78 27 29 31 33 
29 7 8 9 10 79 27 30 31 33 
30 7 9 10 11 80 28 30 32 34 
31 7 9 10 11 8l 28 31 32 34 
32 8 9 10 12 82 28 31 38 35 
38 8 10 1 12 88 29 32 35 35 
34 9 10 1 13 84 29 32 33 36 
35 9 11 12 13 85 30 32 94 36 
36 9 11 12 14 86 30 33 34 37 
37 10 12 13 14 87 31 33 35 37 
38 10 12 18 14 88 31 34 35 38 
39 11 12 13 15 89 31 34 36 38 
40 ll 13 14 15 90 82 35 36 39 
41 11 13 14 16 91 32 35 37 39 
42 12 14 15 16 92 33 36 37 39 
43 12 14 15 17 98 33 36 38 40 
44 18 15 16 17 94 34 37 38 40 
45 18 15 16 18 95 34 37 38 41 
46 13 15 16 18 96 34 37 39 4l 
47 14 16 17 19 97 35 38 39 42 
48 14 16 17 19 98 35 38 40 42 
49 15 17 18 19 || 99 36 39 40 48 
50 15 17 18 20 100 36 39 4l 43 


For n > 100, approximate values of r may be found by taking the nearest integer less than Yan —kv/n, 
where k = 1.3, 1, .82, .58 for the 1, 5, 10, 25 percent values respectively. A closer approximation to the 


i i -1)- £ k, 1.2879, .9800, 8224, .5752. 
values of r is obtained from Léin — 1) — ky/n -F 1 and the more exact values 9 


D 
€ © 
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lated one. For example, with n — 23 observa- 
tions we observe 5 -+-’s and 18 —'s, Then r — 5. 
We find this number significant at the 5%, 10% 
and 25% levels, but not at the 1% level. Had 
the direction of the difference been predicted, 
we would have used a one-sided test and halved 
the percentages at the top of the columns. In 
one-sided tests we must be careful that the ob- 
served result was in the direction of the alterna- 
tive hypothesis. 

A table related to the power of the sign test 
is also helpful. Table VII shows the sample size 
needed in two-sided tests at various levels of 
significance to be very sure (95%) of rejecting 
the null hypothesis p — 15, when it is indeed 
false. For example, if the significance level is 
5% and the actual p is 0.40, we need a sample 
of about 327, and we reject if x or n — x is less 
than or equal to 145. Alternatively, suppose it 
is planned to use a sample of size 50, the true 
would have to deviate from 0.50 by 0.25 before 
we would be very sure of getting significance at 
the 5% level. Like the earlier curves of the 
power of the common tests of significance, 
"Table VII deserves more serious attention than 
it has ordinarily been given. It suggests that 
if deviations of p from 0.50 of the order of 0.05 
are being sought, we need samples of the order 
of 1000; for deviations of 0.10, we need samples 


Taste VII 


MINIMUM VALUES OF SAMPLE SIZE n AND CRITICAL 
VALUES f, NECESSARY TO FIND SIGNIFICANT 
DIFFERENCES 95% OF THE TIME FOR 
VARIOUS GIVEN PROPORTIONS 


n = Sample size r = Critical values 


fo "ra 5% 10% 25% “1% 5% 10% 25% 


45 (.55) 1,777 1,297 1,080 780 833 612 512 373 
-40 (.60) 442 327 267 193 193 145 119 87 
-35 (.65) 193 143 118 86 78 59 49 87 
-30 (.70) 106 79 67 47 39 30 26 19 


.25(.75) 66 49 42 82 2 
.20(.80) 44 35 28 21 18 11 9 7 
.5(85 82 29 18 14 8 
Mocsy Sh 37 Jg 41 53 
05(9) 15 12 HM 6 2 


The italicized values are approximate. The maxi- 
mum error is about 5 for the value of n, and 2 for 
the value of r. The values of n and r for 59% were 
taken from MacStewart (1941), who gives a table 
of values of n and r for a range of confidence co- 
efhcients (the above table uses only 959%) and a 
single value a = 5%. 
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of the order of 300; and it is not until huge 
deviations such as 0.35 occur that ordinarily 
used samples such as n — 25 become reasonably 
adequate. 

The median test. Sometimes we wish to 
phrase the question we ask as follows: Do two 
samples, n X's and m Y's, have the same median? 
"There do not have to be equal numbers of 
observations in the two sets. First, compute the 
median for the pooled samples. If the null 
hypothesis is true and the two sets of observa- 
tions do have the same median, we should find 
roughly half the X's and half the Y's above the 
combined median. 

To carry out the test, record all the observa- 
tions falling above the common median as --'s, 
and all falling below as —'s. When the total 
number of observations, n + m, is odd, one 
observation will fall at the median; this observa- 
tion should be thrown away. Then record the 
number of e and —'s in each sample in a 
2 x 2 table: 


Lis —'s 
X's T n—rt n 
Y's s m— s m 
r+s n+ m—r—s n+ m 


Make the usual chi-square test for independ- 
ence with one degree of freedom, using Yates’ 
correction. The question we are asking when 
we make this test is: Is the attribute of being 
an X associated with the attribute of being 
above the common median? 

Moses says (p. 126): “The assumptions un- 
derlying this test are that the X's and Y’s are 
random samples from their respective popula- 
tions, and that the population distributions are 
of the same form, differing only by a translation 
up or down the scale. Although the test is 
derived using the second assumption, Mood 
states (1950, p. 394) that the test ‘is sensitive 
primarily to differences in location and very 
little to differences in shape." The cutting 
point need not be the median of the combined 
sample but should be chosen before looking at 
the data. As a general rule, use of the median 
should give a more powerful test, so it might 
as well be used consistently. 

Wilcoxon's matched pairs test using signed 
ranks, This test uses the same information the 
sign test uses, and in addition takes some ac- 
count of the size of the observed differences. 
This means it is a more powerful test than the 
sign test (smaller probability of a type II 
error), because it uses more information. 

The first step is to take differences by pairs, 
and then rank these differences in order of ab- 
solute size (disregarding sign). The smallest 
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numerical difference is ranked as 1. Sum sepa- 
rately the ranks of the positive and of the nega- 
tive differences; use the smaller sum. Suppose 
we have 7 pairs, with the values shown in the 
example. 


WILCOXON SIGNED RANK EXAMPLE 


Ranks 
of same 
Rank sign 
Treat. Treat Differ- of abso- With 
Pair ment ment ence lutedif- lower 
A B A—B ference total 
1 65 51 14 6 
2 60 44 16 7 
3 71 64 7 4 
4 52 57 —b5 2 2 
5 62 49 13 5 
6 41 38 3 1 
" 58 52 6 3 —— 


Sum of ranks with 
the lower total 2 


If the sum of the ranks for either the negative 
differences or the positive differences is too 
small, the null hypothesis is rejected. As men- 
tioned above, we take the sum of the ranks of 
the observations with smaller rank total. 

We give tables of critical values of the sum of 
the ranks, T, for n from 7 to 16. For values of 
n larger than 16 we suggest the approximate 
normal deviate 


T—T+% 
op d 


n (n 4- 1) 


mm ————, 


4 


Z- 


where 


(Qn LIT 
Op? wem MM 


6 


and the 1/4 isa continuity correction. Note that 
when we reject the null hypothesis T will be 
less than T and therefore z will be negative. 
The data presented above, when tested using 
the sign test, show no difference between Treat- 
ments A and B. However, when signed ranks 
are used, the conclusion is that Treatment A is 
significantly different from Treatment B at the 
5% level (see Table VIII). This happens be- 
cause the smallness (—5) of the one negative 
difference adds its weight to the fact that only 
this one negative difference is observed. It is 
obvious from the way the test is carried out that 
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Tastre VIII 


WILCOXON SIGNED RANK TEST 


Given matched pairs of observations. Rank dif- 
ferences in order of increasing absolute magnitude. 
Compute the sum of ranks for positive differences 
and for negative differences. Let n equal the num- 
ber of pairs, T be the smaller sum of ranks, and P 
be the probability of obtaining a given T or less 
by chance. 


n T P 
7 0 0.016 
7 2 0.047 
8 0 0.0078 
8 2 0.024 
8 4 0.055 
9 2 0.0092 
9 3 0.019 
9 6 0.054 
10 3 0.0098 
10 5 0.019 
10 8 0.049 
1 5 0.0093 
11 7 0.018 
11 11 0.053 
12 T7 0.0093 
12 10 0.021 
12 14 0.054 
13 10 0.0105 
13 18 0.021 
13 17 0.050 
14 13 0.0107 
14 16 0.021 
14 21 0.054 
15 16 0.0103 
15 19 0.019 
15 25 0.054 
16 19 0.0094 
16 23 0.020 
16 29 0.053 


one negative difference with rank 6 will count 
just as heavily against the hypothesis as three 
negative differences with ranks ], 2, and 3. 
Readers interested in setting confidence limits 
using signed ranks should see Moses (1952) . 
Mann-Whitney test. The parallel nonpara- 
metric test to use with ranks when two samples 
are not composed of matched pairs, and may be 
of unequal size, is the Mann-Whitney test. 
[Wilcoxon (1945) seems to have invented this 
test too, as Mann and Whitney note, but the 
*latter authors made a rather extensive investi- 
gation of its properties. Because of its elegant 
simplicity it is understandable that it has been 
invented independently quite often.] The null 
hypothesis is that the two samples have been 
" 
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drawn from a common population. If there are 
two populations 4 and B the test is sensitive to 
differences in the population that make it more 
likely that an observation drawn from 4 will 
be larger than one from B. An easy example 
is that of two populations identical in shape 
except that their means differ. 

We suppose there are X's and Y's, m of one 
kind and n of the other. The first step is to 
arrange the (m -+ n) observations in order 
from least to greatest, and then to give the ob- 
servations their ranks. For example, suppose 
we have samples of X's and Y's as follows: 


X: 1, 8, 17, 22, 45 
Ranks: 1, 5, 9, 10, 11 

Fe AG 7 9; 10/12 
Ranks: 2, 3, 4, 6, 7, 8 


Sum of ranks of smaller sample 
T = 36 


Having ranked the observations, we sum the 
ranks in the sample of smaller size (if the sizes 
are equal it does not matter which is chosen for 
a two-sided test). We call this sum T, and in 
our example its value is 36. The size of the 
smaller sample is called m, and the size of the 
larger n. In our example m = 5, n = 6. To 


test significance we enter Table IX with these 
values of m and n. Because 36 does not fall on 
or outside the limits 18 and 42 given there, we 
will not reject at the 5% level the hypothesis 
that the populations are identical. Table IX 
was derived from Mann and Whitney's tables 
(1947) of the distribution of their U-statistic. 
The statistic T seems slightly easier to use be- 
cause it involves less computation than U. For 
onesided tests, where the directionality of the 
difference is suggested in advance of the data, 
the significance levels in Table IX can be 
halved. 

With pairs of samples that fall outside Table 
IX, provided m is greater than four, we should 
obtain a close approximation to the correct 
significance level by using as a normal deviate 


lE Tg 
gp 4 
where 


T--- (m 4-n4- 1), 


edu. nn En 1) 
12 


It is clear that T has the form given because 
the average rank in the whole set is (m -+ n + 


TABLE IX 


MANN-WHITNEY TEST 


Critical values of T, the sum of the ranks for the smaller sample. The smaller sample has size m, and the 
larger one size n. The hypothesis of equality of distributions is rejected if the observed T falls at either 
limit, or outside the range of the pair of numbers shown in each cell. The upper pair gives critical lim- 


its at the 5% 


level (2-sided) , and the lower pair in parentheses gives the critical limits at the 10% level 


(2-sided) . 
n\m 2 3 4 
d = (6.15) 
^ - - 10,26 
- (6,18) (11,25) 
i - 6,21 11,29 
: (5,13) (7,20) (12,28) 
7 - 723 12,82 
(3,15) (8,22) (13,81) 
d S 7,26 13,35 
4 (3,17) (8.25) (14,94) 
" 3,19 8,28 14,38 
(4,18) (9,27) (15,37) 


17,38 
(19,36) 
18,42 26,52 
(20,40) (28,50) 
20,45 27,57 36,69 
(21,44) (29,55) (39,66) 
21,49 29,61 38,74 49,87 
(25,47) (31,59) (41,71) (51,85) 
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1) /2, and there are m things ranked. The vari- 
ance of T is also obvious from the formula for 
the variance of sums of observations drawn 
from a finite population, namely, mo? (N — 
m) /(N — 1), where N is the population size, 
m is the sample size, and c? is the variance of 
the original finite population. Here N = m + 
n, m is our smaller sample size, c? is the vari- 
ance of the first m + n integers, which is well 
known to be [(m + 7)? — 1]/12. Making 
these substitutions gives c7? immediately. 

Whitney's extension of the U-statistic. When 
there are three samples rather than two, we may 
wish a nonparametric test to decide whether all 
three came from the same population, or 
whether to prefer an alternative such as 

(i) a particular population, specified in ad- 
vance, tends to yield smaller observations than 
either of the others; or 

(ii) a particular order for the populations, 
specified in advance, exists such that if one 
observation is drawn from each, and any par- 
ticular number is chosen, the observation from 
the population A is more likely to be smaller 
than the chosen number than the observation 
from population B, and the observation from 
population B is more likely to be smaller than 
the chosen number than the observation from 
population C. 

If the frequency distributions A, B, and C 
do not overlap and their means are 10, 50, 120, 
respectively, then they are clearly in the order 
A, B, C. When the frequency distributions do 
overlap it becomes necessary to introduce the 
words "stochastically smaller." Perhaps a nu- 
merical example will make this idea clearer. 
We shall work with test scores on four groups, 
A, B, C, D. The main point concerns the cumu- 
lative frequency distributions, and that is what 
we give below. 


Cumulative frequency distributions of four groups 
of test scores, Entry is percent of group scoring at 
or below the number in the "test score" column. 


Groups 

Test score A B C D 
TENEO 

10 100 100 100 100 

9 100 100 100 95 

8 100 100 100 80 

7 100 100 90 75 

6 100 90 80 60 

5 100 75 50 55 

4 100 40 30 30 

3 75 25 15 25 

2 50 20 10 20 

1 20 15 5 18 

0 10 5 0 8 
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In the table the scores from group A are 
stochastically smaller than those for B, C, or D 
because group 4 always has at least as many 
scores at or below a particular number as any 
of the other groups. Similarly, the scores from 
group B are stochastically smaller than those 
from group C. Note that it is the test scores 
that tend to be smaller, the entries in the table 
are larger. Group D scores are neither stochas- 
tically larger nor stochastically smaller than 
those for groups B and C. This happens be- 
cause group D has too many high scores and 
too many low scores. Comparing D with B, for 
scores from 4 to 9, B has the more frequent 
small scores, but for scores 0 and 1, D has the 
more frequent. We return now to the alterna- 
tives given above. 

If alternative (i) were true we would say 
that observations from the first population 
were stochastically smaller than those for either 
of the other two populations. If alternative 
(ii) were true we would say that observations 
from population 4 were stochastically smaller 
than those from the other two, and that ob- 
servations from population B were stochasti- 
cally smaller than those from C. An easy 
example occurs when we have three normal 
populations with equal standard deviations but 
different means, Then observations from the 
population with smallest mean are stochasti- 
cally smaller than observations from either of 
the other populations. 

Our interest in a test with alternatives like 
(i) and (ii) is in the spirit of our discussion 
of allowances earlier. We are not satisfied to 
discover whether the three populations differ, 
we want to know whether the results fall in an 
order specified in advance. When we have a 
theory about three groups, the theory will usu- 
ally speak about the order of the outcomes. 
We want to have a statistical test to go with 
such specific predictions. It is some trouble to 
work out the test, but if one has this specific 
problem he may find it worth the effort. We 
have provided tables to facilitate the work. 

Suppose we have the following setup: 


Populations 4 B C 
Samples x y z 
Sample sizes | m n 


We deal first with alternative (i). Are the x's 
stochastically smaller than the y's and the z's? 
If the x's are smallest in the population, they 
should exhibit it in the sample. We define 
two quantities U and V. The first is a measure 
of how often y's are smaller than x's, and the 
second a measure of how often z's are smaller 
than x's. Clearly, if the x's are stochastically 
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smallest, U and V should be small. Specifically: 


U is the number of times a y is smaller than 
an x. Each y is tried against each x. 


V is the number of times a z is smaller than 
an x. Each z is tried against each x. 


Example: When the observations are arranged 
from least to greatest, we observe for 6 x's, 3 y's 
and 3 z's the order 


yyxxxyxzxzzx. 


The first and second 5's are ahead of 6 x's each 
and the third y is ahead of 3 x's, making a total 
of U — 15. Similarly, the first z is ahead of two 
x's and the last two z's are ahead of one each, 
making a total V — 4. 

When the null hypothesis is true and the 
sample sizes are not too small (it is not actually 
known what too small is; we would advise 
samples of at least 5) , it turns out that U and V 
are approximately distributed according to a 
normal bivariate distribution with means 


U — Im/2 


and variances 


V = In/2 


a Im(l E m 4 1) 
an 12 


So 
T NC egret), 


and correlation 


Pov = uiid , 
VAUER EY 


The observed U and V can be transformed to 
standard normal deviates: 


u = ——— ——-, 
Hp ey 
and we ask whether u and v are both too small. 
To make significance tests we have to use 
Pearson's tables of the bivariate normal (Pear- 
son, 1931). The tables give for selected pairs 
of values of standardized normal deviates h, k 
with various correlations, the probability that 
both h and k will be exceeded. This means 
that we should be able to set the size of the 
region for rejection of the null hypothesis by 
using the tables. If the sample sizes l, m, n are 
roughly equal, then to test whether x is the 
smallest we want simultaneously 


Tus ET T NEAN LT 
cr Cy 
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where h and k are chosen to give an appropriate 
significance level. Whitney suggests that h and 
k might be set equal, the critical value being a 
value of h such that the probability that both 
inequalities hold when the null hypothesis is 
true is about 5%. 

Since we expect U and V to be small, the 
values of h and k will be negative. Table X 
gives the values of h for various correlation co- 
efficients p at three significance levels: 1%, 5%, 
10%, i.e., 

Prob. (u = h, vÆ h) — 0.01, 
Prob. (u Æ hk, vÆ h) = 0.05, 


Prob. (u=h, vÆ h) — 0.10. 


TABLE X 


VALUES FOR h FOR TESTING NULL HYPOTHESIS 
AGAINST ALTERNATIVE HYPOTHESIS 


x«y x«z 
Significance level 
Correlation 0.10 0.05 0.01 
0.00 —0.48 —0.76 —1.28 
0.05 —0.51 —0.79 —1.33 
0.10 —0.54 —0.83 —1.97 
0.15 —0.57 —0.86 —1.41 
0.20 —0.59 —0.89 —1.46 
0.25 —0.62 —0.93 —1.50 
0.30 —0.65 —0.96 —1.54 
0.35 —0.68 —1.00 —1.59 
0.40 —0.71 —1.03 —1.63 
0.45 —0.74 —1.07 —1.67 
0.50 —0.77 —1.10 —1.71 
0.55 —0.80 —1.14 —1.76 
0.60 —0.84 —1.18 —1.80 
0.65 —0.87 —1.21 —1.85 
0.70 —0.91 -135 —1.89 
0.75 —0.95 —1.29 —1.95 
0.80 —0.99 —1.34 —1.99 
0.85 —1.03 —1.39 —2.05 
0.90 —1.08 -144 —2.11 
0.95 -1.15 -.51 —2.18 
1.00 —1.28 -1.65 —2.55 


Null hypothesis will be rejected if simultaneously 

— eke — 
"p Ze 

Example: We suppose x is stochastically 

smallest and observe for | —5, m — 5, n = 5 

the result 
XXXyxyzyxyzrnyiii 
U m4, V —1, U — 125, P — 12.5, 
LR 4.8, Gy — 4.8, for 0.45. 


u E 
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From Table X we read that for a correlation of 
0.45 the critical value of h at the 5% level is 
—1.07 


4— 125 1— 
ffe uc e DIT Ur 
Because both u and v are less (more negative) 
than the critical value of h, we reject the null 
hypothesis, and accept x as stochastically 
smallest. 

We turn now to testing whether the three 
populations are in an order prescribed in ad- 
vance. It will be a great convenience if we can 
keep the same definitions for U and V and 
rearrange the stochastic order. We still have l 
x's, m y's, and n z's. Let us suppose that the 
order specified in advance is y smallest, x next 
smallest, z largest. We order the observations 
from least to greatest as before. If y is to be 
smallest, it should appear often before x, and 
therefore U should be large. If z is to be largest 
it should seldom appear before x and therefore 
V should be small, as before. All computations 
are as before, but we want 


u 2 hand v £ —h 
simultaneously, where k is positive. 


Example: We suppose y is smallest, x next, z 
largest, and we observe the order 
yyyxzyxyxzxxzzz l=m=n=—5 
U—22,V — 6, U — V — 125, 
Pov = 0.45, 
u = 1.98, v — —1.35. 


Op = oy = 4.8, 


From Table XI we find the critical value of h 
for the 5% level to be 0.45, and because u is 
larger than 0.45 and v less than —0.45 we reject 
the null hypothesis and accept the supposed 
order. 


The problem of m rankings. If we have a 
two-way analysis of variance table where several 
judges have each ranked the same set of stimuli, 
we may wish to measure the concordance (or 
discordance) of the rankings. Each judge will 
have used the same ranks, so the total ranking 
for any one judge is the same. We cannot tell 
anything, therefore, about differences between 
judges. We can tell whether the judges are 
agreeing better than a dummy who does his 
rankings by random placement. (The same 
procedure may be used on measurement data if 
they are appropriately ranked.) For example, 
Guilford (1936, p. 247) gives ranks assigned to 
12 handwriting samples by various judges. We 
shall use his first 5 judges to illustrate. 
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Taste XI 


VALUES FOR h FOR TESTING NULL HYPOTHESIS 
AGAINST ALTERNATIVE HYPOTHESIS 


yEx <z 
Correlation Significance level 

T 0.10 0.05 0.01 
0.00 0.48 0.76 1.28 
0.05 0.45 0.73 1.24 
0.10 0.42 0.69 1.20 
0.15 0.39 0.66 1.15 
0.20 0.37 0.65 1.10 
0.25 0.34 0.59 1.06 
0.30 0.31 0.56 1.01 
0.35 0.28 0.52 0.96 
0.40 0.25 0.49 0.91 
0.45 0.22 0.45 0.86 
0.50 0.19 0.41 0.81 
0.55 0.17 0.38 0.76 
0.60 0.14 0.34 0.70 
0.65 0.10 0.30 0.64 
0.70 0.07 0.26 0.58 
0.75 0.04 0.21 0.51 
0.80 0.01 0.17 0.44 
0.85 0.00 0.12 0.37 
0.90 0.00 0.07 0.28 
0.95 0.00 0.00 0.17 


Null hypothesis will be rejected if simultaneously 
U—U v—V 


u= >h, v= «—h 
Su Y 
Handwriting Sample 
Judge ABCDEFGH J KL 
1 12 81110 96 5 47 12 8 
2 11121097 864 13 52 
3 10119 9 7 B'"6 B*S VOTI 
4 9121110 8 7 435 6 2-I 
5 12101 9 47 8 16 2 5$ 
Totals 5453 55 47 35 36 29 17 22 16 15 11 


If all stimuli lead to similar outcomes we ex- 
pect the sum of the ranks for each column to 
be roughly the same size. But if some stimuli 
produce larger and others smaller effects, we 
expect such stimuli to produce differing column 
sums. To evaluate the significance of the dis- 
parity among column sums, we compute y? 
with p — | degrees of freedom: 


e. ee TES x sum (rank total) ? | 


— 3n (p + 1), 


where we are summing the squares of column 
totals, and 


^ 
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n — number of replications (rows), 
p — number of treatments (columns) . 


The subscript r on y? serves notice that this 
is a y? derived from ranks and, in fact, is an 
approximation to y? rather than an exact y? 
statistic. 

For Guilford's handwriting example, we get 


sum (rank total) 2 — 15,696, 


12 (15,696) 
% = $03) (13) 


= 46.5, 


— 3(5) (13) 


with 11 degrees of freedom, Clearly this result 
is highly significant, and it means that the 
judges agree that the stimuli are different. In 
a case like the present one where the ranks are 
obviously in good agreement, the test gives little 
information that could not be had from casual 
observation of the table. Although the matter 
has not had theoretical investigation, we would 
suggest that the Link-Wallace short-cut to al- 
lowances in one-way tables, discussed above, 
might reasonably be applied here. Rather than 
testing for concordance, we would be attempt- 
ing to decide which pairs of stimuli should be 
“recognized” as different. The method of m 
rankings was developed by Friedman (1937) , 
but expositions of this and other ranking meth- 
ods together with references are probably more 
readily available in Kendall (1943, Chapter 16) 
and Kendall (1948). 

The complete ranked one-way analysis of 
variance, The previous subsection treated 
ranked data where the ranking was done only 
in single rows, rather than over the entire table. 
A new development (Kruskal and Wallis, 1952) 
concerns one-way analysis of variance where the 
ranking is done for all observations in the table. 
(The same paper offers a survey of the literature 
of ranking methods.) 

The H-test presented below requires that all 
observations be ranked together, and that the 
sum of the ranks be obtained for each sample. 
The test statistic to be computed if there are 
no ties (no two original observations equal) is 


[4 
12 RAM. s 
“WWD ds WD. 
D 


(no ties) 
C — number of groups, 

n, — number of observations in the ith group, 
N — En, the total number of observations, 


R, — sum of the ranks in the ith group. 
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Large values of H lead to rejection of the hy- 
pothesis of equality of the original C popula- 
lations. If the n, are not too small, H is 
approximately distributed like chi-square with 
C — 1 degrees of freedom. We present tables 
for sets of three samples whose sizes are each 
less than six. 

If there are ties, each observation is given the 
mean of the ranks for which it is tied. Then H 
is computed as above, and divided by 


ET 
N3—N’ 
where T — (t — 1) t (t + 1) = t3 — t for 
each set of ties, ¢ is the number of tied obser- 
vations in a set, and the summation is over 
all sets of ties. The following table helps: 


t 1020882. dee B 6 ZUR 9 10 
T 0 6 24 60 120 210 336 504 720 990 

Example (with ties). In Table XII we analyze 
some artificial data constructed to display the 


problem of ties. We have three groups of un- 
equal sizes, so C — 3. 


correction for ties — 1 — 


TABLE XII 


EXAMPLE OF THE H STATISTIC FOR RANKED 
ANALYSIS OF VARIANCE 


‘Treatment 


1 


Obser- 
vation. Rank 


2 


Obser- 
servation Rank 


Obser- 


servation 


Rank 


Ties: 5at0,3at1,3at 2. 


ET = 120 + 24. 24 = 168. 
N3—N = 183—18 = 5,814. 


t8 Lamm 
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We conclude that there is not a significant difference 
between the groups. Note that the huge observation 
88 in Treatment 1 ultimately contributed little more 
than the observation 15. Note also that the correc- 
tion for ties changed matters very little (about 3%) . 


Wald-Wolfowitz run test. The run test in- 
quires whether two samples (not necessarily 
of equal size) are drawn from a common popu- 
lation. No matter in what way populations 


321 


differ — medians, variability, skewness — the 
run test will reject the null hypothesis of no 
difference with as large a probability as we wish 
with big enough samples. 

To make the test, pool the samples and ar- 
range the data in order of increasing size. Moses 
presents the following example: 

X1Xo¥iXgVo¥gVsV¥sXsXo¥o 


Re CX CHER CE 


TABLE XIII 


TRUE DISTRIBUTION OF H FOR THREE SAMPLES, FACH OF SIZE FIVE OR LESS, IN THE NEIGHBORHOOD OF THE 
10, 5, AND 1 PERCENT POINTS 


The probabilities shown are the probabilities, under the null hypothesis, that H will equal or exceed the 
values in the column headed H. 


True 

Sample sizes H proba- 
mu ma Ny bility 

NEN ee ee 

2 2:74 4.57 .067 
3 2 1 4.28 .100 
3 25.8 4.71 048 
4.50 .067 

3 5 1 5.14 0438 
4.57 -100 

3 $2 5.36 032 
4.55 .100 

3 3 3 7.20 004 
5.60 050 

4.62 .100 

4 2 l 4.50 076 
4 2 2 5.33 033 
4.45 .100 

4 3 1 5.20 050 
4.05 093 

4 8. 2 6.44 .008 
5.44 046 

4.51 .098 

4 52:58 6.74 .010 
5.72 .050 

4.70 .092 

4 4 1 6.66 010 
4.96 048 

4.16 .082 

4 4 2 7.03 006 
5.45 046 

4.55 098 

4 4 5 7.14 010 
5.59 .049 

4.54 .099 

LX DE 7.65 008 
5.69 049 

4.65 .097 

5 2 H 5.00 .048 
4.20 095 

Ee Ss 6.53 :008 


True 
Sample sizes H proba- 
Ny Ng Mg bility 
Due c ES 
5.16 ER 
4.37 090 
cy pat TID: 4.96 048 
4.01 095 
5 3 2 6.82 010 
5.25 049 
4.65 091 
Blocs 7.07 .009 
5.64 DE 
4.53 097 
$. ie e 6.95 .008 
4.98 044 
3.98 098 
5 4 2 7.1 .010 
5.26 .050 
4.54 .098 
5.7458 7.44 010 
5.63 .050 
4.54 .099 
NI TRE] 7.76 BO 
5.61 .050 
4.61 .100 
bipctore tl 7.30 .009 
5.12 046 
4.10 .086 
LAN, SH 7.26 010 
5.33 4047 
4.50 .100 
Ae Pe, 7.54 010 
5.70 046 
4.54 .100 
BS X 7.79 010 
5.64 050 
4.52 099 
5. CR 7.98 010 
5.78 049 
4.56 .100 
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A run is a group of consecutive values drawn 
from only one population. Runs of X’s are 
underlined, and runs of Y's appear between 
the underlined X's. In this example there are 
nine runs altogether. If two samples are drawn 
from the same population, we expect that the 
number of runs will be fairly large, since we 
expect that X and Y values will be well inter- 
mingled. There is no reason to expect all the 
Y's to fall at one end and all the X's to fall at 
the other end, for instance, unless the popula- 
tions are really very different. This situation 
would lead to our observing only two runs. 
"Too few runs, then, will cause rejection of the 
null hypothesis. 

We present in Table XIV critical values of 
the number of runs d for m less than or equal 
to n. Here m and n are the numbers of ele- 
ments or observations of each kind. This table 
is used to test at the 595 level whether there 
are too few runs, If there are too few runs, the 
populations from which the samples are drawn 
differ. In our example there are 10 X's and 7 
Y's; therefore, m — 7, n — 10. We observed 
d — 9. Reading from Table XIV, we see that 
the critical number of runs is 5, and so our re- 
sult does not imply too few runs. 

A somewhat different problem is that of test- 
ing randomness of two kinds of elements. For 
example, a half-hour before the popular lunch 


H 
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hour the following arrangement of empty and 
occupied seats at a lunch counter might be ob- 
served (Swed and Eisenhart, 1943) : 


EOEEOEOEOEEEOEOE 


The question is whether the people are ran- 
domly seated or whether they prefer elbow- 
room (leading to too many runs). Here we 
have 10 E's and 6 O's, so m — 6 and n — 10; 
also the number of runs is d — 13. From Table 
XV, the critical value of d is 12. Because our 
observed number is larger, we reject the hy- 
pothesis of random seating. 

For values of m and n outside the table we 
can use as a normally distributed variable 
(Mood, 1950, p. 393) 


pera Ae 
Ca 
where 
EI 2mn 
Tien 


2mn (2mn — n — m) 
= mfn (m-fn—ly 


In the correction for continuity we use +14 if 
we are interested in the probability of d or 
fewer runs, —14 if we want the probability of d 
or more runs, 


oq? 


TABLE XIV 
CRITICAL VALUES OF d (NUMBER OF RUNS) FOR VARIOUS VALUES OF M AND n, THE NUMBERS OF THE 
TWO KINDS OF ELEMENTS 


The values tabled are the largest values of d that give significance at the 5% level. If the observed d is as 
small as or smaller than the entry, there are too few runs. 
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Moses points out that tests of this kind which 
guard against many alternatives are not very 
good at guarding against some one particular 
alternative. Thus, to test specifically whether 
two samples came from a population with the 
same median, we will be more likely to detect 
this departure using the Mann-Whitney test, a 
test specifically designed to guard against the 
alternative of interest. Furthermore, when a 
test guards against many alternatives and we 
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reject the null hypothesis, we find ourselves in 
the unsatisfactory position of not being able to 
pin down what alternative (s) has led to the 
rejection. To many persons engaged in experi- 
mental work it is often the specific nature of a 
departure that is interesting, but, as with many 
other things, it is only by sacrificing some gen- 
erality in the hypothesis that particular speci- 
ficities in the data have a chance to display 
themselves. 


'TasLE XV 


CRITICAL VALUES OF d (NUMBER OF RUNS) FOR VARIOUS VALUES OF M AND n, THE NUMBERS OF 
THE TWO KINDS OF ELEMENTS 


The values tabled are the smallest values of d that give significance at the 5% level. If the observed d is as 
large or larger than the entry, there are too many runs. 
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Quick METHODS 


Standard deviation of a mean. Often one 
wants to know quickly an approximate esti- 
mate of the standard deviation of a mean for a 
small set of data (in order to make a rough t- 
test, for example). A good approximation 
(Mantel, 1951) can be obtained quickly by di- 
viding the range by the total number of obser- 
vations, i.e., 


largest observation — smallest observation 
number of observations 


— estimated standard deviation of mean 


for samples of size 15 or less. This estimate 
seems to be good regardless of the form of the 
population distribution — the estimate works 
well for even the uniform distribution up to at 


least 10. For example, with the six observa- 
tions 6, 7, 10, 12, 15, 18 Mantel's procedure 

18—6 

6 

square formula gives 1.89. 

t-Tests. To test whether a mean is equal to a 
standard or whether two samples are drawn 
from populations with equal means, Lord 
(1947) suggests two tests based on the range 
suitable for samples up to about size 20. He 
suggests replacing the ordinary t-test by the 
statistic 


gives — 2. 'The longer root mean 


x—m 
ANCHO d g 


where x is the sample mean and R is the range 
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of the sample. Similarly, in testing for the 
equality of means with two samples of equal size 
he proposes the statistic 


Lm 
Va (Ry + Ra) 


where the x's are the sample means and the R's 
represent the ranges of the samples. Both tests 
are for use with small samples, say samples of 
sizes from 2 to 20. Lord provides special tables 
for using these statistics in significance testing. 
Since these tables are not widely available and 
since it is possible to relate them closely to a 
t-table by introducing a conversion factor, we 
give conversion factors which will make it pos- 
sible to use these statistics with an ordinary 
ttable and with the same numbers of degrees 
of freedom as are customarily used in such tests. 
In particular, for the one-sample case 
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get exactly the results given by Lord's tables, 
but they will be extremely close. 

The principal value of such statistics is that 
they avoid the necessity of the root mean square 
computations ordinarily used, while sacrificing 
little accuracy. 


Taste XVI 


FACTORS FOR CONVERTING LORD'S ONE-SAMPLE AND 
TWO-SAMPLE /-TESTS TO ORDINARY -TABLES 


Here n is the sample size. We multiply —— 


by C, and enter a t-table with n — 1 degrees of free- 
dom. For two samples, both size n, we multiply 


PELA 
V& (Rı + Rə) 
with 2n — 2 degrees of freedom. 


by K, and enter an ordinary t-table 


Single sample size One-sample test Two-sample test 


x—m 
tr = (x Cw n GC. K, 
where 2 2.00 1.96 
C, = 5 /n + 0.5 — 6. 3 3.30 2.19 
4 4.44 3.02 
The t,-, can be used for the one-sample test, 5 5.48 3.77 
where the subscript n — 1 on the t refers to the 6 644 447 
number of degrees of freedom for entering an 7 7.35 5.14 
ordinary t-table. 8 891 575 
In the two-sample case x i 
P , 9 9.04 6.35 
E o coo MEER 
P ser Medi Dagesrees meet ns ü ^ 
Vo (Ry + Ro) : 
wer tte: 
—— e! m 
K, = 4 Wn + 24—7.14. t 12.7 9.03 
In Table XVI we present values of the cor- 15 13.4 9.50 
rection terms C, and K, for single samples of 16 14.1 9.98 
size n — 2 to 20. 'Thus the approximate for- 17 17.7 10.5 
mulas need not be used. It should be under- 18 15.4 10.9 
stood that if one uses C, and K,, either from 19 16.0 11.3 
the tabled values or the formulas, one will not 20 16.6 11.8 
‘TRANSFORMATIONS 


Any method of systematically transforming a 
given set of numbers into a new set may be 
regarded as a transformation, Such a general 
definition does not get us very far, so let us 
discuss a few examples. (i) We are familiar 
with transforming observations to standard 
‘scores by multiplying each observation by one 
number and then adding another number. 
This transformation gives us a new set of obser- 
vations whose mean and standard deviation 
have been adjusted to our satisfaction. If in- 
dividuals have been measured on several charac- 
teristics, and each measure expressed in stand- 


ard scores, then an individual has a profile of 
scores readily interpretable at a glance. One 
use, then, of transformations is to make data 
more comparable. (ii) The standard score 
transformation is just an example of the gen- 
eral linear transformation y — a + bx, where 
x is an original observation and a and b are 
constant. This general transformation makes it 
possible to adjust a set of numbers so they have 
any desired zero point and any desired unit of 
measurement. (iii) Another transformation 
that often improves comparability is the use of 
percentages or ratios rather than raw numbers. 
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This use is so common we shall not discuss it 
further. (iv) We often see graphs of one vari- 
able against another where the relation between 
the two is a straight line, but one axis repre- 
sents, say, the logarithm of the time, and the 
other perhaps rate of responding. Straight lines 
always look good in graphs, and most authors 
prefer to mangle (transform) one or both of 
the variables if there is any possibility of dis- 
playing linearity. Such adjustments sometimes 
suggest theoretical possibilities. (v) To get 
back to analytical statistics, we have seen trans- 
formations to ranks used on a number of occa- 
sions in the present chapter. We speak of not 
needing to know the original form of the distri- 
bution if we use ranks. Just what does this 
mean? Take the Mann-Whitney test as an 
example. If we do indeed have two identical 
continuous populations and we draw a sample 
of two observations from each, then we know 
that each of the six possible outcomes of order- 
ing the four observations has exactly the same 
chance of occurring. They are: 


xxyy yxxy 
xyxy yxyx 
xyyx yyxx 


The simple process of ranking the’ observa- 
tions then replaced a situation where we knew 
practically nothing usable about the original 
observations by a new situation where we know 
everything about the distribution of the ranks 
when the null hypothesis is true. Thus the 
transformation to ranks can sometimes divorce 
us from the unknown. (vi) The transforma- 
tion to signs plays the same role for the sign 
test. (vii) When we replace an observed value 
of chi-square or of a normal deviate by a 
probability level read from a table we are mak- 
ing a transformation whose utility lies in its 
ease of interpretation as a frequency of occur- 
rence. (viii) Some people make transforma- 
tions so that their adjusted data will be more 
nearly normally distributed than the original 
observations, for example, the logarithms of 
the lengths of sentences written by some famous 
authors are approximately normally distrib- 
uted, whereas the original distribution of 
number of words has a rather extended tail and 
is quite asymmetric. In this example, a good 
deal of information about the original distri- 
bution can be transmitted by reporting the 
mean and standard deviation of the logarithmic 
sentence lengths. Such transformations designed 
to achieve approximate normality are not alone 
a feature of empirical distributions but also are 
used for the theoretical distributions. To 
illustrate, many chi-square tables carry instruc- 


tions in a footnote to the effect that beyond 
n = 30 degrees of freedom the expression 
V/2y2 — \/2n — 1 is approximately a normal 
deviate with zero mean and unit standard devia- 
tion. Similarly, the z-transformation for testing 
the significance of correlation coefficients is a 
device for achieving approximate normality. 
We wish to describe a type of transformation 
that has a slightly different purpose from the- 
ones mentioned above. In many kinds of meas- 
urements the larger the thing being measured 
the more variable the measurements. For ex- 
ample, ask subjects to close their eyes and press 
a button 2 seconds after a buzzer sounds, and 
then ask them to press after 60 seconds. Times 
till pressing will be much more variable for the 
60-second data than for the 2-second data. If we 
have data for several such times we will find 
then that the different times do not have the 
same variability, (This terrible statistical dis- 
ease has been christened “heteroscedasticity,” 
a fact which has blighted the lives of many stu- 
dents. Some years ago much was made of other 
statistical diseases such as kurtosis, but the fad 
seems to have passed.) Sometimes a transforma- 
tion such as taking logarithms, square roots, or 
cube roots of the original data will make the 
variances of the adjusted observations approxi- 
mately equal for each time or, more generally, 
each condition of an experiment. If such a 
transformation can be found it is called, ap- 
propriately enough, a “variance-stabilizing 
transformation.” An important feature of a 
variance-stabilizing transformation is that it 
leaves the group means in a reasonable order. 
A few readers may have been thinking “Why 
not convert each time group to standard 
scores?” But this technique will remove the 
relation between instruction time and observed 
waiting time, which is of interest to the experi- 
menter. Logarithms or square roots will at least 
leave the increase of observed time definitely 
dependent on the length of time requested. 
But why do we want equal variances anyhow? 
One of the requirements of analysis of variance 
is that each cell of the original table should 
represent a normal population with the same 
variance, although not necessarily the same 
mean. So a move to transformed data with 
more nearly equal variances would more nearly 
meet the requirements of analysis of variance. 
To suggest a specific use, suppose we try to use 
tke residual mean square to assist in setting 
confidence limits on the mean of a specific cell 
in the analysis of variance table. If we happen 
to have unequal variances in the cells we may 
find that the cell we chose has observations in 
it that are outragequsly disparate compared 
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with the prediction that is being made. Had 
we made a variance-stabilizing transformation 
before the analysis, we could have set confidence 
limits on the transformed mean, and then 
transformed these limits back to the original 
units without the same risk of error. 
Up to this point we have talked about vari- 
ance-stabilizing transformations in the absence 
of theoretical notions about the form of the 
‘distribution. Much data is derived from count- 
ing things. If we have a fixed number of trials, 
we count the number of successes. If a subject 
has the same chance of success p on each trial, 
then scores for this subject should be binomially 
distributed with mean proportions of successes 
p, and variance of the proportion from one set 
of trials to another p (1 — p) /n, where n is the 
number of trials in a set. Note that the variance 
depends on the mean. When this happens we 
can usually find an approximate variance-stabi- 
lizing transformation, and for the binomial we 
shall give one below. We shall also give an 
additional reason for the use of such a trans- 
formation. | 
A second kind of counting distribution is the 
Poisson. It usually arises when we have a fixed 
exposure (in space or time or both) and count 
the number of events occurring in the exposure. 
For example, subjects observe a staircase illu- 
sion for 20 seconds and press a button every 
time it changes position. If the subject does this 
several times under nearly identical conditions 
we expect the distribution of counts to be ap- 
proximately Poisson. If the mean number of 
alternations per occasion is m, then the variance 
is also m. Again we can find an approximate 
variancestabilizing transformation. It turns out 
to be the square-root transformation. To take 
another example of a distribution that may be 
approximately Poisson, Eichler (1951) proposes 
using the square-root transformation on Ror- 
schach scores as a method of reducing the 
influence of total productivity on subscores. 
Because the total number of productions of a 
particular Rorschach sign is small, it would be 
reasonable to suppose the distribution of the 
number on repeated takes by the same person 
to be approximately Poisson. The square-root 
transformation is appropriate for Poisson data. 
Counts from binomial or Poisson data (or Ror- 
schach data) have variances that depend on 


Theoretical variance 


n p=050 92 
10 78.2 76.1 
Ratio of computed to theoretical 0.97 
20 40.0 39.2 


Ratio of computed to theoretical 0.98 
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the unknown true means — this difficulty can 
often be circumvented by using the inverse sine 
transformation and the square-root transforma- 
tion to obtain new observations whose variances 
do not depend much on their means. 

Freeman and Tukey (1950) suggest for the 
Poisson that the variance-stabilizing properties 
of the square-root transformation are improved 
by using 

Vx+/x+1, 
where x is the observed count, instead of just 


vyx or \/x + Lé The variance of Poisson data 
so transformed is approximately unity. In 
Table XVII we provide a set of tables of this 
square-root transformation for Poisson variables 
and its square for x running from 0 to 50. This 
table should facilitate making analysis of vari- 
ance computations using this transformation. 

The third column in Table XVII is the square 
of the two-decimal place number given in the 
second column, rather than the correct four- 
decimal result, because these are the appropriate 
squares to use if the two-decimal transformation 
in the second column is employed for analysis 
of variance purposes. 

For the binomial Freeman and Tukey pro- 
pose a slight modification of the usual trans- 
formation, i.e., 


Ha c Vee + sin-1 va) 


and data thus transformed have approximately 


variance for angles in degrees. Here 


821 
7nd 05- 
x is the count, n is the sample size. Values of 
sin-1\/p can readily be obtained because 
Snedecor (1946, pp. 449-450) has reproduced 
C. I. Bliss's tables of this function. 

'The inverse sine transformation must be 
computed separately for each sample size. We 
have investigated how well the Freeman-Tukey 
inverse sine transformation stabilizes the vari- 
ance, and below the reader can see that this 
transformation does indeed produce values very 


close to the theoretical value of 


n -+ 0.5 
even for p's as low as 0.10 and small sample 
sizes. We show results of two computations, 
one for n = 10 and one for n = 20. 


Computed variance 


p=040 p=030 p=020  p-010 p=0.05 
76.7 78.8 81.8 74.9 53.7 

0.98 1.01 1.05 0.96 0.69 
39.2 39.3 40.0 42.2 38.0 

0.98 0.98 1.00 1.05 0.95 
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Taste XVII 
TABLE OF VALUES FOR THE FREEMAN-TUKEY SQUARE-ROOT TRANSFORMATION 


The observed count is x and the transformed values are \/x + \/x + I. The squares of the transformed 
values are also given for use in analysis of variance computations. 


x00 VE+ VET (Vet yeti)? Vet vet (OVER 
0 1.00 1.0000 26 10.30 106.0900 
1 2.41 5.8081 27 10.49 110.0401 
2 3.15 9.9225 28 10.68 114.0624 
5 5.75 18.9129 29 10.86 117.9396 
4 4.24 17.9776 30 11.04 121.8816 
5 4.69 21.9961 31 11.22 125.8884 
6 5.10 26.0100 32 11.40 129.9600 
7 5.47 29.9209 33 11.58 134.0964 
8 5.83 33.9889 34 11.75 138.0625 
9 6.16 87.9456 35 11.92 142.0864 
10 6.48 41.9904 36 12.08 145.9264 
11 6.78 45.9684 87 12.25 150.0625 
12 7.07 49.9849 38 12.41 154.0081 
13 7.35 54.0225 39 12.57 158.0049 
14 7.61 57.9121 40 12.75 162.0529 
15 7.87 61.9369 41 12.88 165.8944 
16 8.12 65.9344 42 18.04 170.0416 
17 8.37 70.0569 43 13.19 173.9761 
18 8.60 73.9600 44 13.34 177.9556 
19 8.83 77.9689 45 13.49 181.9801 
20 9.05 81.9025 46 13.64 186.0496 
21 9.27 85.9329 47 13.78 189.8884 
22 9.49 90.0601 48 13.93 194.0449 
23 9.69 93.8961 49 14.07 197.9649 
24 9.90 98.0100 50 14.21 201.9241 
25 10.10 102.0100 


As one would expect, the stability of the vari- 
ance improves as the sample size increases. 
Freeman and Tukey show that their square-root 
transformation stabilizes Poisson variance satis- 
factorily for true means as small as 1.00. 

"The most valuable property of these trans- 
formations does not seem to be the variance- 
stabilizing property however. Rather, when 
these transformations are used we are provided 
with a known base-line variance. In an analysis 
of variance of the transformed data the error 
variance can be compared with the theoretical 
error variance. This provides an experimenter 
with important knowledge, for if he can esti- 
mate how much of the variance in an experi- 
mental situation is "natural variance," he is in 
a good position to assess the residual variance. 
Thus, he can decide whether to try to improve 
control and, if he does, he has a measure of how 
much the new control reduces the error vari- 
ance. Hence, for the practical man, these trans- 
formations give not only better statistical tests, 
but also a notion of the limiting accuracy of 
the experiment or, alternatively, they give a 


measure of the variability in an experiment 
owing to causes that could in principle be con- 
trolled. 

To give an example or two, different subjects 
have studied a long word list 0, 10, 30 minutes. 
Words from the list are exposed in a tachisto- 
scope at three different exposure times. A 
different subject observes 20 words for each con- 
dition of training and each exposure time. The 
number of correct recognitions is recorded. The 
distribution of successes may be binomial. If 
the inverse sine transformation is used and an 
analysis of variance performed we anticipate 
the residual mean square (error variance) to 
be 821/20.5 — 40.0. If we observe a residual 
mean square of from 40 to about 80 (depending 
on the number of degrees of freedom) , we will 
be fairly satisfied with the control achieved. 
Seppose we observe 160. We begin to wonder 
whether there are not factors in the experiment 
that ought to be further controlled. Perhaps 
the subjects vary in ability to study the list and 
in visual acuity, and it would be better to use 
the same subject for all conditions, so that he 
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serves as his own control. Perhaps a replication 
within each cell would provide a better estimate 
of the base-line error, because any interaction 
between exposure time and amount of training 
would contaminate an error time. What if, far 
from observing a mean square of 40, we observe 
a small number like 10? First, look for a mis- 
take in computation. If one cannot be found 
by you or an independent computer, reconsider 
the experiment. Such a thing might happen 
in the present experiment (if it were poorly 
designed) if the same list of 20 words were al- 
ways used, and a certain few were always recog- 
nized and another set were never recognized. 
Thus the actual variation in number recognized 
would be less than anticipated because the 
words on the list were not equated for ease of 
recognition. However, it is a rather rare day 
when the observed residual variation is smaller 
than anticipated. 

As another example, nursery school children 
are observed during the second 10 minutes of a 
regular play hour for several days at the begin- 
ning, middle, and end of their first semester. 
There are several types of play programs used 
throughout the year. The observers record the 
number of acts of overt aggression against others 
made by each child during these recording 
periods. Here the number of such acts might be 
approximately Poisson distributed, especially if 
an act is defined so that an extended altercation 
by the same pair of children only counts once. 
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So it might be reasonable to try the square root 
transformation, in which case the base-line vari- 
ance is 1. If the observed residual variance 
were a good deal larger than this we would 
look around for matters other than time of year, 
program, or between children differences to 
help account for the extra variability. In this 
example, variation of scoring procedure and 
unreliability of scoring are important possible 
contributors to variance. On the side of small 
variance there is the occasional retiring child 
who commits practically no acts in the category 
under study. 

Two warnings are in order. First, the fact 
that data appear in percentage form does not 
automatically justify the assumption of binomi- 
ality, and therefore the inverse sine transforma- 
tion is not always appropriate. As examples: the 
percentage of body weight consumed at a sitting 
under free feeding, or the percent of a color 
wheel made blue to match a stimulus card 
are not cases where the variance-stabilizing 
properties would be expected on the basis of 
binomial theory. Secondly, when an analysis of 
variance is planned for binomial data, it is well 
to try to assure in advance that the sample sizes 
in the cells are always the same. If there are 
unequal numbers of opportunities for success 
in the cells a very sticky problem arises which 
we cannot treat here, but see Cochran (1943, 
pp. 287-301) . 


COMBINING TESTS OF SIGNIFICANCE 


Suppose an experimenter wants to combine 
the results of tests made on several independent 
groups of data into a single grand significance 
test for the set. “There may be several samples 
which cannot or should not be combined, per- 
haps because reported by different investigators 
or because the original data are lacking, but 
to each of which separately a test of significance 
can be applied” (Wallis, 1942, p. 229). Before 
discussing the principal techniques for combin- 
ing, we wish to forewarn the reader of the 
importance of independence of the separate 
results. Suppose a pollster asks a sample of men 
and women several different questions, but 
each question is more or less related. The 
pollster thinks the women's responses show 
more hypochondriasis than the men's on several 
of the questions. The pollster wants to pool 
results from several items to show the over-all 
significance of a sex difference. The methods 
of combining significance tests to be discussed 
here are inappropriate, because the several dif- 

ferent measures (i.e, responses), almost cer- 
tainly correlated, are being combined. If the 


2 " 


methods of this section are applied and "no 
significant difference" is concluded, fine. But 
if "significant difference" is concluded, all that 
is implied is that probably the answers to the 
several questions are correlated and we can say 
nothing about a man-woman difference. This 
pitfall is charted early and at length, because 
it has taken a large toll of the unwary in the 
past. Indeed, one of the main functions of this 
section is to warn of the misapplication of com- 
bining dependent results, 

We can combine significance tests when we ` 
have independence of the separate tests; an ex- 
ample is when we have test results for several 
groups of individuals (all different) in the 
same or possibly different laboratories. "The re- 
sults of the single-group analyses vary. We want 
to pool the results and make a single signifi- 
cance test. Two examples where combination 
is appropriate are correlation between frustra- - 
tion and aggression for each of several groups ` 
of different subjects, and t-tests between scores - 
of rural and urban children on a personality ` 
test. In fact, almost any statistic which leads | 
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to a probability statement may be combined if 
it meets the criterion of independence. 

First, a word about a method that almost any- 
one’s intuition suggests — just multiplying the 
observed values of P together and calling the 
result the significance level. Suppose we agree 
to reject if the product of the probabilities from 
two experiments is 0.05 or less, how often will 
we reject when the null hypothesis is true? The 
answer, remarkably enough, is 0.20, and not 
0.05, as common sense might have suggested. 
To see why, we need an example. Suppose P, — 
0.5 and P, — 0.1; their product is 0.05 and 
we would reject by the above rule. It is per- 
fectly true that the probability of points with 
P, = 0.5 and P, = 0.1 is 0.05. But we would 
also reject if we had P, — 0.1 and P, = 0.5, so 
there are additional points for which we would 
reject. The total probabilities of all these points 
adds up to 0.20 — a good distance from the 
desired 0.05. This intuitive procedure would 
be fine if we had easily available the probability 
distribution of such products in tabular form. 
Unfortunately we do not, but more unfortu- 
nately persons who want to do this are apt to 
overlook the fact that the distribution of prod- 
ucts of even two probabilities is far from uni- 
form. Since the appropriate distribution is not 
available, people who combine tests this way 
are not in a position to evaluate the likelihood 
of their observed result. The Fisher method 
described below properly takes care of this. 

The fundamental property back of most com- 
bining methods is that every probability value 
is equally likely if the null hypothesis is true. 
"This says that there are just as many outcomes 
with P between 0.1 and 0.2 as there are between 
0.7 and 0.8 (or any other interval of the same 
size). This implies that any transformation we 
can find that makes it easy to add up several 
results is all we need to be ready to combine 
tests. "Easy to add up” means that whatever 
final statistics we come out with, it should be 
easy to compute and have a distribution that is 
widely available in tables. 

Fisher (1938, pp. 104-106) noticed that if P 
is an observed probability level, then 

—2 log, P 
has the chisquare distribution with two de- 
grees of freedom. Now the sum of quantities 
independently distributed according to chi- 
square has again the chi-square distribution 
with degrees of freedom equal to the sum of the 
degrees of freedom of each of the component 
chi-squares. So Fisher's proposal is to combine 
significance tests by taking minus twice the sum 
of the natural logarithms of the probabilities. 
We are thus able to go to a chi-square table to 
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complete the test. The method has some in- 
tuitive justification because when we add loga- 
rithms, we are performing the same operation 
as multiplying numbers, as discussed above. 
Gordon et al. (1952) have provided an ex- 
tended table of chi-square for two degrees of 
freedom which should facilitate the computa- 
tions involved in combining probabilities from 
independent samples. The reason for the table 
is that one wants to work with values of P 
which are correct to two or three decimals and 
to have chi-square values that are correspond- 
ingly accurate. Furthermore, their table makes 
it possible to go directly to the chi-square values 
and sum these and thus avoid the intervening 
step of using logarithms altogether. Such a 
table is a great help in two ways: we avoid the 
interpolation problem and we avoid the loga- 
rithm problem. 

Others, see for example Stouffer et al. (1949, 
p. 45, footnote 15), have suggested that we 
transform the observed P-value to a standard 
(unit) normal deviate, using any of the stand- 
ard normal tables that are available. The sum 
of normal deviates is again a normal deviate 
with variance equal to the number of observa- 
tions summed. So the final statistic we refer to 
the normal table is 


sum of normal deviates 


\/number of observations 


and we note in passing that it has the nice 
additive property we want, and also is a statistic 
widely tabulated. Stouffer's method is easier to 
compute than Fisher's, because starting from 
the same basic data all we need for Stouffer's 
method is a standard normal table and the com- 
putations are minimal. 

One peculiarity of the Fisher approach may 
have occurred to the reader. It happens that 
a large set of P's, all less than 0.5, does not 
necessarily lead to a significant result — this 
defies one's intuition about what should hap- 
pen with a sufficiently large number of results. 
This result is not encountered with Stouffer's 
method. If we have many P's, ranging from 
say 0.37 to 0.49, Stouffer's method would achieve 
significance as the number of groups increased, 
ultimately giving a very small (nearly zero) 
combined probability level. For the same set of 
P's, the Fisher chisquare values would ulti- 
mately lead to a very large combined probability 
(nearly unity) . LE 

A third method of combining significance 
tests is suggested by a footnote in Gordon et al. 
(1952, p. 315). We think the suggestion flows 
from analysis of variance considerations. If 
several laboratories are involved, it may not 
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actually happen, when the null hypothesis is 
true, that the distribution of P-values is uni- 
form. Perhaps there are deviations from uni- 
formity so that there are actually laboratory- 
to-laboratory differences. If this is the case and 
we use, say, Stouffer's method, we may be un- 
derestimating the actual variance of the normal 
deviates by assigning their variance the value 
unity. The result would be that we would too 
often detect, as real differences, variations owing 
to laboratory differences rather than to sys- 
tematic trend. Now, in addition, suppose the 
null hypothesis is in fact false, then it could 
easily occur that the standard deviation of the 
transformed deviates is actually smaller than 
the value unity assigned, and we might benefit 
from increased sensitivity by using as a measure 
of variation the estimated variance from the 
observed data. The suggestion then would be 
that one transform from the observed values of 
P to normal deviates, and consider these devi- 
ates as a sample from a normal distribution 
with unknown variance. Then a t-test is per- 
formed on these deviates to determine whether 
their population mean could reasonably have 
been zero, using as degrees of freedom one less 
than the number of groups. At this writing we 
would recommend this modified Stouffer tech- 
nique, with one reservation. When there are 
very few groups (say 4 or fewer), the size of t 
required for significance is very high. In a 
sense, too much is being paid for the value of 
estimating the variance. For this reason most 
people may prefer the well-established Fisher 
approach or Stouffer's technique unmodified. 

There are three further matters that need 
serious attention. The first has to do with com- 
bining significance tests when the distribution 
of the original statistic is discrete, for example, 
combining results from several 2 X 2 tables. 
The exact significance test results in a discrete 
distribution. The sign test would be another 
example, or any of the nonparametric tests (ag- 
gravated when sample sizes are small). In such 
cases the ordinary methods of combining can 
lead to serious misinterpretations. We cannot 
go into the matter here, but we refer the reader 
to Wallis (1942) for a careful discussion. 

The second matter has to do with directional- 
ity of result, This discussion can best be fol- 
lowed with a t-table open before us. Suppose 
we have three t-tests on a measure for men 
minus a measure for women with t-values 
—0.70, 1.37, 2.23 with 10 degrees of freedom 
each. What probabilities should we combine? 
Often people are planning to use a two-tailed 
test, so they would ordinarily say that the 
probabilities associated with these three fs are 


respectively 0.50, 0.20, 0.05. They would take 
these probabilities and pool them. We feel it 
is better practice to choose a direction as posi- 
tive, obtain probabilities of getting the observed 
value or a larger one, and pool the resulting 
P's. In the present case we would have as P's 
0.75 (because of the minus sign on the original 
t), 0.10, and 0.025. After combining these P's 
we can use a two-tailed test for the grand result 
if we so desire. Notice that if we use a two- 
tailed P in the first place we are in the position 
of suggesting that t-values like —2.0 and +-2.0 
both contribute the same small P to the over-all 
result, thus enhancing its significance. This 
may cast doubt on the null hypothesis, but the 
appropriate alternative hypothesis is that these 
results are too far from one another. ‘The un- 
wary investigator proceeding in the ordinary 
way might blunder by thinking that he had 
established a significant difference in a positive 
direction. Consequently when the original test 
statistic yields P's corresponding to a two-sided 
result, we recommend that directionality be 
established and that it be decided whether an 
observed two-sided P such as 0.10 should be 
transformed into a one-sided P of 0.05 or of 0.95 
before proceeding. The chi-square test for large 
2 x 2 tables is a good example of the case in 
point (disregarding the discreteness issue) . 'The 
results yield a value of chisquare which cor- 
responds to a two-tailed test, directionality may 
have been confounded, and we may have been 
measuring heterogeneity of 2 X 2 tables rather 
than consistency in a particular direction. 

A third matter has to do with sample size. 
In combining P's up to now we have spoken as 
if each experiment to be combined was to be 
given equal weight. That is the way Fisher 
originally introduced the test. If we use Fisher's 
technique, it is not obvious how we can make 
use of weights, although a theoretical investi- 
gation might provide such information. If 
Stouffer's technique is used (unmodified) there 
seems to be an obvious way to do the weighting. 
If we assume that the transformed deviates are 
independently normally distributed with zero 
mean and unit variance (null hypothesis) , then 
the statistic 


T — wx; Lan +. -o + wx 
where the w's are the positive weights, to be 
decided upon by the investigator, and the x's 


are the transformed deviates, will have zero 
mean and variance 


Gr? — w? + we? +... 


T 


?r 


+ w,?. 


This means that z = 
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can be regarded as a normal deviate, and no 
difficulty arises from the weighting. In the 
modified Stouffer method using a t-test, it again 
is not obvious how to proceed with weighting, 
although perhaps something could be worked 
out. 

We now give an example of the equally 
weighted methods, treating the same data all 
three ways. The example is from McNemar 
and Terman (1936) as given by Gordon et al. 
(1952). Boys and girls were compared on a 
paper-and-pencil intelligence test in thirteen 
age groups. We give here the sample number, 
the one-sided probability of more extreme devi- 
ations, the two-degree of freedom chi-square 
value, and the corresponding normal deviate. 
In each sample there were about equal numbers 
of boys and girls, minimum sample size 325, 
maximum 1,829. 

(i) Using Fisher's technique, we look at a 
chi-square table with 26 degrees of freedom. 
We find that 43.8 for a one-sided test is between 
the 0.025 and 0.01 and therefore for a two-sided 


test it is between 0.05 and 0.02. (ii) Using 
Stouffer's technique, we find z — a = 2.81, 
NAE) 


which for a normal table gives a one-sided level 
of 0.0025, or a two-sided 0.005 level. (iii) Using 
the t-test technique, we get for 12 degrees of 
freedom 
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0.78 
t—-o187 — 4.17, 

which for a onesided test gives between 0.005 
and 0.0005, therefore a two-sided result be- 
tween 0.01 and 0.001. It will be noted in the 
present example that the normal and t-test ap- 
proach are in close agreement, while the chi- 
square result is not very close to the others. 
"The fact that in this one case the normal deviate 
approaches were more sensitive than the chi- 
square approach should not be construed as evi- 
dence in their favor. 


Chi-square 

with 2 de- Normal 

One-sided P — grees of freedom deviate 
1 0.159 3.6777 1.00 
2 0.100 4.6052 1.28 
3 0.021 7.7265 2.04 
4 0.386 1.9038 0.29 
5 0.405 1.8077 0.24 
6 0.054 5.8375 1.61 
zi 0.111 4.3965 1.22 
8 0.371 1.9831 0.33 
9 0.309 2.3488 0.50 
10 0.255 2.7330 0.66 
11 0.121 4.2239 17 
12 0.648 0.8677 —0.38 
13 0.429 1.6926 0.18 
43.8040 10.14 

x = 0.780 


FINAL REMARKS 


We have assembled here a collection of tech- 
nical statistical devices we have found useful 
in helping to handle problems brought to us 
by social psychologists, so no great purpose 
would be served by trying to summarize these 
somewhat unrelated techniques. It might be 
better to limit ourselves to a few general re- 
marks about statistics and mathematical devices. 

We first wish to emphasize that formal sta- 
tistics provides the investigator with tools use- 
ful in conducting thoughtful research; these 
tools are not a substitute for either thinking or 
working. A major goal for the statistical train- 
ing of students should be statistical thinking 
rather than statistical formulas, by which we 
mean specifically: thinking about (1) the con- 
ception and design of the study and what it is 
that is to be measured and why, (2) the defini- 
tions of the terms being used, and how modifi- 
cations in definition might change both the out- 
come and the interpretation of a study, (3) 
sources of variation in every part of the study, 
including such things as individual differences, 


group and race differences, environmental dif- 
ferences, instrumental or measuring errors, and 
intrinsic variation fundamental to the process 
under investigation. In no circumstances do 
we think that sophisticated analytical devices 
should replace clean design and careful execu- 
tion, unless very unusual economic considera- 
tions arise. However, it may be worth remark- 
ing that crude data collected as best the investi- 
gator could may require the most advanced 
statistical tools, Here a quotation from Wallis 
(1949) may be appropriate: 

In general, if a statistical investigation . . . is 
well planned and the data properly collected the 
interpretation will pretty well take care of itself. 
So-called “high-powered,” “refined,” or “elabo- 
rate” statistical techniques are generally called 
for when the data are crude and inadequate — 
exactly the opposite, if I may be permitted an 
obiter dictum, of what crude and inadequate 
statisticians usually think. 


Next we shall comment on the use of sig- 
nificance tests. It has been aptly said that the 
* 
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main purpose of a significance test is to inhibit 
the natural enthusiasm of the investigator. The 
test is used to give some way of deciding 
whether to believe in the reality of the observed 
effect. Having established the reality, there is 
another question—whether the size of the dif- 
ference is of any material value. In a weight- 
reducing experiment an average reduction of 
half a pound can scarcely contribute much to 
the additional longevity of men who weigh over 
an eighth of a ton, however real the decrease 
may be. Such a point has often been made, 
and it can scarcely be made too often. 

Questions often arise about the choice of sig- 
nificance level, but it cannot be expected that 
any general answer will be forthcoming. The 
development of the statistical notion of decision 
functions rather than classical significance tests 
emphasizes the point all the more. In the deci- 
sion function approach one tries to decide how 
much data is needed to reach a decision which 
in some sense is most economical, but for the 
full treatment it is important to know the value 
of correct decisions and the costs associated with 
incorrect ones. In scientific work, as far as we 
can tell, we have only the vaguest notions of the 
costs of mistakes or the value of a correct find- 
ing. Often scientific work is not intended as a 
basis for action; we merely want to know what 
to believe. How shall we assess the value? 
What metric shall we use? The answer seems 
to be “to each his own,” and aside from general 
notions provided by the operating characteris- 
tics of various techniques of deciding, there is 
little we can offer. So the choice of significance 
level seems to be a matter for the investigator. 
As for a related topic of one-sided versus two- 
sided tests, we think there are plenty of situ- 
ations in which it is appropriate to test for 
directionality, but a factor of two in the signifi- 
cance level is scarcely worth arguing about. If 
it is a question of deciding between 0.05 and 
0.10, the experiment was not ecstatically con- 
clusive anyway. 

We have earlier alluded to the fact that sig- 
nificance levels are, at best, approximate values. 
Reasons for this are obvious—the assumptions 
on which the null hypothesis computations are 
based may be somewhat in error. Independ- 
ence from trial to trial, for example, may be 
lacking, randomization may be improperly per- 
formed, or the observations may not be normally 
distributed. Such departures from the ideal 
may not produce large errors in the true sig- 
nificance level, but they can make it only ap- 
proximate. 

We wish to turn now to the question of other 
mathematical matters. Except for analysis of 


variance and multiple regression the social psy- 
chologist, as such, has not generally put great 
efforts into multivariate techniques, such as fac- 
tor analysis. Without choosing a particular 
brand of multivariate analysis to push, it still 
seems reasonable to us that this direction of 
investigation is one appropriate for the disci- 
pline as far as quantitative methods are con- 
cerned. An anecdote from the classroom of 
one of the authors sums up the attitude of a 
reasonable number of psychologists. In the 
early part of the course dealing with one- 
variable problems the complaint was registered 
by a student: “The problems of social psychol- 
ogy are so much more complicated than this. 
How can you hope to handle the many variables 
surrounding a human problem with one vari- 
able?" Some weeks later after an introduction 
to factor analysis in which two factors were 
removed from 12 variables, the same student 
said "My God, isn't there some way we can 
just reduce this all to a single number?" The 
reply was that essentially multivariate prob- 
lems are likely to have multivariate answers. 
We are not arguing here for any special kind of 
multivariate analysis, but rather for a little 
more intense investigation of the subject. "This 
is a direction that may pay off, especially if 
the analysis is built for the problem rather than 
vice versa. Consequently, social psychologists 
may wish to persuade more mathematical statis- 
ticians to explore the possibilities. 

In this chapter we have been mainly con- 
cerned with methods for testing hypotheses. 
Tests we described, as well as all other such 
tests, are based upon certain assumptions. For 
example, most applications of operating charac- 
teristics involve an assumption of normality, 
and most nonparametric techniques are based 
upon the theory of equally-likely permutations. 
The assumptions behind a particular statistical 
technique constitute a model. Whenever a psy- 
chologist makes a set of measurements on two 
groups of people he implicitly employs a rather 
general model which says that each set of meas- 
urements comes from a population and each 
population can be represented by some kind of 
a distribution function. Then he usually hy- 
pothesizes that the two populations have dif- 
ferent means or different variances or different 
somethingelse. But when he uses a particular 
statistical technique he implicitly or explicitly 
makes further assumptions about the two dis- 
tributions and so is using a more specific model. 

Testing of hypotheses is one of the main 
branches of statistics. Another is the estimation 
of parameters. Whenever a social psychologist 
draws a sample and makes inferences about the 
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population from which it was drawn, he is esti- 
mating a parameter, e.g., the population mean 
or variance. Here again there is an underlying 
model; one necessarily makes some assumptions 
about the parent population, for otherwise the 
parameter estimated has little meaning or in- 
terest. To be sure, the model may be a very 
simple one, but it is still a model. There are 
many problems in estimation, however, where 
the model is not so simple, e.g., the analysis of 
time series, 

An outstanding feature of statistical models 
—whether for testing hypotheses or estimating 
parameters—is that they are selected to suit 
each experimental problem as it arises. The 
statistician has provided the psychologist with 
a rather large collection of models to choose 
from. The experimenter looks at his data, or 
preferably a projected experiment, and tries to 
find a suitable model. Moreover, he does not 
have the slightest hesitation to use an entirely 
different model for the next experiment. In this 
sense, the use of statistical models is data- 
oriented. We next discuss a different class of 
models which are theory-oriented. 

The term behavioral model is now often used 
to refer to a somewhat different type of applica- 
tion of mathematics to psychology. The be- 
havioral model is not so much designed to help 
in the analysis of a particular set of data as in 
the construction of theory. Hull's attempt to 
formalize rote learning is an example. When 
one attempts to "explain" a large class of phe- 
nomena — to tie together a broad range of be- 
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havior—he is building a behavioral model. The 
use of mathematics in such model construction 
is not at all essential, but there is a growing 
interest in these attempts. The distinction be- 
ween behavioral and statistical models neither 
is nor needs to be a very clear one. A behavioral 
model, if it is a good one, should suggest ways 
of handling data, and a statistical model often 
provides stimulation to theoreticians. The dif- 
ference is mainly in orientation, not use. While 
the statistical model is chosen to analyze a par- 
ticular set of data, the behavioral model should 
lead to the acquiring of new kinds of data or 
at least to the selection of data which is rele- 
vant to it; the “testing” of the behavioral model 
is of primary interest. 

It is interesting to note an important differ- 
ence in the role of significance tests when such 
models are tested. Instead of searching for sig- 
nificant differences and being pleased by finding 
them, the model builder is hoping that the null 
hypothesis can be accepted, and that his model 
does fit the data. 

In closing we would like to remind the 
reader that most statistical and mathematical 
model problems have yet to be investigated, and 
that his inquiry to the nearest statistician or 
mathematician for help will not always bring a 
ready answer. Indeed, social psychology is a 
prominent source of research problems for the 
mathematical statistician and perhaps is be- 
coming such a source for the applied mathema- 
tician. 
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CHAPTER 9 


Attitude Measurement 


BERT F. GREEN 
Massachusetts Institute of Technology 


In this chapter we shall discuss methods for 
developing meaningful quantitative measures 
of the attitudes of individuals. Our purpose is 
to present the logical basis of attitude measure- 
ment in general, and to give a detailed account 
of the prominent attitude-scaling techniques. 

Since McNemar (1946) wrote his compre- 
hensive critical review of opinion-attitude meth- 
odology, some major methodological advances 
have been reported by Stouffer, Guttman, Such- 
man, and Lazarsfeld (Stouffer, et al., 1950a) in 
connection with attitude research in the U. S. 
Army. This work has greatly influenced the 
formulations in this chapter. Another impor- 
tant influence, which the author gratefully ac- 
knowledges, is W. S. Torgerson’s monograph on 
the theory and methods of scaling, being pre- 
pared for the committee on scaling of the Social 
Science Research Council. Manuscript for por- 
tions of this monograph was kindly made avail- 
able by Dr. Torgerson. Specific citations will 
not be made in this chapter. 


THE CONCEPT OF ATTITUDE 


To be able to understand how attitudes can 
be measured, we first need to examine the con- 
cept of attitude. Like many psychological vari- 
ables, attitude is a hypothetical or latent vari- 
able, rather than an immediately observable 
variable. The concept of attitude does not refer 
to any one specific act or response of an individ- 
ual, but is an abstraction from a large number 
of related acts or responses. For example, when 
we state that individual A has a less favorable 
attitude toward labor organizations than indi- 
vidual B, we mean that A’s many different 
statements and actions concerning labor organ- 
izations are consistently less favorable to labor 
than are B's comparable words and deeds. We 
are justified in using a comprehensive concept 
like attitude when the many related responses 
are consistent. That is, if people who disap- 
prove of the closed shop are also likely to want 
to outlaw strikes, and to oppose minimum wage 
laws, then it seems reasonable to speak of an 
antilabor attitude. 


In general terms, a latent variable is used to 
describe the consistency or covariation of a num- 
ber of different responses to stimuli of the same 


general class. The variable is viewed as mediat- ` 


ing the stimuli and the responses. The re- 


sponses are said to covary because they are all | 


mediated by the same hypothetical variable. 
The latent variable is useful because it unifies 
a set of data, namely, the observed responses. 
Hypothetical variables of this kind have been 
called traits, intervening variables (MacCor- 
quodale and Meehl, 1948), latent variables 
(Lazarsfeld, 1950a) , genotypes (Coombs, 1952) , 
and factors (Thurstone, 1947). The observable 
data have been called manifest variables, pheno- 
types, and indicants (Stevens, 1951). 

Many definitions of social attitudes have been 
proposed. These definitions seem to have a 
common theme that suggests the manner in 
which attitude may be viewed as a latent vari- 
able. This theme is the concept of attitude as 
a consistency among responses to a specified set 
of stimuli, or social objects. 

Consider a few examples. Krech and Crutch- 
field (1948, p. 152) view attitude as “. . . 
enduring organization of motivational, emo- 
tional, perceptual, and cognitive processes with 
respect to some aspect of the individual's 
world." An enduring organization of psycho- 
logical processes seems to imply a consistency 


of response patterns. For Doob (1948) an at: ` 


tude refers to an implicit response that is both 
anticipatory and mediating in reference to pat- 
terns of overt responses, that is evoked by a 
variety of stimulus patterns, and that is consid- 
ered socially significant in the individual's 
society. Overt responses are consistent in that 


they are mediated by the implicit response. All-— 


port (1935, p. 810) , after reviewing many early 
definitions of attitude, concluded that "an 
attitude is a mental and neural state of readiness 
exerting a directive influence upon the individ- 
‘ual’s response to all objects and situations with 
which it is related." Fuson (1942) defines an 
attitude as the probability of occurrence of a 
defined behavior in a defined situation. Camp- 
bell (1950) suggests that a social attitude is evi- 
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denced by consistency in response to social 
situations. 

It is apparent from these examples that the 
concept of attitude implies a consistency or 
predictability of responses. An attitude governs, 
or mediates, or predicts, or is evidenced by a 
variety of responses to some specified set of 
social objects or situations. Campbell (1950, 
p. 31) has summarized this view neatly in pre- 
senting an operational definition of attitude: 

| “An individual's social attitude is an [enduring] 
` syndrome of response consistency with regard to 
| [a set of] social objects.” 

This definition does not divest attitudes of 
their affective and cognitive properties, which 
may be properties of, or correlates of, the re- 

_ sponses that comprise the attitude. However, 
attention is focused on the characteristic of 
attitude that is basic to all attitude measure- 
ment: response covariation. In each measure- 
ment method, covariation among responses is 
related to the variation of an underlying vari- 
able. The latent attitude is defined by the 
correlations among responses. 

The set of social objects that forms the refer- 
ence class of an attitude distinguishes attitude 
from other psychological variables such as 
habit, temperament, drive, or intelligence. It 
is of secondary importance whether we call the 
variable an attitude, or a trait, or a habit. The 
operational definition will always be in terms 
of the referent. class of stimuli. 

The content of an attitude is determined by 
the responses which constitute it. The set of 
behaviors comprising an attitude will be called 
an attitude universe. [Guttman (19502) de- 
veloped the idea of a universe in connection 
with his scaling method; his approach seems 
applicable to the general measurement prob- 
lem.] The elements of this universe are mani- 
fest variables—the responses to specific situa- 
tions. A sample of these elements is used to 
measure the attitude; from this sample of be- 
havior an inference can be made about the 

,entire universe, For example, if a sample of 
elements is homogeneous, it is reasonable to 
infer that the universe also is homogeneous. 
Furthermore, the scores of individuals on this 
scale may be taken as representative of their 
behavior as described by the universe of ele- 
ments. On the other hand, homogeneous sub- 
sets of elements may be found within the orig- 
inal universe, implying that several dimensions 
are represented in the original attitude universe. 

Conceptually, then, attitude measurement 
involves sampling a behavior universe, and 
measuring the universe by means of the sample. 
This means that the sample of elements should 


be representative. If only a small subset of the 
total attitude universe is sampled, then infer- 
ence beyond this subset is not legitimate. Of 
course, in practice we seldom have the universe, 
but only the sample. We may characterize the 
elements of the hypothetical universe by means 
of the sample; however, it is very important not 
to overgeneralize. From a sample of verbal 
responses to questions about opinions, one 
should not make inferences about behavior 
other than verbal responses to similar verbal 
questions, It may be that responses to these 
verbal questions are correlated with responses 
in nonverbal situations, but this must be deter- 
mined experimentally. 

'The concepts of attitude universe and sub- 
universe are certainly not precise, but they are 
helpful in formulating the measurement prob- 
lem, and will be particularly useful in discussing 
validity. 

As we have defined it, an attitude is a latent 
variable, since its meaning is derived from the 
covariation of responses in some attitude uni- 
verse. To obtain a more precise definition of 
an attitude, we need a mathematical model that 
relates the responses, or observed variables, to 
the latent variable. Each psychological scaling 
method either states or implies such a model. 
The problem of measuring an attitude is one 
of selecting a scaling model by which the re- 
sponse data can be related to the attitude 
variable. 


PROPERTIES OF SCALES 


Different scaling methods have different prop- 
erties. To select a scaling method wisely, it is 
important to understand the various properties 
of scales of measurement. 

The quantification of psychological attributes 
has been the subject of much debate. Some sci- 
entists, including Ferguson, Myers, Bartlett, 
et al. (1940), Bergmann and Spence (1944), 
Johnson (1936) , have argued that much psycho- 
logical measurement is not true measurement, 
since it does not conform to the methods and 
definitions used in physical measurement. They 
point out that most physical scales have the 
property of additivity, and that this property 
may be demonstrated experimentally. An oper 
ational definition of the sum of two lengths may 
be achieved by laying two sticks end to end; 
in the case of weight, two weights and a bal- 
ance are needed. By what psychological experi- 
ment, it is asked, may two pitches be added, or 
two attitudes? 

One answer has been given by Gulliksen 
(1946). He showed that a logical extension of 
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the concept of additivity will allow scales de- 
vised by the method of paired comparisons to 
be classed as additive. In this case the difference 
between two scale values is defined operation- 
ally. The experiment consists in determining 
the proportion of trials on which stimulus i is 
judged to be greater than stimulus j with respect 
to the attribute in question. A transformation 
based on the normal distribution relates this 
observed proportion to the difference between 
the scale values of the two stimuli. If the dif- 
ferences between the scale values of all pairs of 
stimuli are mutually consistent, e.g., (8; — S;) + 
(S, — Sr) — (S, — Sz) , then the additive prop- 
erty of the scale is established. 

The argument about the properties of a scale 
of measurement has led to a broader conception 
of the nature of measurement. Stevens (1951), 
Lorge (1951), and others have pointed out that 
in its broadest sense, measurement is the assign- 
ment of numerals to objects, events, or persons, 
according to rules. With the rules properly de- 
fined, the numerals may be used to represent 
relations among the objects, and to compare 
these objects with other sets of objects similarly 
measured. The result of a measurement is a 
scale. A scale comprises the set of numerals 
given to the objects by using a certain rule of 
assignment. The objects are then said to be 
scaled. 

Objects may be measured according to many 
different rules, each of which leads to a scale 
with special properties. Stevens (1951) dis- 
tinguishes among nominal, ordinal, interval, 
and ratio scales. In a nominal scale, objects are 
placed in several mutually exclusive categories, 
but there is no necessary relation among the 
categories. The number assigned to the objects 
in a category serves as a label, but has no quan- 
titative meaning. Any other set of labels would 
be adequate if each original label is replaced 
by a different label in the new set (i.e, a one- 
to-one transformation). When objects can be 
arranged in a rank order with respect to some 
variable, they are said to form an ordinal scale. 
Numbers may be assigned to the objects if the 
numbers are in the same rank order as the ob- 
jects. (Any monotonic transformation is per- 
missible) We might obtain an ordinal scale 
of the hardness of the several solid substances 
by determining which substances scratch which 
other substances. When the intervals between 
objects on a scale can be measured, then an 
interval scale exists. An interval scale of hard- 
ness would require some experimental method 
to determine how much harder substance A 
was than substance B. We could then assign a 
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number to each substance so that the difference 
between the two values indicated the extent 
of the difference in hardness. Another set of 
numbers could be substituted for the original 
set only if there was a linear relationship (y — 
ax + b) between the two sets of numbers. A 
ratio scale results when there is some way of 
showing how many times greater one object is 
than another. Numbers are assigned to the 
objects so that the numbers have the same ratio 
as the experimentally determined ratios of the 
objects. A ratio scale implies a fixed zero point, 
so that the only admissible transformation is 
multiplication by a constant. (y — ax). Weight 
forms a ratio scale because we can show that 
four pounds is four times as heavy as one 
pound, and similarly for all pairs of weights. 

This classification is not complete, although 
it includes the main properties of scales. 
Coombs (1950, 1952) has suggested several 
scales that fall between nominal and ordinal 
scales. For example, we could have an ordered 
set of categories, with objects within a category 
being unordered. A partially ordered set of 
objects would result if some objects could not 
be ordered with respect to some other objects. 
Coombs has also suggested an “ordered metric,” 
in which the objects are ordered, and the in- 
tervals between objects on the scale are also 
ordered. 

Ordinal, interval, and ratio scales are defined 
with respect to a single dimension along which 
the objects are placed. These definitions can 
be extended to include multidimensional 
scales. It is an experimental question whether 
the relations among the objects are consistent 
with a one-dimensional continuum. For ex- 
ample, we might find an inconsistent set of dif- 
ferences, such as A — B = 4, B — C = 8, 
A — C = 5. Here, if the result is not to be 
called “error,” the quantities must be re- 
ferred to a two-dimensional scale. A, B, and C 
could be represented as points on a plane, with 
the obtained differences taken as the distances 
between the points, whereupon A, B, and C 
would be the vertices of a right triangle. For 
example, in a scale of colors, 4 and B might 
differ only in saturation, and B and C might 
differ only in hue, while A and C might differ 
in both saturation and hue. In this case, 4 and 
C would appear to be more dissimilar than 4 
and B, or B and C. Although multidimensional 
scaling will not be treated here (see Torgerson, 
1952, and Bennett, 1951), the general concept 
is basic to the various definitions of unidimen- 
sionality that will be met later. 
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DEFINITIONS 


The terms measurement, scale, attitude, and 
attitude universe have been defined above. In 
this section we give definitions for other terms 
which we shall use in special senses. 

Attitude measurement consists in assessing an 
individual's responses to a set of social objects 
or situations. This is done by observing a 
sample of behavior from an attitude universe. 
Each behavioral element in the attitude uni- 
verse is the response to a particular situation 
or object. The particular situation or object 
that evokes the response, together with a speci- 
fied set of response categories, is called an item. 
An item is a manifest variable. The statement, 


rhe government ‘should own the coal mines," 


together with two response categories, "agree" 
and “disagree,” forms an item. The same state- 
ment, with five response categories, "strongly 


agree," "agree," "neutral," "disagree," "strongly 


. disagree," is a different item. In the statistical 


analysis, the response categories to the latter 
may be combined to produce a dichotomous 
variable, but from the respondent's point of 
view, the two items require slightly different 
kinds of responses. 

A number assigned to an individual's item 


— response is an item score. In the first example, 


v 


"agree" might be scored 1, and “disagree,” 0. 


v The proportion of people who give a certain 


.natives as the positive response. 


response to an item determines the popularity 
of that response. The popularities of the vari- 
ous responses to an item are the item marginals. 
For dichotomous items, it will be convenient 
to specify arbitrarily one of the response alter- 
It will be 
sufficient to consider only the popularity of the 
positive response, and thus a single marginal 
for each item. The popularity of a positive 
response to item i will be symbolized by p;. 
We shall also use Do for the proportion of 
people responding positively to both items i 
and j, pij; for the proportion responding posi- 
tively to all three items i, j, and k, etc. 

Any collection of items will be called a ques- 
tionnaire. Other terms, such as test, inventory, 
and scale, have been used in the literature. The 
term questionnaire is free from some of the 
connotations of the other terms. A question- 
naire may contain items from different attitude 
universes, although usually only a single uni- 
verse is represented. The distinction will be 
clear from the context. 

An individual whose attitude is being meas- 
ured will be called a respondent. A number, 
derived from his item responses, that represents 


his position on the latent variable is called a 
d 
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scale score, or simply a score. The scores form 
the attitude scale. The scale implies a system 
for obtaining a score from an individual's item 
responses. The term scale will occasionally be 
used to refer to this scoring system. 


THE EVALUATION OF SCALES 


To evaluate a scale, statistical evidence is 
needed concerning the properties of the scale. 
The reliability of the scale is of primary im- 
portance. A statement should also be provided 
about the attitude universe represented by the 
scale — that is, what items might have been 
used. Evidence of correlational validity is of 
paramount interest, since the utility of the scale 
will depend on its relationship with other vari- 
ables and its ability to predict other behavior. 

Facts such as these are necessary to specify 
what is meant by the label that is attached to 
the scale, and to ensure against false interpreta- 
tions of that label. Putting a name on a scale 
creates a serious social obligation that cannot 
be fulfilled without experimental evidence con- 
cerning the properties of the scale. 

Reliability. The reliability of the scale is an 
index of the extent to which repeated measure- 
ments yield similar results. We shall not discuss 
the reliability of group statistics such as means 
and proportions, where discrepancies are due 
to sampling different individuals. Rather, our 
primary concern is with the reliability of an 
individual’s score. If chance fluctuations cause 
relatively large shifts in an individual’s score, 
then any particular determination of the score 
is practically meaningless. We would not ex- 
pect such scores to correlate highly with other 
variables. Since attitude scales are developed as 
research tools for studying the relation of atti- 
tudes to other variables, high reliability is an 
indispensable property of a scale. 

Reliability has two aspects, which Cron- 
bach (1951) has called stability and equiva- 
lence. These are new names for test-retest 
reliability and parallel-form reliability, respec- 
tively. 

If the same questionnaire is administered to 
the same group of respondents on two sepa- 
rate occasions, there will be some shifts in the 
scores of individuals. The correlation between 
these scores is the test-retest reliability or the 
coefficient of stability. Memory and familiarity 
with the scale will tend to increase stability; 


° extending the time interval between adminis- 


trations will tend to decrease stability. 

Low stabilities of attitude scales have been 
defended on the grounds that attitudes are 
dynamic and should be expected to change. 
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This may be true, but it is difficult to see how 
an attitude scale can be a useful predictive 
instrument if the scores are not stable. Fur- 
thermore, there is a considerable difference 
between consistent shifts in response syndromes 
and sporadic changes in specific response ten- 
dencies. In studies of attitude change we are 
concerned with consistent shifts of response syn- 
dromes. In this case, low stability of the score 
is expected. But it is important to know the 
reliability of the change. This can be discov- 
ered by means of equivalent scales. 

If our purpose is to measure an attitude uni- 
verse by means of a sample of items, then we 
must determine what score differences could 
be expected if a different sample of items were 
chosen. Scales based on equivalent samples of 
items from the same universe should yield 
very similar scores. The correlation between 
such “parallel” scales is called the coefficient 
of equivalence, or the parallel-form reliability. 
It measures the extent to which the scale is 
specific to the particular items used. We are 
not justified in generalizing beyond the specific 
items in the scale unless equivalence is demon- 
strated. The concept of an attitude universe 
implies that there are many items which might 
have been used in the questionnaire. If, in fact, 
an equivalent set of items cannot be produced, 
the notion of an attitude universe becomes 
meaningless. Of course, in some situations we 
are interested in a scale for a specific set of 
items, but more often we would like to gen- 
eralize to a larger class of possible items. 

Two scales derived from different samples 
of items from the same universe should be 
highly correlated. However, we would also like 
the two scales to have similar scale-score dis- 
tributions, similar patterns of inter-item 
correlations, and similar reliabilities. Such 
equivalent, or parallel, scales are best con- 
structed from matched or paired samples of 
items. If possible, items with similar response 
distributions should be paired, i.e., items with 
equal marginals. Paired items should also have 
a similar degree of relationship with the total 
scale, Parallel scales constructed in this way, 
if they have high coefficients of equivalence, 
can be used interchangeably. 

There are several methods for estimating the 
coefficient of equivalence from a single set of 
items, These methods depend on the strength 
of the interrelationships among the items, that 
is, on the homogeneity of the scale. The most 
familiar of these methods is the split-half tech- 
nique. In this technique, the items are divided 
into two subsets, either matched or random 
(odd-even), and a scale is developed for each 


subset. The correlation between these scales 
may be used with the Spearman-Brown formula 
to estimate the equivalence of the original 
scale. (Technically, the Spearman-Brown for- 
mula is applicable only when the scale score is 
a weighted or unweighted sum of the item 
scores.) Guttman (1945, 1946) has shown that 
his coefficient of reproducibility is in some 
cases a lower bound of equivalence. The Kuder- 
Richardson formulas, especially formula (20) 
which Cronbach (1951) has called coefficient 
a, have been widely used as estimates ot 
equivalence. Loevinger's (1947) coefficient of 
homogeneity is a direct index of the homogene- 
ity of the items; it does not estimate the 
reliability of the scale. A discussion of some 
of these methods, as well as a thorough treat- 
ment of reliability, may be found in Gullik- 
sen (1950b). 

Unidimensionality and homogeneity. A uni- 
dimensional scale measures a single variable. 
In general, the property of unidimensionality 
means that people with the same score have 
about the same attitude. If the scale is actually 
a reflection of more than one attitude, so that 
it acts as the combination of two or more scales, 
then the same score can represent very different 
attitude patterns. For example, a scale of eth- 
nocentrism composed of five items concerning 
attitudes toward Negroes, five items concerning 
attitudes toward Italians, and five items con- 
cerning attitudes towards the British would 
probably not be homogeneous. A median score 
might indicate median attitudes toward all three 
groups, or a strong anti-Negro attitude com- 
bined with a median attitude toward Italians 
and a strong pro-British attitude, or any other 
similar combination. Of course, if the three 
attitudes are highly correlated, then such a 
scale could be considered homogeneous and 
unidimensional, since ambiguous interpreta- 
tions of scores would be improbable. 

Operationally, unidimensionality and homo- 
geneity have essentially the same meaning. Both 
are defined in terms of the degree of covariation 
among the items. In general, if the items on a 
scale are highly interdependent, then the items 
are homogeneous and the scale is unidimen- 
sional. More specific considerations must be 
discussed in connection with particular scaling 
methods, since each method contains its own 
definition of unidimensionality. 

A. homogeneous scale is, in general, a reliable 
‘scale. In the previous section we noted that 
several methods of estimating the coefficient of 
equivalence are based on homogeneity. In these 
methods it is assumed that the degree of rela- 
tionship or covariation among the items in a 
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scale is representative of the degree of covaria- 
tion to be expected between these items and 
an equivalent set of items. However, it is 
possible for two highly correlated parallel scales 
not to be homogeneous. The scale described in 
the preceding paragraph could be highly re- 
liable yet not homogeneous. Thus, if equiva- 
lence is measured by correlating two equivalent 
scales, additional evidence of homogeneity is 
needed. 

Validity. If a scale is reliable and homo- 
geneous, it measures some variable. The prob- 
lem of validity is to discover what it measures. 
If the scale lives up to our definition of atti- 
tude, then it measures an attitude universe. 
The first problem in validation is to delimit 
and define this attitude universe. 

It is helpful to distinguish at least three kinds 
of attitude universes. Most attitude question- 
naires consist of items in which the respondent 
is asked to give some verbal response. This 
verbal response is quite often a statement of 
the respondent's opinion, or some other form 
of self-description. These attitude universes 
may be called elicited verbal attitudes. The 
opinions expressed by individuals in normal 
conversation with friends may be classed as 
elements of spontaneous verbal attitudes. A 
third class of behavior is verbal or nonverbal 
behavior directed toward an object in the refer- 
ent class. We may call the corresponding uni- 
verse an action attitude. For example, a 
restaurant manager might tell a survey inter- 
viewer that he would not serve Negroes, he 
might assure his friends that he won't serve 
Negroes, and he might refuse service to a Negro 
in his restaurant. It might also happen that 
the three responses are not so consistent. 

Responses in any of these areas may be scaled. 
Although most of our attitude scales measure 
elicited verbal attitudes, other scales are equally 
possible. With clever observational techniques, 
scales could be developed for spontaneous 
verbal attitudes and for action attitudes. 

These classes of attitude are not exhaustive. 
The indirect techniques lead to further attitude 
universes. Also, subclasses can be made within 
the three stated classes. Action attitudes are 
particularly diverse, but little measurement has 
been done in that area, so that breakdowns are 
as yet unnecessary. 

A description of the attitude universe is 
necessary in order to determine whether the 
scale measures this universe. Essentially, this 
means characterizing the items which might 
have been used in the questionnaire. Many 
investigators would say that, as a representation 
of an attitude universe, the scale is valid by 
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definition, or has face validity. Gulliksen 
(19502) has used the term intrinsic validity for 
this situation, and has suggested some experi- 
mental methods for determining whether a 
scale is intrinsically valid. One method is to 
study the judgments of a group of experts. A 
large set of items could be assembled which the 
experimenter believes to be in the same attitude 
universe as the items on the scale. Experts could 
rate these items as well as the items on the scale 
for relevance to the attitude being considered. 
Alternatively, several experts could each submit 
a questionnaire that is believed to be a measure 
of the attitude universe. These questionnaires 
could be administered and their intercorrela- 
tions investigated by factor analysis to see if a 
single common factor is exhibited by the several 
scales. Such techniques promise to give at least 
a partial solution to the plaguing problem of 
attitude scale validity. 

The second major method of determining 
what a scale measures is to find the variable 
with which it correlates. By definition, an atti- 
tude scale for one universe cannot measure 
a second universe. However, if it correlates 
highly with the second universe, then it 
may legitimately be used as a substitute for 
such a measure. In particular, an elicited 
verbal attitude scale can never be a measure of 
an action attitude, but it may be taken as an 
indicant of the action attitude if a reasonably 
high correlation exists between the two. Since 
this is often the aim of elicited verbal attitude 
scales, correlation with an action attitude is 
called its validity for the action attitude or just 
its validity. 

Unfortunately, it is difficult to determine 
whether such a correlation exists. The validity 
of elicited verbal attitudes has often been ap- 
praised by selecting two groups of people who 
are judged to have different action attitudes, 
and then demonstrating that their elicited 
verbal attitudes differ in the expected direction. 
A major difficulty of this known group valida- 
tion is that the magnitude of the relationship 
cannot be easily evaluated. Furthermore, the 
groups are seldom very well “known.” ‘The 
ideal method of determining correlational va- 
lidity would be to develop an action attitude 
scale, which might consist of a set of standard- 
ized social situations, or a behavior checklist 
which would require extensive observation of 
each individual. 

Many investigations have found that specific 
acts or action attitudes often cannot be pre- 
dicted very accurately from elicited verbal atti- 
tudes. The classic demonstration of the lack of 
correspondence of verbal and nonverbal be- 
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havior is that of LaPiere (1934). He accom- 
panied a Chinese couple on some of their 
travels, and later queried the managers of the 
restaurants, hotels, and auto camps that they 
had visited. Over 90% said they would not 
accept Chinese guests, whereas all of them had 
done so. 

Emphasis on the validity of verbal attitude 
scales for predicting action attitudes often ob- 
scures the fundamental issue in attitude re- 
search. We are interested in the relation of 
attitudes to other variables, such as socioeco- 


nomic status, education, exposure to propa- 
ganda, or other attitudes. If we wish to study 
the relation of elicited verbal attitudes to such 
variables, then we need a scale that measures 
elicited verbal attitude, If we want to investi- 
gate the correlates of spontaneous verbal atti- 
tudes, or action attitudes, then we need scales 
that measure these attitudes directly. It is only 
when we attempt to use a scale based on elicited 
verbal attitude as an indicant of action attitude 
that this question of validity is paramount. 


ITEMS AND Qu ESTIONNAIRES 


Items are the elements from which attitude 
scales are constructed. The usefulness of a 
scale, as well as the definition of the attitude 
that it measures, depends ultimately on the 
items of which it is composed. Ingenuity and 
insight are at a premium in designing items that 
will yield valid and useful scales. 

There are many types and forms of items that 
may be used to assess an attitude. Item type 
refers to the content of the item, while item 
form refers to the manner in which the item is 
related to the latent continuum. The scaling 
methods are largely indifferent to the type of 
item used, although certain requirements are 
made concerning the form of the item. The 
scaling methods usually include procedures for 
detecting and eliminating items which do not 
meet these formal requirements. 


ITEM TYPES 


A great many types of items have been used 
in attitude research. Although the scaling 
methods are usually used with opinion state- 
ments, this is not necessarily the case. Almost 
any item can be used if it has a specified set of 
possible responses. Although we shall not at- 
tempt a survey of all possible item types, we 
shall give a few examples to illustrate the range 
of item types that can be encompassed by the 
scaling techniques. 

The most widely used item type is the opinion 
statement. The respondent may be asked 
whether he agrees with the statement, or he 
may be required to indicate his opinion on a 
five-point scale from “strongly agree" to 
“strongly disagree.” Occasionally the respond- 
ent is asked which of several explicitly presented 


opinions on the issue is most nearly like his own , 


opinion. Open-ended questions may be used. 


" Here the respondent expresses his own opinion, 


and the response is later placed in one of a fixed 
number of categories. Thurstone and Chave 
(1929) suggested that opinion statements could 


be obtained for objective questionnaires by col- 
lecting the responses to open-ended opinion 
questions. 

The responses to opinion. statements are no- 
toriously sensitive to changes in the wording of 
the statement. This is a more serious problem 
in single-question opinion polling than in atti- 
tude scaling, where the use of several statements 
can partially offset the effect. However, this 
does not constitute a license for sloppy wording 
in attitude items. In a readable book, The Art 
of Asking Questions, Payne (1951) discusses the 
many pitfalls to be avoided in preparing opin- 
ion statements or questions. Payne (1950, 
1951) also discusses the problem of meaning. 
People are often willing to state opinions about 
issues that are meaningless to them. It is im- 
portant that the opinion statement be relevant 
to the respondent as well as to the attitude. 
This issue is treated in greater detail in Chap- 
ter 12. 

The use of opinion statements is tantamount 
to asking the respondent for a structured de- 
scription of his verbal attitude. In contrast, 
Rosander (1937) asked for a description of 


the respondent's action attitude. He posed sev- . 


eral hypothetical situations and described a 
particular action; the respondent indicated 
whether he would have taken that action. One 
item from a scale of attitudes toward Negroes 
is: "You are reading in a public library. A 
Negro enters and sits down beside you. You 
leave the library at once." Pace (1939) used 
similar items for general social-economic-politi- 
cal attitudes, F. H. Allport and Hanchett 
(1940) used this type of item to study war- 
producing behavior. Validities found with these 
action-oriented items indicate that they are very 
promising. 

A similar idea was used by Horowitz and 
Horowitz (1938) in studying the development 
of attitude toward the Negro in young children. 


In their "Faces" test the child was shown a 


« 
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set of pictures of white and colored children 
and was asked to point out the one he wanted 
to play with, to eat with, etc. The child could 
also be asked to rank the faces in order of pref- 
erence. Mussen (1950) reports a validity of 

52 for this test, using a criterion of sociometric 

choice at an interracial summer camp. 

The methods for the selection of OSS per- 
sonnel developed during the war by the OSS 
assessment staff (1948) could be adapted for 
use in action attitude research. Essentially the 
technique would involve establishing a series 

v^ of standardized social situations relevant to the 
attitude in question, and studying the behavior 
of the individual in these situations. One could 
presumably establish an objective scale based 
on the responses in a number of these standard 
social situations. (Kelley and Fiske, 1951, used 
these methods to study the selection of stu- 
dents in clinical psychology.) 

Action attitudes can be measured by behavior 
rating scales. The items could be the determi- 
nation of presence or absence of a certain action 
in a particular social situation. Such scales 
would be similar to a scale based on standard 
social situations. In this case, however, the 
scales would depend on raters who knew the 
individual well. The rater could be the indi- 
vidual himself or some close acquaintance. 
Great care must be taken to make the rater's 
job objective, so that reliable results can be 
expected. 

When the purpose of the attitude question- 
naire is disguised, the measurement has been 
called indirect. Campbell (1950) and Weschler 
and Bernberg (1950) have provided compre- 
hensive reviews of these indirect methods. At 
least two major categories of indirect items can 
be distinguished: items of the projective type 
and items based on the selective distortion of 
psychological processes. 

Several investigators have used items similar 
to those on the Thematic Apperception Test. 
Proshansky (1943) used ambiguous pictures 
of labor situations to measure attitude toward 
labor. The measurement was a rating made on 
the basis of the respondent's written interpreta- 
tions of the pictures. Fromme (1941) used po- 
litical cartoons with four alternative captions. 
The respondent selected the best caption for 
each cartoon, and gave reasons for his selec- 
tions. Although Fromme's analysis was quali- 
tative, an objective scale could be developed 
easily. Sanford (1950) studied worry by using 
items similar to those on the Rosenzweig Picture 
Frustration Test. For example, in a cartoon 
depicting two men, one is asking the other what 
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he is worried about. The respondent is required 
to supply the second man's response. 

Several indirect methods are based on the x 
premise that attitudes interfere with psychologi- 
cal processes such as learning, perception, and 
remembering. The selective distortion of these 
processes is a prominent phenomenon in social 
psychology. Consistency among such distor- 
tions is closer to some traditional conceptions 
of attitude than is consistency among opinions. 

Several investigators have used items based 
on the distortion of actual or assumed facts. 
Smith (1947) used statements labeled as facts; 
the respondent indicated whether he believed 
the statement to be true. Newcomb (1946) 
found that errors on an information test were 
related to the respondent's attitude. In Ham- 
mond's (1948) "error choice" technique, a 
factual question is asked, with two response 
alternatives available, each equally far from 
the truth, but in different directions. For ex- 
ample, “The number of man-days lost because 
of strikes from January to June was (1) 34.5 
million, (2) 98.6 million." Bernberg (1951) 
devised a scale of industrial morale based on 
this method of "error choice." 

The selective distortion of logical reasoning 
has been used by Thistlethwaite (1950) as a 
basis for attitude items. He paired neutral and 
“loaded” items of the same (valid or invalid) 
syllogism, The subject is asked to indicate for 
each item whether the conclusion is justified by 
the premise. An example of a "loaded" (in- 
valid) syllogism is: "Given: If production is 
important, then peaceful industrial relations 
are desirable. If production is important, then 
it is a mistake to have Negroes for foremen and 
leaders over whites. Therefore: If peaceful in- 
dustrial relations are desirable, then it is a 
mistake to have Negroes for foremen and lead- 
ers over whites.” 

Indirect attitude items may also be based on 
distortion of perception and memory. Horo- 
witz and Horowitz (1938) invented an Aussage 
Test, which might also be titled “The “Have 
you stopped beating your wife' Test." A com- 
plicated picture is presented for two or three 
seconds, after which the respondent is asked 
misleading questions such as "What is the col- 
ored man in the corner doing?" Stereotyped 
responses are often obtained concerning the 
nonexistent colored man. 

That the relation between these indirect items 
and scales based on opinion statements may be 
complex is shown by Bray's (1950) study of 
prejudice. He used Sherif's (1937) autokinetic 
situation with confederates who were Negroes 
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LATENT ATTITUDE VARIABLE 


Fic. l. Examples of monotone item operating characteristics. 
The ordinate is the expected item score (the probability of a 
positive response, for dichotomous items). 


or who were introduced to the subjects as Jews. 
The amount of conformity of subjects’ reports 
to confederates’ reports could not be directly 
predicted by scores on attitude scales composed 
of opinion statements. Rather, the interaction 
of the attitude scores and personality variables 
such as irritability, social ascendancy, and self- 
confidence were of major importance in de- 
termining the conformity of reports. 


ITEM FORM 


Each scaling method makes certain require- 
ments about the relationship of the manifest 
item to the latent attitude variable. This rela- 
tionship can best be described in terms of the 
average item score for a given scale score, that 
is to say, the regression of item score on the 
latent variable. This curve is called the item op- 
erating characteristic. (In the preceding chap- 
ter the term operating characteristic is applied 
to the curve indicating the probability of a 
type II error in a statistical test. In general, the 
term refers to a quantitative statement of the 
effects to be expected when using some pro- 
cedure or object, such as an attitude item, a 
statistical test, or a vacuum tube) For a 
dichotomous item, if the positive response is 
scored 1 and the negative response 0, then the 
average item score is equivalent to the proba- 
bility of a positive response. In this case the 
item operating characteristic gives the condi- 
tional probability of a positive response for any 
fixed value of the latent variable. Lazarsfeld 
(19502) uses the term trace line in referring to 
the item operating characteristic. 

The so-called psychometric function in psy- 
chophysics is very similar to the item operating 
characteristic, in the dichotomous case. Mosier 
(1940b) has pointed out that the psychometric 
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function can be translated into the item operat- 
ing characteristic by interchanging the roles of 
persons and items, and vice versa. The psycho- 
metric function indicates the probability that 
a person will perceive a stimulus, as a function 
of the stimulus intensity, whereas the item oper- 
ating characteristic indicates the probability 
that a person will agree with, or pass, an item, 
as a function of the person's attitude. 

There are two major forms of items: mono- ` 
tone and nonmonotone. We are using Coombs’ 
(1952) terminology; Stouffer (1950b) and Mos- 
teller (1949) use the terms cumulative and 
noncumulative. Items in the two classes may be 
distinguished by the general shape of their 
operating characteristics. Monotone items have 
monotonic characteristics. That is, as the value 
of the latent variable increases, the operating 
characteristic increases, or may remain con- 
stant, but never decreases. Some examples are 
shown in Fig. 1. 

Examples of monotone items, for a scale of 
height, are: "Are you taller than a Chevrolet?" 
or "Are you taller than 6’0’’?” Note that if an 
item is scored in the reverse direction, so that 
its operating characteristic decreases or remains 
constant, but never increases as the latent vari- 
able increases, it is still considered a monotone 
item. 

The operating characteristics of nonmono- 
tone items, on the other hand, first increase, 
attain a maximum, and then decrease, as the 
latent variable increases. Examples of such 
characteristics are shown in Fig. 2. 

Examples of nonmonotone items are: “Are 
you about as tall as a Chevrolet?" or "Are you 
between 5/9" and 6/3" tall?” We have consid- 
ered only nonmonotone items with single max- 
ima. Occasionally an item may appear to have 
two or more maxima. The “double-barreled” 
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Fic. 2. Examples of nonmonotone item operating characteris- 
tics. The ordinate is the expected item score (the probability of a 
positive response, for dichotomous items) . 


question is an example. In our first nonmono- 
tone item, if several short persons were think- 
ing of a convertible Chevrolet, we might have 
found two maxima. Such items are of little 
value except perhaps in very special circum- 
stances, and should normally be discarded or 
revised. 

The distinction between monotone and non- 
monotone items has frequently not been con- 
sidered in scaling. However, it is particularly 
important, since Thurstone’s scaling methods 
require nonmonotone items with operating 
characteristics having single maxima, while 


Guttman and Likert base their respective meth- 
ods on monotone items. 


OE UAR 


A questionnaire is simply a collection of 
items. Usually the items are all of the same 
type. If several types of items are used, each 
type should be considered separately in the 
analysis unless the items can be shown to be 
sufficiently similar. The items on a question- 
naire may or may not be of the same form, 
depending on which scaling method is to be 
used. 


ScariNc METHODS 


Six major scaling techniques will be treated 
here: the judgment methods, the method of 
summated ratings, scalogram analysis, the un- 
folding technique, latent structure analysis, 
and rating methods. In the judgment methods 
the scale is developed from judges’ ratings of 
the items. The method of summated ratings, 
scalogram analysis, the unfolding technique, 
and latent structure analysis develop scales from 
the data given by respondents, and hence may 
be called response methods, The rating meth- 
ods use judges to rate the individuals. 


JUDGMENT METHODS 


The judgment methods of attitude measure- 
ment are all variants of a technique developed 
by Thurstone (1928a, b) and reported in detail 
by Thurstone and Chave (1929). The tech- 
nique requires dichotomous, nonmonotone 
items having operating characteristics with 
single maxima. Although any type of nonmono- 


tone items can be scaled by the judgment meth- 
ods, opinion statements are usually used. Each 
Statement represents a certain degree of favor- 
ableness toward a set of social objects, events, or 
ideas. Below are some representative items from 
Droba's (1930) scale for measuring attitude 
toward war: 


All nations should disarm immediately. 

Wars are justifiable only when waged in de- 
fense of weaker nations. 

The benefits of war outweigh its attendant 
evils, 


There are two major aspects of the judgment 
methods. First, the items must be scaled; that 
is, each item must be given a scale value that 
represents its degree of favorableness toward 
the issue. Secondly, the respondents must be 
scored on the basis of their responses to the 
items, 

Scaling the items. There are several methods 
of determining the scale values of the items. If 
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Fic. 3. The unit normal deviate transformation. The 
abscissa is measured in units of one standard deviation. 


only a few items are to be scaled, the method 
of paired comparisons is appropriate (Thurs- 
tone, 1927; Guilford, 1936, Chapter VII) . Each 
of several judges (preferably 25 or more) is 
presented with every possible pair of items, and 
is asked which of the pair is more favorable 
to the issue in question. For each pair of items 
we obtain the proportion of times one state- 
ment was judged to be more favorable than 
the other. In the mathematical model it is 
assumed that the perceived differences between 
stimuli have normal distributions. Specifically, 
the model states that 


8$,—8, Bag Ae (1) 


where $, and S$; are the scale values of items # 
and j, sj—j) is the standard deviation of the 
hypothetical distribution of differences, and Xj 
is the unit normal deviate corresponding to pi, 
the proportion of times item i is judged to be 
more favorable than item j, i.e., 


*y 1 2/2 
bg u ——e t dt (3) 
Ze 
—00 v 


The relation between X; and py is illustrated 
in Fig. 3. 

In the simplest case of the model, the so-called 
Case V, all c (,— jj are assumed to be equal and 
are set equal to unity. In this case the model 


becomes 
$,— 8; = Xy (3) 


This simple model will usually suffice. 

'The data may or may not be consistent with. 
the model. For a set of scale values, we can use 
equation (3) to compute an expected set of Su, 
These values can then be compared with the 
observed data. Since there are many more ex- 
perimental data than there are scale values, 
some discrepancies between the expected Xy 


and the observed Ku are inevitable. Our prob- 
lem is to find a set of scale values for which the 
discrepancies are minimum. Mosteller (19512) 
has shown that Thurstone's method of estima- 
tion, described below, is a least-squares solution 
— the sum of squared discrepancies is mini- 
mum. 

Mosteller (1951b) has also provided a chi- 
square test of the fit of the model. Both the 
expected X and the observed X; are converted 
into proportions by means of the unit normal 
deviate transformation — equation (2). These 
proportions, labeled p” for the expected pro- 
portions and p';; for the actual data, are trans- 
formed by the arc sin transformation 


0, — arc sin \/py- (4) 


A table of this transformation is available in 
Snedecor (1946, p. 449). This gives us a set 
of 6’,; and a set of 0” which we use with equa- 
tion (5) to compute chi square: 


zx (0,—9')? 


ici 
Qm fU P 


Here n is the number of judgments on which 
each observed proportion is based, and the sum- 
mation is over all values of i and j in which i is 
less than j, a total of 14k (k — 1) terms, where 
k is the number of items. This chi-square value 
has 14(k—1) (k — 2) degrees of freedom. 


The first step in obtaining the scale values is to 
convert the observed proportions py into Zu by 
equation (2), using a table of the normal distribu- 
tion. These X, are arranged in a two-way table, or 
data matrix, with a row for each item i and a column 
for each item j, as in Table 1. 

Note that X; is logically zero, and that Xj; = 
— Xj. If any Xy is greater than --2.0, or less than 
—2.0, it should be rejected as unstable. If no values 
of X; have been rejected by this rule, the scale 
value of item i is the average of all entries in the ith 
column of the data matrix, as shown in Table 1 (b). 
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Taste |. 
PAIRED COMPARISONS COMPUTATIONS — COMPLETE MATRIX 
(4 rrEMs) 
(a) Matrix of py (b) Matrix of X; 
Item j Item j 
1 2 3 4 1 2 3 4 
1 E 58 .83 .95 1 0 20 95 1.65 
2 .42 ES .80 92 2 —.20 0 .84 1.41 
Item i 3 Br pg) mE e Item i Oca 4 0 39 
4 .05 .08 35 — 4 —L65 —1.41 —39 0 
Column sums —2.80  —2.05 1.40 3.45 
Scale values —.70 — 51 35 86 
TABLE 2. 
PAIRED COMPARISONS COMPUTATIONS — INCOMPLETE MATRIX 
(6 ITEMS) 
(a) Matrix of pj (b) Matrix of X; 
Item j Item j 
1 2 3 4 5 6 1 2 3 4 b 6 
1 — 55 35 .87 .98 1.00 1 0 13 4.67 113 — — 
2 45 — 65 .79 .95 1.00 2 —.18 0 39 81 165 — 
3 125 7:85 Get 91 9.99 3 —.67 — 39 0 67 134 — 
Item i 4 38. 3L.395 E demi 4 E pps a gr y 99 141 
5 02 .05 .09 16 — .78 5 —  —L65 —1.34 — .99 0 Gei? 
6 00° 00 Ol .08- 22 °° 6 — — — -141— 77 0 
(c) Matrix of successive differences (d) Chi-square computation 
Items 
ij 4 E v 6” 9^, — 9", 
et Ae ee Ns ey e - [TEMP T EN E d 
1'7 180 756. 48 - = 13. .755  .716 60.00 57.80 2.20 
2 49 39 42 84 = 14 ST .860 68.87 68.03 84 
8 .28 89 6T 567 43 23  .65 637 53.738 52.95 78 
Item i 4 32 ATI e 99 49 24  .79 .805 62.72 63.79 —1.07 
5 r3 31 35 99 H 25  .95 .955 77.08 77.75 — 67 
6 tá T e: 64 qm 34 45 .695 60.00 56.48 3.52 
95 91 910 72.54 72.54 00 
Column sum 86 LU 257 418 11:96 "TET 797 66.42 63.22 3.20 
Column average .22 35 A 83 65 46 .92  .981 7857 747]  —120 
$, = 0,$, = 22, Sy = 57, S, = 108g = 1.91, 56 78 .742 62.08 5947 2.56 
Sq = 2.56 
n = 100 
X (9^, — 0%) 2 = 42.95 
i<j y2 = 5.23 
d.f. = 6 
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If some X,, have been rejected, as indicated in 
Table 2 (a), chen for each pair of successive columns 
of the data matrix, we subtract the entries in one 
column from the corresponding entries in the other 
column, and record these differences in a second 
matrix, as in Table 2(b). Of course, no difference 
can be obtained whenever one of the values has been 
rejected. Since we are using successive columns, it 
may be possible to arrange the order of the columns 
so that there are very few impossible differences. 
The average of these differences for a pair of col- 
umns is the difference between the scale value for 
those two items. Thus, by arbitrarily setting $, = 
0, the remaining scale values can be determined. 

Note that both the zero point and the scale factor 
are arbitrary in the scale. We set the scale factor 
by letting all standard deviations equal unity. In 
the case of the complete matrix, the zero point was 
set by letting the average scale value be zero. In 
the case of the incomplete matrix, the zero point was 
set by letting $, = 0. Any other arbitrary values for 
the zero point and scale factor may be used. 

The application of the chi-square test is illus- 
trated in Table 2(c). 


The method of paired comparisons becomes 
unwieldy for large numbers of items, since each 
possible pair of items must be judged. For k 
items there are 1⁄(k) (k — 1) pairs; for 25 items 
there are 300 pairs. Several methods may be 
used as short cuts in this situation, including 
the methods of rank order, of equal-appearing 
intervals, and of successive intervals. 

The method of rank order, or order of merit 
method (Guilford, 1936, Chapter VIII), is sel- 
dom used. In this method each judge places the 
items in rank order of favorableness toward 
the issue in question. The scale value of an 
item is the weighted or unweighted average of 
the ranks given to the item by a group of judges. 
Alternatively, a paired comparisons analysis can 
be used, following the procedure suggested by 
Thurstone (1931). For large numbers of items, 
the method of rank order becomes unwieldy, 
since the judge must make many decisions about 
the rank order of items which are nearly equal 
in degree of favorableness. 

The method of equal-appearing intervals 
(Thurstone and Chave, 1929; Guilford, 1936, 
Chapter V) is the most widely used of the judg- 
ment methods. It is much simpler in appli- 
cation than the method of paired comparisons, 
since it requires only one judgment per item. 
Each judge is required to sort the items into a 
fixed number of categories (usually 7, 9, or 11) 
that are arranged in order of favorableness. The 
judge is usually asked to put items into cate- 
gories in such a way that the intervals between 
categories are subjectively equal. That is, it 
should appear to him that the difference in 
degree of favorableness between the statements 


D 


D 


Ku 


of any two adjacent categories is equivalent to 
the difference in favorableness between any 
other two adjacent categories. However, in this 
method there is no way of determining whether, 
in fact, the intervals are subjectively equal. The 
equality of intervals must remain as an assump- 
tion of the experimenter. It is not a particu- 
larly good assumption, since judges tend to 
make coarser discriminations in the mid-range 
of the scale than at the extremes. Hevner (1930) 
demonstrated this end effect empirically by 
constructing a scale by the method of equal- 
appearing intervals and also by paired com- 
parisons. The relation between the two sets of 
scale values was approximately linear except 
at the extremes, where the equal-appearing in- 
terval scale was relatively compressed. 

The method of successive intervals, first re- 
ported by Saffir (1937), was designed by 
Thurstone to overcome some of the difficulties 
inherent in the method of equal-appearing in- 
tervals. The method of successive intervals is 
recommended when there are too many items 
to be scaled feasibly by the method of paired 
comparisons (say 20 or more). 

Each judge is required to sort the items into 
a fixed number of categories (usually 7, 9, or 
11) spaced along a continuum of favorableness. 
The categories have a rank order on the con- 
tinuum, but they are not necessarily equally 
spaced. The items in a given category are 
simply required to be less favorable than those 
in the adjacent category to the left and more 
favorable than those in the adjacent category 
to the right. When a number of judges have 
sorted the items, we can tabulate a judgment 
distribution for each item. This distribution 
shows the proportion of judges who placed the 
item in each category. 

The mathematical model treats the categories 
as contiguous segments of the attitude con- 
tinuum, separated by category boundaries. In 
the model it is assumed that the category boun- 
daries can be located on the continuum in such 
a way that all the item judgment distributions 
are normal. The formal model is stated in 
terms of the cumulative judgment distributions. 
Let the M categories be numbered in order 
from least to most favorable. Let De be the 
proportion of judges who placed item j in cate- 
gory g or any category of lower rank order. 
Let X;, be the unit normal deviate [see equa- 
tion (2) and Fig. 3] corresponding to p; In 
the model, t, is the boundary between category 
g and category g + 1. S; is the mean of the 
judgment distribution for item j, and hence the 
scale value for item j, and o; is the standard de- 
viation of item j. The model is illustrated in 
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Fic. 4. The successive intervals model. Two item judgment 
distributions with different standard deviations are shown. The 
abscissa is the latent attitude variable, and is measured in arbi- 


trary units. $ 


Fig. 4. It states that 


t — 8; 
0; S 


(6) 


A graphical procedure for obtaining the 
values of tj, S} and c; is presented by Saffir 
(1937). Mosier (1940a) described a short cut 
that is useful when a large number of items is 
involved. A least-squares solution has been 
developed by Gulliksen (1952), but its appli- 
cation is quite complicated. 

Fortunately, it is usually adequate to assume 
that all standard deviations, c;, are equal, in 
which case they are set equal to unity. This 
model states that 


Xj, = 


Xj —Hje- Sj. (7) 


Procedures for obtaining the scale values and 
the category boundaries have been described by 
Saffir (1937), Attneave (1949), who uses the 
term "method of graded dichotomies," Garner 
and Hake (1951), who use the term "equal 
discriminability scale," and Edwards (1952). 
A least-squares method similar to these methods 
is described below. 

The method of successive intervals is based 
on the assumption that the item judgment dis- 
tributions are normal on the derived scale. It 
should be noted that this is not an assumption 
about the distribution of attitude scores of those 
who endorse the item, but rather an assumption 
about the distribution of judgments concerning 
the proper location of the item on the scale. 
This assumption must be checked in order to 
ascertain whether the model fits the data. The 
check may be accomplished graphically by plot- 
ting the observed p,, versus the estimated £, on 
normal probability paper. Straight lines should 
result, and if equal standard deviations have 
been assumed, these lines should have the same 
slope. Wide departures from these conditions 


are viewed with alarm. Edwards and Thurstone 
(1952) suggest obtaining the differences between 
the observed proportions and the corresponding 
proportions computed from the estimated 
parameters of the model. Although no statisti- 
cal criterion is presented, the authors give some 
representative average discrepancies. 


The first step in obtaining the scale values and 
category boundaries is to convert the observed pro- 
portions D, into X,, by the unit normal deviate 
transformation, using a table of the normal distri- 
bution. The X,, are arranged in a data matrix with 
a row for each item, and a column for each category. 
"This is illustrated by an artificial example with 6 
items and 5 categories in Table 3. All X,, that are 
greater than +-2.0 or less than —2.0 are rejected. If 
no values are rejected, the average value of each 
column is the category boundary represented by that 
column, while the average value of each row, when 
subtracted from the overall average of the table, 
gives the scale value of the corresponding item. 

If some values are rejected, then for adjacent col- 
umns we subtract the entries in one column from 
the corresponding entries in the other column, and 
the differences are recorded in a column of a second 
matrix. Of course, wherever a value has been re- 
jected, no difference is possible. The average of 
these differences is the difference between the cate- 
gory boundaries represented by the two columns. 
By assigning t, = 0, the remaining category boun- 
daries can be computed, as shown in Table 4. Then 
a fourth matrix is constructed by subtracting the 
computed value of t, from each entry in the gth 
row of the X,, matrix, as shown in Table 4 (d). The 
average value of a row of this matrix is the scale 
value of the corresponding item. 

Since the scale factor and zero point are arbi- 
trary, the scale factor has been set by letting all 
standard deviations equal unity. In the case of the 
complete matrix, the zero point was fixed by letting 
the average scale value be zero. |n the case of the 
incomplete matrix, the zero point was fixed by set- 
ting 4, = 0. Any other arbitrary scale factor and 
zero point may be used. 
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TABLE 3. 
SUCCESSIVE INTERVALS COMPUTATIONS — COMPLETE MATRIX 
(5 CATEGORIES, 6 ITEMS) 
(a) Matrix of p; 


Category 
1 2 3 4 5 


1 40 55 .86 -96 1.00 
2 .28 36 71 -90 1.00 
3 15 A0 65 81 1.00 
Item j 4 -16 25 Ad 82 1.00 
5 09 21 -50 ke 1.00 
6 03 08 Ku 57 1.00 
(b) Matrix of X, 
Category boundary g Row Row Scale 
1 2 3 4 sum average value 
1 — 25 13 1.08 1.75 2.71 68 — 18 
2 — 58 — 36 55 1.28 .89 22 — 82 
3 —1.04 — .52 .89 .88 — .29 — 07 — .03 
Item j 4 — .99 — .67 10 92 — 64 — 46 06 
5 —1.84 — .81 .00 64 —1.51 — 38 28 
6 —1.88 —141 — 33 18 —3.44 — 86 76 
Column sum —6.08 —3.64 1.79 5.65 —2.28 
Category boundary —1.01 — .61 20 94 — 40 
TABLE 4. 
SUCCESSIVE INTERVALS COMPUTATIONS — INCOMPLETE MATRIX 
(5 cATEGORIES, 6 ITEMS) 
(a) Matrix of pj, (b) Matrix of X,, 
Category Category boundary g 
1 2 8 4 5 1 2 5 4 
1 44 10 Ki ER 1.00 1 — 15 52 1.56 = 
2 16 45 EJ! EH 1.00 2 — 99 — 18 EI 1.48 
$. 26. 24 52. .78 100 A -1735 —.-l .05 ku 
Item j 4. 2 16. A0~ ..69 —, 1,00 Item j 4 -175 —.99 —.25 50 
5 01 08 .32 57 1.00 5 — —141 — A7 18 
6 00 02 4 36 1.00 6 - — —1.08 — .36 
(c) Matrix of differences (d) Matrix of t, — Xj, 
2-1 3-2 43 Category boundary Row Scale 
1 67 1.04 - : 1 _2_$_4_sum_valte. 
15 31 17 - É E 
A im P^ - 2 99 96 88 ARTS 
$ 175 154 1.68 1.65 662 1.66 
Item j 4 76 14 75 Itemj 4 125 182 1.98 1.92 747 187 
5 i EK 65 5 -= 224 2.20 2.24 668 2.23 
6 - - 72 6 -= - 2.81 2.78 559 2.80 
Column sum 3.33 449 3.44 
Column average .83 .90 .69 


i= 0, Ia = 83, t3 zz 1,78, tq = 2.42 
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The methods of paired comparisons and suc- 
cessive intervals yield interval scales in which 
the metric is obtained from the variability of 
judgments about the items. If items were al- 
ways judged consistently in paired comparisons, 
so that the proportions were all either zero or 
unity, the mathematical model of paired com- 
parisons could not be used. Identical sortings 
from all judges would render the method of 
successive intervals equally useless. Happily, 
judges seldom agree. There is usually sufficient 
“error” variance to allow a scale to be con- 
structed. The metric in these scales is essen- 
tially a function of the degree of confusion, or 
overlap, between items. The difference of the 
scale values of two items may be used to make 
a precise probability statement about the degree 
of overlap. The scales are based on variability, 
but it is a remarkably predictable variability. 

Scoring the respondents. When a scale of 
items has been established by one of the judg- 
ment methods, a rule for scoring the respond- 
ents must be adopted. The usual scoring 
procedure is to give each person a score which 
is the median of the scale values of the items 
which he endorses. The mean scale value could 
be used, but it is felt that the median is less 
sensitive to the particular set of scale values in 
the scale. Lorge (1939) found very little dif- 
ference between mean scores and median scores. 
By either scoring method, the respondents are 
placed on the scale of favorableness toward the 
issue being studied. 

Item selection. Usually, many more items are 
scaled than will be used on the final question- 
naire. Ambiguous or irrelevant items can be 
discarded when the items are picked for the 
final form of the scale. If the judgment distri- 
bution for an item has a large standard devia- 
tion, the item is presumed to be ambiguous, 
since it apparently means different things to 
different judges. In the extreme case of am- 
biguity, the judgment distribution might be 
spread uniformly over the categories. Items 
that are ambiguous according to this subjective 
criterion should be eliminated before the suc- 
cessive intervals scale is computed, so that the 
assumption of equal standard deviations be- 
comes tenable. The simple estimation pro- 
cedures described above can then be used. 

Irrelevant items cannot easily be detected on 
the basis of judgments. If an item is almost 
always judged to be in the middle category, in 
the methods of successive, or equal-appearing, 
intervals, then it may be irrelevant, but it may 
also be a good neutral item. Thurstone 
(Thurstone and Chave, 1929) proposed a cri- 
terion of irrelevance which can be used when 
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the questionnaire has been administered to a 
group of respondents. For each item in turn, 
the individuals who responded positively to a 
given item are selected; the popularity of each 
of the other items is determined for this group 
of individuals. 'These popularities are com- 
pared with the scale values of the items. Items 
with scale values near the given item should 
be very popular, while items far removed on 
the scale should have low popularity for this 
group. If the popularities are independent of 
the scale values, then some factor other than 
the latent attitude variable is determining the 
responses to this item. 

Specifically, the relevance of item i is deter- 
mined by plotting the ratio pj/p; versus S;. 
The divisor p; is used to correct for the different 
popularities of items in the total group. 

A similar technique can be used to get a 
rough picture of the operating characteristics 
of the items. In this case, individuals of nearly 
equal attitude scores are grouped together. The 
proportion of individuals who endorse the item 
can be determined for each group, and can be 
plotted as a function of the mean attitude score 
of the group. An acceptable operating charac- 
teristic will have a maximum near the scale 
value of the item, and will decrease toward zero 
on either side of this maximum. 

Items having unacceptable operating charac- 
teristics, or giving indications of irrelevance, 
should be rejected. Although these criteria are 
subjective, they allow the investigator to com- 
pare items, and then to choose the most relevant 
items and the items with the best operating 
characteristics, 

When the poor items have been discarded, 
the investigator may still have more items than 
he needs for the final scale. In this case, he 
should select a set of items with scale values 
spread evenly along the continuum. If particu- 
larly good discrimination is desired in some 
portion of the continuum, then relatively more 
items may be selected with scale values in that 
range. If enough items are available, parallel 
forms of the scale can be produced by using 
items with matched scale values. 

Criticisms of the judgment methods, Several 
authors have criticized the central, or neutral, 
items on Thurstone-type attitude scales for fail- 
ing to discriminate high and low groups on the 
attitude continuum. This criticism stems from 
a lack of understanding of nonmonotone 
items. An ideal item characteristic curve for a 
neutral item would have zero probability of 
response at both extremes. In fact, if a neutral 
item does discriminate between high and low 
groups, it is not a very good neutral item. 


—————Ó 
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According to Edwards (1946), items with 
scale values in the middle range often have 
higher judgment variances than other items. 
This may be due in part to the curtailed distri- 
butions of extreme items in the method of 
equal-appearing intervals. But it may also be 
due in part to ambiguity. Edwards also states 
that neutral items are more likely to be irrele- 
vant according to the criterion of relevance. 
These troubles emphasize the difficulty of writ- 
ing neutral items. No matter how hard one tries 
to avoid it, such items have a tendency to be 
“double-barreled.” 

The second major criticism is that the opin- 
ions of the judges might influence their ratings. 
Advocates of the judgment methods have de- 
nied this, stating that the only requirement for 
a judge is that he be able to read. Studies by 
Hinckley (1932), Ferguson (1935), and Pint- 
ner and Forlano (1937) all strongly support the 
contention that the attitudes of the judges do 
not affect the resulting scale. In these studies, 
the judges were separated into high and low 
groups on the basis of attitude scores. Scales 
were derived from each group of judges; the 
scale values correlated better than .95. Re- 
cently, Hovland and Sherif (1952) have criti- 
cized these studies for not using judges with 
extreme attitudes. Using Hinckley's scale of 
attitude toward Negroes, they found that judges 
with extreme attitutdes do yield quite different 
scales. Hinckley had rejected all judges who 
placed more than 25% of the statements in a 
single category, since he felt that they were 
being “careless.” Hovland and Sherif find that 
these “careless” judges have very extreme atti- 
tudes and deep involvement, and that the single 
category that they favor is usually an extreme 
category. Scales obtained from Negro judges 
and from pro-Negro white subjects were quite 
different from scales obtained from "average" 
white subjects. The authors point out that this 
result is consistent with other facts of social 
psychology. Many studies indicate that atti- 
tudes, desires, and needs may influence percep- 
tion, judgment, and other behavior. Kelley, 
Hovland, Schwartz, and Abelson (1953) have 
investigated the problem of biased judges fur- 
ther. They conclude that only in cases of ex- 
treme bias will there be a serious effect on the 
scale. They point out that Hovland and Sherif 
used the method of equal-appearing intervals, 
and that by using the method of successive 
intervals, the differences between the scales can 
be reduced substantially. The method of paired 
comparisons nearly eliminates the differences. 

It appears that we can expect the bias effect 
to be small or negligible in most cases. How- 


ever, it seems clear that the problem cannot be 
overlooked. More research is needed to deter- 
mine the extent of the bias, and the conditions 
under which it occurs. 


THE METHOD OF SUMMATED RATINGS 


The Likert-type scale. A few years after the 
Thurstone scaling methods were introduced, 
Likert (1932) published a technique for the 
measurement of attitudes which was similar to 
methods in use in the mental testing field. In 
Likert's method of summated ratings, five cate- 
gories of response are provided for each item: 
strongly approve, approve, undecided, disap- 
prove, and strongly disapprove. These cate- 
gories are scored 5, 4, 3, 2, and 1 respectively. 
If an item is cast in the negative sense, with 
strongly approve indicating an unfavorable at- 
titude, then the category scoring is reversed. 
An individual's scale score is the sum of his 
scores on the items. (Likert originally scored ` 
the response categories of an item on the basis 
of a normal curve transformation of the pro- 
portions of respondents choosing each category. 
This "sigma scoring" method was immediately 
abandoned when it was discovered that the 
simpler 5 4 3 2 1 scheme yielded scores which 
correlated better than .99 with the sigma scores.) 

The items on a Likert-type scale should have! 
operating characteristics that are Maite 
increasing functions of the latent attitude con-| 
tinuum, That is, the more favorable a person's 
attitude, the higher his expected score for the 
item would be. The items are evaluated by 
administering them to a group of respondents. 
The relations of the item score to the total score 
on the set of items is an index of the discrimi- 
nating power of the item. Items with high dis- 
criminating power are selected for the final 
form of the questionnaire. 


One widely used index of item discrimination is 
based on the differences in item scores for subgroups 
with high and low total scores, respectively. From 
the distribution of total scores, the highest 25% (or 
27% — the exact percentage is of minor importance) 
of the population is selected for the high group; the 
lowest 25% forms the low group. For each group 
the mean, M, and the variance, oi, of the item 
scores are computed for each item separately. The 
index used is usually the critical ratio: 
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where n is the number of respondents in each group. 
This index evaluates the mean difference relative to 
the item score variances. It should be noted that the 
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proper index is the critical ratio, not the probability 
of statistical significance. 

Another useful index is the correlation between 
item scores and total scores. In many ways, this in- 
dex is superior to the critical ratio, since it makes 
direct use of the scores of all respondents, rather 
than just the scores of the two subgroups. 


These procedures of item analysis are very 
similar to mental test methods, although mental 
test items are usually scored 0 or 1. Gulliksen 
(1950b) has provided thorough discussions of 
problems such as test scoring, category weight- 
ing, and item analysis that are common to 
attitude measurement and mental testing. 
Davis (1951, 1952) has reviewed a large number 
of item analysis techniques, many of which are 
applicable to Likert-type scales. 

The single common factor model. Likert did 
not provide a mathematical model for his scal- 
ing method. However, the scoring technique of 
the Likert scaling technique implies a model 
that we shall call the single common factor 
model, In this model, which is derived directly 
from factor analysis, the intercorrelations of the 
items are assumed to be due to a single common 
factor to which the items are all mutually re- 
lated. Since the latent attitude dimension is 
defined by the mutual interrelations or con- 
sistency among items, the items are assumed to 
be interrelated only by virtue of the latent fac- 
tor. With the effect of this factor held constant, 
the items are unrelated, or independent. This 
is the basic idea of both factor analysis and 
latent structure analysis. Mathematically, the 
item score is considered to be the weighted sum 
of two independent factors: the general attitude 
factor, and a factor specific to the item. 

The model carries with it a justification of 
the scoring procedure, and in this sense is im- 
plied by the scoring procedure, As the number 
of items increases, the linear correlation of 
the total score with the single common factor 
approaches unity. For 25 items, a typical value 
of this correlation would be about .90. The 
reason for this near-perfect linear correlation 
is that as we add more and more items, the 
effects of the general factor cumulate very fast, 
since it is represented in each item, while the 
effects of the unique factors cumulate more 
slowly, since these factors are assumed to be 
mutually independent, and hence tend to can- 
cel each other out, to some extent, The mathe- 
matical analysis will not be given here, but it 
parallels the discussions of weighting given by 
Wilks (1938) and by Gulliksen (1950b, Chap- 
ter 20). 

The model stated here assumes a linear re- 
gression of item score on the common factor. 


This assumption leads to a linear regression of 
total score on the common factor, and to a per- 
fect linear correlation between total score and 
factor as the number of items becomes infinite. 
Lord (1953) has pointed out that this model 
cannot be precisely correct, since it would theo- 
retically allow any item score to occur, whereas 
actually the item score must be between 1 and 5. 
However, the restriction is probably not im- 
portant. 

A different model based on the single com- 
mon factor idea has been developed recently 
by Lord (1952). He deals with dichotomous 
items, and assumes that the item operating 
characteristics are normal ogives, following 
"Tucker (1946) and others. He shows that the 
distribution of scale scores depends on the item 
characteristics and may be quite different from 
the latent variable distribution. According to 
the model, there is a perfect curvilinear corre- 
lation between scale score and latent factor, as 
the number of items becomes infinite. The re- 
gression of the latent factor on the test score 
has an ogive shape. 

One way to verify the single common factor 
model would be to perform a factor analysis 
of the item intercorrelations. Product-moment 
correlations would be used for the linear model, 
while tetrachoric correlations are appropriate 
for the ogive model. Certain precautions must 
be observed when factoring items, since the 
analysis is very sensitive to the item response 
distributions. Ferguson (1941) and Wherry 
and Gaylord (1944) have discussed this prob- 
lem for dichotomous items. The difficulty is 
less crucial for five-category items, but it must 
be taken into account. 

In practice, high item-scale correlations are 
sometimes taken as presumptive evidence of a 
single common factor. This is not necessarily 
valid evidence, since two or three equally po- 
tent factors might be represented by the items. 
An alternative test of the single factor model 
is to determine the homogeneity of the scale, 
since a homogeneous scale is a single factor 
scale. 

The two factor models presented show that 
the relation of the score distribution to the 
single common factor depends on the particular 
model used to define the factor. It can also be 
shown that the score distribution depends on 
the operating characteristics of the particular 
items in the questionnaire. The investigator 
can freely manipulate the distribution by se 
lecting the items carefully. Since the scoring 
method is insensitive to the characteristics of 
the particular items, and since it does not de- 
pend on the factor model, it has no unique 
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rational metric, and must be regarded as yield- 
ing an ordinal scale. 

There is a sense in which a Likert-type scale 
has a metric. For any given questionnaire, the 
score distribution will be fixed for the general 
population. Then a metric is possible in the 
sense that a difference between scores is related 
to the proportion of people having interme- 
diate scores. Furthermore, percentile scores can 
be used. These scores are free from the troubles 
of the raw total scores. The percentile scores 
should be constant, except for unreliability, no 
matter how the raw score distribution is ma- 
nipulated by item selection. 

Maximum variance scores. The possibility of 
weighting the items differentially instead of 
simply adding the unweighted item scores has 
been investigated by several authors. It is well 
recognized that any weighted combination of 
items is very highly correlated with any other 
weighted combination, if corresponding weights 
have the same sign, and if the number of items 
is large. Weighting becomes a problem when 
the number of items is small. Gulliksen (1950b) 
has given a comprehensive discussion of item 
weights. We shall mention only one approach 
to the problem. 

Horst (1935, 1936) , Wilks (1938) , and Edger- 
ton and Kolbe (1936) suggested a method for 
combining scores so that the resulting score 
distribution would have maximum variance, 
under the restriction that the sum of squared 
weights is unity, The same concept was used 
by Guttman (1941) to develop weights for each 
item response separately. The result is known 
mathematically as the first principal component 
of the matrix of covariances of the variables to 
be weighted. The use of this scoring method 
does not change the arbitrary nature of the 
score distribution, since it will be affected by 
item selection in the same way as the un- 
weighted score distribution. 

A practical method for obtaining approxi- 
mations to maximum variance scores is 
discussed by Niven (1953), who calls the tech- 
nique the method of reciprocal averages. 


SCALOGRAM ANALYSIS 


A new departure in attitude measurement 
was provided by Guttman (1944), who pro- 
posed a nonmetric method for scaling mono- 


tone attitude items. In a Guttman scale the. 


items have a special cumulative property. For 
example, a person who responds positively to 
the third item on the scale is almost sure to 
have responded positively to the first and sec- 
ond items. 
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Similar notions had been used in earlier 
studies, but not as a general method. One of 
the earliest examples of measurement in social 
psychology is the social distance scale of Bogar- 
dus (1925). He used a set of seven items de- 
noting varying degrees of social acceptability. 
'The acceptability, or social distance, of a na- 
tionality group was ascertained by noting 
which of the items were endorsed by a re- 
spondent. The items included admission to 
this country as a visitor only, admission to citi- 
zenship, to equal employment, admission as 
neighbors, as personal chums, and as relatives 
by marriage. Anyone willing to admit mem- 
bers of the group as neighbors would almost 
certainly be willing to indicate acceptance of 
items earlier on the list. 

A second instance of the use of cumulative 
items was reported by May and Hartshorne 
(1926). They used six tests of cheating that 
could be arranged in order from the test on 
which fewest children cheated to the test on 
which most children cheated, or the most 
tempting test. A child who would cheat on 
one of the tests would also cheat on the more 
tempting tests. 

However, the implication of these early 
studies for a general method of scaling were 
largely ignored. It was Guttman who advocated 
the use of such cumulative items as a basis for 
a formal method of scaling. His development 
of scalogram analysis stimulated widespread 
interest in the method. A recent and compre- 
hensive account of the method will be found 
in Guttman (1950b). 

The model. Although scalogram analysis is 
applicable to items with any number of re- 
sponse alternatives, the scale can most easily 
be described for dichotomous items. The basic 
idea is that the items can be arranged in an 
order so that an individual who agrees with, 
or responds positively to, any particular item 
also responds positively to all items of lower 
rank order, The rank order of the items is the 
scale of items; the scale of persons is very simi- 
lar, people being arranged in order according 
to the highest rank order of items checked, 
which is equivalent to the number of positive 
responses in a perfect scale. 

As an example of a perfect Guttman scale, 
consider a questionnaire concerning height. 
The items might be: (1) I am more than 5/0” 
tall, (2) I am more than 5/3” tall, (3) I 
am more than 5/6” tall, etc. Very little incon- 
sistency of response would be expected. The 
operating characteristic of a perfect Guttman 
item would indicate a response probability of 
zero up to a certain point on the continuum, 
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Taste 5. 


SCALOGRAM DIAGRAM FOR THREE DICHOTOMOUS ITEMS. 


++ indicates a positive response, 


I 

Individuals H 
(scale types) DI 
IV 


and from that point on a response probability 
of unity. In Table 5 is shown a diagram of a 
perfect Guttman scale of three items. In this 
scale an individual who gives the positive re- 
sponse to item 3 also gives the positive response 
to items 1 and 2. On the other hand, a person 
who gives the positive response to item 2 may or 
may not also agree with item 3; it is certain that 
he agrees with item 1. 

When the items have been arranged in order, 
it is also clear that the persons have been di- 
vided into a number of groups, or scale types, 
according to their pattern of item responses. In 
the example cited there are four typical re- 
sponse patterns: (+ + +), (— 4- 4). (—— 
+), (— — —). The nonscale response patterns 
which do not occur in a perfect scale are 
Gb — D. 44, G+, and (E — 
—). The score of an individual is determined 
by assigning the integers in order to the scale 
types, viz., 3, 2, 1, 0. In the case of dichotomous 
items, the score is equivalent to the number of 
positive responses. 

For a perfect Guttman scale, the order of the 
items is the same as the order of their populari- 
ties. Also, every 2 X 2 contingency table 
showing the relation between two items has an 
empty cell, since no one who responds nega- 
tively to the more popular item will respond 
positively to the less popular item. This is an 
important mathematical property of the per- 
fect Guttman scale for dichotomous items: if 
pi py then py — p; where py is the propor- 
tion who respond positively to both items ; 
and j. 

In the case of items with several response 
categories, the scale pattern becomes slightly 
more complex. Here each response category 
of an item is treated as a separate item; it is 
necessary to find an order for these response 
categories. The precise definition of a scale is 
given by Guttman (1950b, p. 62): "We shall 
call a set of items of common content a scale if 
a person with a higher rank than another per- 
son is just as high or higher on every item than 


— indicates a negative response. 
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the other person." Items with several response 
categories that have a logical order can be re- 
duced to dichotomous items for the analysis. 
A single item with five response alternatives 
would be replaced by four dichotomous items. 
The positive responses to these four items 
would be equated with responses in category 
l, categories 1 or 2, categories 1, 2, or 3, and 
categories 1, 2, 3, or 4, respectively. By this 
simple expedient the dichotomous model can 
be made to include the nondichotomous case 
as well. 

When a set of items satisfies the scalogram 
model, the scale is said to be unidimensional. 
The model can be extended to the multidimen- 
sional case. Bennett (1952) considered the 
conjunctive and disjunctive models for many di- 
mensions. Each individual has a position on 
each of several attitude dimensions. Each item 
has a breaking point on each dimension. In 
the conjunctive model, a person will respond 
favorably to an item only if his position is 
above the item's breaking point on all the di- 
mensions. In the disjunctive model he will 
respond positively if he is above the item's 
breaking point on any one of the dimensions. 
Bennett (1951) has also developed methods for 
ascertaining the number of dimensions neces- 
sary to account for the observed item responses. 

The scaling technique. In the practical busi- 
ness of attitude measurement, some of the non- 
scale response patterns are bound to occur. Ford 
(1950) has pointed out that the prototype of 
a cumulative set of attitude items is not “Are 
you taller than 5/0?" etc., but rather, “Are you 
taller than a table?" "Are you taller than the 
head of a pony?" "Are you taller than a good- 
sized bookcase?" We do not expect these items 
to form a perfect scalogram. No matter what 
set of ordered response patterns is selected as the 
scale types, there will be some errors. The pur- 
pose of the scaling technique is to select a set of 
scale types so that the errors will be minimized. 

In a perfect Guttman scale, an individual's 
score bears a one-to-one relation with his re- 
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sponse pattern. That is, his item responses are 
completely specified by his score. From the 
score his item responses can be perfectly repro- 
duced. For a nonperfect scale an individual 
receives the score of the scale type which is most 
nearly like his response pattern. That is, a 
score is assigned so that his response pattern is 
as nearly reproducible as possible. 

Consider a scale of five dichotomous items, 
with scale types (+ + + ++), (+++ + 
=. Dt, G+-— 7, H 
— — —), and (—— ——— ). A person with 
response pattern (+ -+—-+ +) would be 
classed in the (+ -+ -+ + +) category, and 
one error counted. The response pattern (4- 4- 
— — +) would be classified with (+ + — — 
—) and one error counted. The response pat- 
tern (4- — + — +) could be classed with 
(HHHH o (4 +4+——,0r G0 — 
— —  —) for two errors. Henry (1952) pro- 
posed that in ambiguous cases, where several 
classifications are possible with the same error, 
the dilemma could be resolved by choosing the 
most popular of the alternative patterns. Bor- 
gatta and Hays (1952) suggest using the "aver- 
age" of the alternatives. In the above examples 
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the "average" of no minuses, three minuses, 
and four minuses is presumably two minuses, so 
the proper scoring of (+ — + — +) would be 
(+ + + — —). The argument is based on the 
Lazarsfeld latent distance model (ef. infra). 
(In practice, the number of positive responses 
may be used as the score, since it correlates 
highly with the more complex scoring methods.) 

The percentage of errors of reproducing scale 
patterns is the basic measure of error. The 
quantity 1 — number of errors B 

number of item responses 
called the coefficient of reproducibility, which 
we shall symbolize by Rep. Rep is the per- 
centage of item responses which are correctly 
reproduced. The practical scaling problem is 
to arrange the items so that the resulting scale 
types yield the maximum Rep. 

Several methods have been described for se- 
lecting the scale types. Most of these methods 
are based on the extended scalogram diagram, 
which is essentially the matrix of item re- 
sponses, with a row for each individual and a 
column for each item response category. An 
extended diagram for a perfect scale of three 
dichotomous items is shown in Table 6. 


'TABLE 6. 


EXTENDED SCALOGRAM DIAGRAM FOR THREE DICHOTOMOUS ITEMS. 


X indicates that the individual responded in the 
item response category. 

-]- indicates the positive response category. 

— indicates the negative response category. 


Item response categories 
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Note that the last three columns are com- 
pletely redundant. Note also that the responses 
form a parallelogram. Procedures are described 
for rearranging the rows and also the columns, 
as well as for combining response categories 
and perhaps even reversing the scoring of some 
items, so that a response parallelogram is 
achieved with few deviations, These methods 
are not automatic but require keen judgment 
concerning the kind of sorting likely to pay off. 
Suchman (19502) describes the scalogram 
board procedure, which represents the response 
matrix by buckshot placed in small indenta- 
tions in a set of removable wooden slats. The 
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sorting is accomplished by interchanging these 
slats in their frame. Since both horizontal and 
vertical sorting is desired, two matching scalo- 
gram boards are needed. Patience and a steady 
hand are also helpful. Methods for tabulating 
the scalogram diagram on IBM equipment have 
been described by Noland (1945), Ford (1950), 
and Kahn and Bodine (1951). Paper and pen- 
cil methods have been described by Guttman 
(1947) and Marder (1952). 

None of these methods is elegant or compact. 
Each one attempts to evaluate the complete 
data matrix without the aid of summary statis- 
tics. For large numbers of respondents, the 
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task is overwhelming. It has been suggested 
that the scale be established on the basis of a 
sample of 100 cases. Since the statistical theory 
of the random fluctuation in reproducibility is 
unknown, this seems to be a dangerous extrava- 
gance. Moreover, it is difficult to deal with 
more than 10-20 items with these procedures. 


The procedure recommended by the present au- 
thor is simple. First, dichotomize all items by com- 
bining response categories or by defining additional 
items (see above). Second, order these items accord- 
ing to their popularities, with the least popular 
having the lowest rank. Goodenough (1944) and 
others have suggested this ranking procedure. 
Third, estimate Rep as follows: for each item, tabu- 
late a, the proportion of cases giving the positive 
response to that item and the negative response to 
the item of next higher rank; add these a, (with k 
items there are k — 1 ajs), divide by k, and sub- 
tract from unity. This always gives an overestimate 
of Rep, but the error is slight. A validation of this 
estimate as well as a method for improving it by 
additional tabulations has been given by Green 
(1953) . 


An alternative useful method, called H-tech- 
nique, for improving cumulative scales has been 
suggested by Stouffer, Borgatta, Hays, and 
Henry (1952). The details of their method 
will not be given here, but the basic notions 
will be outlined. All items are dichotomized 
by combining response categories. Items with 
similar popularities are grouped together; 
about four such groups are expected. The items 
within each group are combined to form a new 
contrived dichotomous item. For example, if 
there are three items in a group, a positive 
response on the contrived item might be defined 
as the occurrence of a positive response on any 
two or all three true items, while the negative 
response would be equated with a positive re- 
sponse on none or only one of the true items. 
Finally, the contrived items would be used to 
form a scalogram. (The method of the pre- 
ceding paragraph can be used from this point.) 
The authors show that reproducibility will be 
enhanced by H-technique. 

Criteria for a Guttman scale. Guttman has 
provided criteria for determining whether a 
set of items is acceptable as a scale, in his sense 
of the term. The major necessary condition is 
that the reproducibility, Rep, be at least .90. 
However, this is not a sufficient condition; ad- 
ditional criteria must also be met. 

The need for auxiliary criteria arises from 
the unfortunate mathematical properties of 
Rep. Festinger (1947) , Guttman (1950b) , and 
others have pointed out that the size of Rep 
depends on several factors. The value of Rep 
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for the scale is the average of the values of Rep 
for the items in the scale. The reproduci- 
bility of an item cannot be less than the popu- 
larity of the modal category, since the scale 
can do no worse than assign the most popular 
response in all cases. Thus the average popu- 
larity of the most popular response to each item 
is a lower bound of Rep. Items with high 
modal popularity will cause Rep to be spuri- 
ously high. 

A second difficulty is that the expected value 
of Rep, if the items are actually independent, 
is often high. This can be illustrated for 
dichotomous items. Consider four independent 
items with marginals, p, of .2, .4, .6, and .8. 
The average popularity of the modal categories 
is 1/4 (8 + .6 + .6 + 8) = .7. We can com- 
pute the expected proportion of people in each 
response category; eg, (+ + — +) has .2 X 
A X 4 X 8 = .0256 or 2.56% of the total 
population. This proportion is assigned one 
error. Proceeding in this way, we compute the 
expected Rep to be .90. A set of nine items with 
marginals .1, .2, A. .4, A .6, .7, .8, .9 has an ex- 
pected Rep of .83, assuming independence of 
items. Festinger (1947) suggests that the ex- 
pected Rep, under the assumption of inde- 
pendence, should be reported whenever an 
actual Rep is reported. Many Reps appear to 
indicate much less response consistency when 
evaluated in this way. 


The auxiliary criteria are as follows: 

l. There should be a good spread of item 
marginals, although few, if any, items should 
have marginals less than .2 or greater than .8. 

2. Each response category should have more 
nonerror than error. Otherwise the category 
does not correlate with the scale. 

3. If the items are dichotomous, at least ten 
items should be used. 

4. Errors should be random. If any particu- 
lar error occurs much more frequently than 
others, it is evidence of a nonscale type. 

If these criteria are all met, but Rep is 
less than .90, the items are called a quast 
scale, Since errors are random in a quasi- 
scale, it has one property in common with an 
acceptable Guttman scale, namely, the curvi- 
linear correlation (eta) of the scale scores with 
any outside variable is equal to the multiple 
correlation of the items with that variable. 
That is, if the scale score is held constant, the 
items are not correlated with each other or with 
any other variable. Actually this property 
seems to be the criterion for random errors in 
a Guttman scale. 


ATTITUDE MEASUREMENT 


The basic difficulty in determining whether 
a set of items forms a scale is that the Guttman 
model is not a probability model, so the theory 
of the statistical fluctuations to be expected is 
obscure, and must be based on a complex set of 
interdependent criteria. The latent distance 
model, discussed below, provides a probability 
model for a Guttman scale, and hence over- 
comes many of these difficulties. 

If a set of items does not scale, the possibility 
exists of rejecting one or two poor items, and 
then achieving a scale, Guttman is chary of this 
procedure, preferring to say that the universe is 
not scalable. However, it seems possible to 
have perfectly good items with the wrong form 
for a Guttman scale. To this author, the possi- 
bility of rejecting items seems to be a necessary 
part of any method of attitude measurement. 

Kriedt and Clark (1949) used scalogram 
analysis to select a reproducible scale. They 
complain that the method is cumbersome, due 
to the multiple criteria. They found that the 
resulting scale was less reliable than a scale 
constructed by item analysis methods from the 
same initial set of items. 

Edwards and Kilpatrick (1948a, b) have sug- 
gested a method of item selection for a Guttman 
scale that uses judges and also item analysis 
techniques. A large set of dichotomous mono- 
tone items is assembled. Judges are asked to 
sort these items into categories according to 
degree of favorableness. Note that the judge’s 
task is somewhat different than in the judg- 
ment methods where nonmonotone items are 
used. Here, the judge is probably rating ac- 
cording to the expected proportion of positive 
responses to the item. Items with a high judg- 
ment variance are rejected as ambiguous. The 
remaining items are scored 1 for a positive 
response, 0 for a negative response, and a total 
score is obtained. An item analysis is per- 
formed, as in the method of summated ratings. 
Nondiscriminating items are rejected, and the 
remainder are subjected to a scalogram analysis. 

Loevinger (1947, 1948) has developed a the- 
ory of homogeneous tests that is equivalent to 
scalogram analysis. Her definition of homo- 
geneity is identical to Guttman's concept of 
a perfect scale. Her coefficient of homogeneity, 
corresponding to Rep, is the ratio of the average 
inter-item covariance to the average covariance 
of a perfectly homogeneous set of items with 
marginals equal to those of the observed items. 
This index has some of the difficulties inherent 
in Rep, but it may prove to be superior; not 
enough is known at present to judge. 

Intensity and a zero point. Guttman, Such- 
man, and their followers have suggested a 


357 


rational method for obtaining a zero point on 
the attitude scale. Although this development 
is in terms of scalogram analysis, the method 
is applicable to any attitude scale. The basis 
for the development is that the intensity with 
which an attitude is held seems to be related 
to the attitude score by a U-shaped function, 
with people at either extreme of the attitude 
scale having intense attitudes, and people in 
the center being relatively apathetic. The atti- 
tude zero point is defined as that attitude score 
for which the intensity is minimum. That is, 
persons with an attitude score of zero don't 
care very much, one way or the other. Thus, if 
an independent measure of intensity is avail- 
able, it is possible to define the zero point of 
the attitude objectively. An alternative pro- 
cedure that may be necessary in practice is to 
define a region of indifference. This seems pref- 
erable when the intensity function does not 
have a sharply defined minimum. 

Katz (1944), Cantril (1946), and Suchman 
(1950b) have suggested methods for measuring 
intensity. An intensity question can be asked 
after each attitude question, The intensity 
question might be, “How strongly do you feel 
about this?” with alternatives very strongly, 
fairly strongly, and don’t care. Another pos- 
sibility is the question, "How sure are you 
that your opinion is right?" with alternatives 
very sure, fairly sure, and not sure. Still an- 
other possibility is, “How much does this ques- 
tion mean to you personally?”  Self-rating 
scales have been used, such as a thermometer 
on which the respondent indicates the intensity 
of his attitude. It has been suggested that if 
Likert-type alternatives have been allowed for 
each item, then one can lump strongly agree 
and agree as the positive response, the other 
categories being the negative response. Also, 
strongly agree and strongly disagree can be 
lumped as strong intensity, approve and disap- 
prove as medium intensity, and undecided as 
low intensity. Since the attitude and intensity 
variables thus defined are not independent, it 
would seem better to ask two separate ques- 
tions. Postman and Zimmerman (1945) found 
that decision time was correlated with inten- 
sity. They asked the subject to respond to each 
item on a ten-point scale from strongly agree 
to strongly disagree. Decision times were high- 
est for items rated in the center of the scale. 

, When intensity questions are available, inten- 
sity may be scaled, Suchman suggests that the 
scalogram method of scaling be used for in- 
tensity, although he finds that intensity is 
usually a quasi-scale. 

The definition of a zero point on an ordinal 
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or interval scale is an arbitrary matter. In gen- 
eral, one point is logically as good or as bad as 
another. It seems reasonable, however, to re- 
quire that the zero point should not depend on 
the particular set of items used. The definition 
of a zero point as the attitude score of minimum 
intensity, if adopted, would be a convention. 
Any justification must be on empirical grounds. 

If it is desired to dichotomize the population 
into pro and con on an issue, the split might 
well be made at the region of minimum inten- 
sity. It is possible that one group will behave 
quite differently from the other. Suchman 
(1950b) has shown that two scales with biased 
selections of items can be compared by equating 
regions of indifference. This is good evidence 
of the utility of the intensity convention for a 
zero point. 

Metric and components. 'To provide a metric 
for his scale, Guttman (1941, 1950c) proposed 
a criterion of scoring so that persons giving a 
certain response to an item would have scores 
as similar as possible among themselves and as 
different as possible from persons giving other 
responses to the item. This criterion is mathe- 
matically equivalent to the criterion for maxi- 
mum variance scores discussed above. 

Maximum variance scores do not play a ma- 
jor role in scalogram analysis, since the simple 
ranks of response categories are adequate for 
most purposes. However, the mathematical 
analysis is interesting, since it leads to several 
sets of scores, all of which satisfy the equations. 
These sets of scores are orthogonal, i.e., uncor- 
related. The first set of scores has maximum 
variance. The scores in the second set have 
maximum variance under the restriction that 
they are uncorrelated with the first set. Scores 
in the third set have maximum variance under 
the restriction that they are uncorrelated with 
the first and second sets, etc. Mathematically, 
these sets of scores are known as principal com- 
ponents. It turns out that the first. principal 
component has the largest variance, the second 
principal component has the second largest vari- 
ance, etc. If there are M possible scale scores, 
i.e., M perfect scale types, then there are M prin- 
cipal components. "The existence of these com- 
ponents is a straightforward mathematical 
property of the items and the mathematical 
problem of maximizing the score variance. 

Guttman has shown that, for a perfect scale, 
if the values of the various sets of scores are 
plotted as functions of the original scale scores, 
the first principal component is a monotonic 
function of the scale scores. For this reason it 
can be used as a metric for the system. More- 
over, the second principal component is U- 
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shaped, having one inflection point; the third ^ 
principal component has two inflection points; — 
the fourth has three inflection points, etc. 

Since the second component and the inten- - 
sity function are both U-shaped, Guttman has 
equated the two functions, saying that intensity 
is the second principal component. The identi- 
tification is logical only if it is taken to mean 
that intensity is related to the scales in roughly 
the same way as the second component. The 
similarity of the two functions seems to be quite 
accidental; intensity is not in any way more 
fundamental or meaningful because of it. Gutt- 
man believes that it will be possible to find 
other attributes of attitude which behave in 
roughly the same way as the third, fourth, and 
higher components. He has already identified 
the third component as "closure" (Guttman, 
1950d) . 

The present author agrees that it is profitable 
to investigate the correlates of attitude. There 
may be many interesting characteristics of atti- 
tude, such as clarity, specificity, intensity, clos- 
ure, involution, etc. Perhaps some of these 
characteristics will have several inflectioi 
points, and hence correspond with one of the 
higher components. If so, the correspondence 
will be entirely fortuitous. Whether or not such 
functions can be demonstrated reliably, and - 
whether they will increase our understanding 
of attitude, remains to be seen. 


THE UNFOLDING TECHNIQUE 


Coombs (1950, 1952) has presented a scaling 
method in the spirit of scalogram analysis, but - 
dealing with a different situation. The Te 
spondent is asked to rank the items in order. 
of preference — or in order of agreement from ' 
strongly agree to strongly disagree. In the 
model, each item is viewed as having a 4 
position on the latent dimension, while each. 
respondent is likewise located on the dime 
sion. The respondent's order of preference 
determined by the relative distance of the item 
from his location. For example, in the situation 
depicted in Fig. 5, the rank order reported b 
respondent I would be BCAD; respondent UJ 
BCDA; respondent III, CBDA. Since directio 
is ignored, it is as if each individual had folde 
the scale over, using his location as a hin 
From a set of these rank orders, one can ded 
the order of the items on the continuum, 4 
thus unfold the continuum, as it were. 

It is also possible to deduce something abo! 
the relative magnitudes of intervals betw 
items. Note that for each pair of items, the ite 
preferred by a respondent depends on the lo 
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ACAD DC BD c D 


Fic. 5. The model of the unfolding technique. The letters 
represent items, the Roman numerals represent respondents. The 
triangles indicate the mid-points of intervals between items. 


tion of the respondent in relation to the mid- 
point of the interval between the items on the 
scale, Thus, respondent I prefers A to D be- 
cause he is on the A side of the AD mid-point. 
Thus, the preference orders tell us something 
about these mid-points. In our case, we know 
that AD is to the left of BC. This implies that 
the interval between A and B is larger than 
the interval between C and D. If respondent II 
had reported CBAD, we would have had to 
place mid-point AD to the right of mid-point 
BC. In this case, the interval between A and B 
would have been smaller than the interval be- 
tween C and D. 

Practical methods of analyzing such a set of 
data have not yet been worked out. One major 
difficulty is how to treat errors. As in the case 
of scalogram analysis, we must first decide which 
response patterns to accept and which to class 
as error. 

In the case of attitude measurement, it would 
seem that an additional piece of information 
would be helpful. The respondent could be 
asked to indicate which statements were more 
pro than his position, and which were more 
con. 'This would certainly help in the unfold- 
ing process. 


LATENT STRUCTURE ANALYSIS 


Lazarsfeld (1950a, b) has recently reformu- 
lated the theory of attitude measurement. His 
latent structure theory is a general mathemati- 
cal model that helps to clarify the problem of 
attitude measurement. Latent class analysis and 
the latent distance model are two special cases 
of the general model that will be discussed in 
detail. 

The general latent structure model, We have 
defined an attitude operationally as a syndrome 
of response consistency. Response consistency 
implies that the manifest items used to assess 
the attitude are mutually intercorrelated. In 
fact, the interrelationships among items define 
the attitude. The attitude variable must get 
its meaning from these consistencies. In design- 
ing a mathematical model for the latent attitude 


variable, we shall simply reverse this reasoning, 
and say that the items are intercorrelated be- 
cause of the latent variable. Since each item 
is related to the same underlying variable, the 
items must be related to each other. Moreover, 
this is the only reason they are related to each 
other. The latent variable accounts for all of 
the manifest interrelationships. If the latent 
variable were held constant, or partialled out, 
the items would be independent. This is the 
basic idea in latent structure analysis. 

Although the latent structure model is ap- 
plicable to any type of manifest variable, our 
discussion will be simplified if we consider only 
dichotomous items. The operating characteris- 
tics of these items can be represented in terms 
of the probability of a positive response. Let x 
represent the latent attitude variable. Then 
let f, (x) be the operating characteristic of item 
i, as a function of x. (Lazarsfeld's term is “trace 
line.”) For any fixed xo, f;(xo) gives the 
probability of a positive response to item i. Now 
let ¢(x) be the distribution of scores on the 
latent variable. For any fixed interval about x, 
(x, x + dx), é(x)dx gives the proportion of 
people having latent attitude scores in that 
interval; ie, (x) is a probability density 
function. Then the proportion of people who 
have attitude scores in an interval about x and 
who respond positively to item i is fix)o(x) 
dx. To find the total proportion who respond 
positively to item i, we must integrate over all 
possible values of x: 


+ oo 
pi | flx)o(x)dx. 
y 


Now consider the proportion who respond 
positively to both items ¿ and j. The basic 

sumption of latent structure analysis is that 
de items are independent if x is held constant. 
But f,(x) and f;(x) are the probabilities of 
response for a fixed x. Thus, for any x, the 
probability of a positive response to both items 
i and j is the product f,(x) f;(x)- Then the 
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proportion of people who have an attitude score 
in an interval about x and who respond posi- 
tively to items i and j, is fi(x)f;(x)p(x)ax. Fi- 
nally, 


+ e 
gu f fio e (dx. 


In a similar manner, we can obtain 


+ oo 
iat f fa (x) fy G9 fa GO (dx. 


The complete system of these equations, for p, 
Dip Dino Pii Etc., completely describes the item 
interrelationships. 

Very little can be done with the model with- 
out making further assumptions about the 
operating characteristics, or the attitude distri- 
bution, or both. By making such assumptions, 
several special models may be derived. The 
single common factor models presented in the 
discussion of Likert's scaling method are ex- 
amples of special latent structure models. 

The similarity of latent structure analysis 
and factor analysis has been demonstrated by 
Green (1952). Thurstone’s (1947) multiple 
factor analysis can be derived as a special case 
of the general latent structure model, extended 
to continuous manifest variables. 

Latent class analysis. One of the important 
models stemming from the general theory is 
latent class analysis. The basic notion of this 
model is that the interrelationships among the 
manifest items reflect the existence of two or 
more distinct subpopulations which make up 
the total population of respondents. Within 
each subpopulation, or latent class, it is as- 
sumed that the items are independent. How- 
ever, the proportions of people who respond 
positively to each item will vary from class to 
class. It is this variation of item marginals 
which makes for correlation between items. 

As an example, consider two items concern- 
ing government ownership of industry: 


l. Should the government own the public 
utilities? 
2. Should the government control the coal 
mines? 
For some population of respondents we might 
get the following fourfold table of proportions, 
indicating a positive correlation between the 
two items: 
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Item 1 
+ - Total 
+ 24 23 47 
]tem2 — 17 36 53 
Total 4l 59 100 


Suppose now that the population is made up 
of a group of prosocialists and a group of anti- 
socialists. It is not claimed that the classifica- 
tion is clear-cut and abrupt, but rather that 
two latent classes with these general tendencies 
can be found. Then the table for the total 
population might be broken down as follows: 


Prosocialists 
= Total 
+ 20 12 32 
- 3 8 
Total 25 15 40 
E 
Antisocialists 
- Total 
ay 4 il 15 
- 1 33 45 
Total 16 44 60 
Total 
- Total 
+ 24 23 47 
=æ 17 36 53 
Total 41 59 100 


Within each latent class, the items are inde- 
pendent. Knowing the response to one item 
does not help to predict the response to the 
other. For the total group, however, there is a 
positive relationship between the items. The 
existence of the two latent classes is responsible 
for this correlation. 

Within each class there is still considerable 
lack of response determination. Presumably 
there are several other factors operating which 
lead a certain proportion of the people in the 
class to give each response. However, since the 
items are independent, the factors influencing 
the split on one item must be different from the 
factors influencing the split on the other item. 
For, if there were any factor tending to lea 
to a positive response on both items, this would 
lead to a correlation between the items for that 
class. 

In the mathematical model, the attitude dis 
tribution is represented by the parameters "w 
the proportion of people in class s. The item 
operating characteristics become the parameters 
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Fic. 6. Representative item operating characteristics in the latent dis- 
tance scale. The three parameters (a; Bp and x; are shown for the 


solid characteristic in the figure. 


vi, the probability of a positive response to item 
i for class s. We then have directly that 
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Methods for estimating the latent parameters 
n, and vj, have been developed by Lazarsfeld 
and Dudman (Chapter 3 in Lazarsfeld, et al., 
1950b) , Gibson (Chapter 6 in Lazarsfeld, et al., 
1950b) , and Green (1951a, b). The procedures 
become quite complicated if more than two 
or three classes are involved. Anderson (1953) 
has developed the sampling theory of the esti- 
mation procedure devised by Lazarsfeld and 
Dudman. 

The latent class model implies a nominal 
attitude scale, with each individual being 
placed in one of several latent classes. Bayes’ 
rule can be used to compute the probability 
that an individual belongs to a particular latent 
class, on the basis of his item responses. The 
classification can be made according to the 
maximum probability. However, this classi- 
fication rule may not lead to classes with the 
appropriate parameters. If a set of data fits the 
latent class model, then we know that the data 
behave as if there were a number of latent 
classes with the stated properties. However, we 
do not know whether the population of re- 
spondents can be subdivided into classes having 
these properties. This logical difficulty is in- 
herent in the model. 


Many psychologists prefer to consider atti- 
tude as a continuous variable rather than a 
discrete variable having only a few values. It 
is, of course, possible to consider the discrete 
class model as an approximation to the con- 
tinuous model. 

There are several unsolved problems in latent 
class analysis, including item selection and sta- 
tistical tests of goodness of fit. Also, there have 
been very few attempts to use the model. Rossi 
(1951) and Green (1951b) applied the model 
to problems of social stratification. Until more 
empirical evidence has accumulated, it is im- 
possible to evaluate the utility of the class 
model. 

The latent distance scale. The latent struc- 
ture model that provides a probability model 
for scalogram analysis is called the latent dis- 
tance scale, Although the latent distance scale 
can be written as a special case of the latent 
class model, its relation to the Guttman scale 
will be clearer if we treat it as a continuous 
model. 

For convenience, the range of the attitude 
variable is restricted to the interval 0 =x = 1, 
and the attitude distribution ¢(x) is assumed 
to be uniform over that interval. That is, 
(x) = 1. Each item operating characteristic 
is assumed to have some low constant value, a, 
up to some breaking point, x, and to have a 
high constant probability, ĝ;, thereafter. Ex- 
amples of such trace lines are given in Fig. 6. 
A simpler model can be obtained by assuming 
Bı = 1 — a. Methods for estimating the 
parameters x, and a, have been developed by 
Lazarsfeld and Hays (1952) . 

The perfect Guttman scale is obtained by 
setting 2, equal to zero (and 6, equal to unity). 
However, the utility of the latent distance 
model is that an imperfect Guttman scale can 
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be represented equally well. Not only can the 
error in each item be evaluated relative to the 
total scale, but the model enables us to com- 
pute the scale from summary statistics. We 
can determine the parameters x, and a, from 
the data in the form of the sets of proportions 
p, and py 

Since the latent distance scale is based on a 
probability model, the nonscale types of scalo- 
gram analysis are not troublesome — every 
response pattern has a certain probability of 
occurring. In addition, the many criteria for a 
Guttman scale, including the requirement of 
90% reproducibility, become unnecessary. For 
the latent distance scale, we merely wish to 
know whether the model fits the data within 
the limits of statistical fluctuation. An arc sin 
chi-square test similar to that used in the meth- 
od of paired comparisons would be appropriate. 


RATING METHODS 


Some investigators attempt to measure atti- 
tude by asking a judge to rate the attitude of 
each individual. In some cases, a single over-all 
rating is made. Alternatively, the judge rates 
the individual on several specific items, from 
which a composite measure is obtained. 

The information on which a rating is based 
may have several sources. Krech and Crutch- 
field (1948) list seven possible sources: non- 
verbal behavior toward the object, verbal 
behavior relating to the object, secondary ex- 
pressive cues, responses in clinical-type in- 
terviews, personal documents, responses on 
projective tests, and immediate experience. If 
the ratings are to be made in a uniform manner, 
it is essential that the same kind of information 
be available for each individual. The use of a 
standardized interview would help to ensure 
uniformity of information. Reliance on hap- 
hazard observation is more likely to increase 
error variance than to lead to an insightful 
rating. 

A single over-all rating can be obtained in 
several ways. Each individual can be rated on 
a discrete scale, such as a ten-point scale. In a 
variant of this technique, the position of an 
individual is indicated on a continuous line 
segment that represents the attitude continuum. 
Verbal descriptions attached to points on the 
scale help to ensure the consistency of ratings. 
An over-all rating can be obtained from tho 
respondent himself. Riker (1944) found cor- 
relations of about .65 between self-ratings and 
typical Thurstone scales. 

If several ratings can be obtained for each 
individual, any of the judgment methods can 
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be used to get a scale of individuals. Here the 
individuals are treated as items to be sorted and 
compared. The use of a judgment method re- 
quires either a large number of judges, or a 
large number of repeated ratings by a single 
judge. Neither of these alternatives is practical 
in many situations. 

One objection to single, over-all ratings is 
that the rater may not be using the same criteria 
for all individuals. We might find, for ex- 
ample, that As attitude toward the political 
status of Negroes was being compared with B's 
attitude toward the economic status of Negroes. 
This difficulty can be overcome by dividing the 
total situation into several specific components, 
and obtaining a rating of each individual on 
each component. The components then be- 
come items that can be scaled by one of the 
judgment or response methods. For example, 
rather than rate on the basis of a general im- 
pression obtained from a TAT, one could use 
specific items, such as: "Does the protocol indi- 
cate aggressive tendencies toward Jews?" Ata 
more specific level, one could use items such 
as: "Was adultery mentioned or implied in any 
of the stories?" Behavior rating scales are of 
the same general nature. The use of specific 
items of this sort gives the experimenter some 
control over the criteria of the judges, and helps 
to ensure uniform treatment of all respondents. 

The forced-choice technique (Sisson, 1948; 
Travers, 1951) has recently been proposed as 
a method for systematic rating. On the basis 
of item analysis data, pairs of items are chosen 
which have equal popularity but different dis- 
criminating power. Two such pairs are com- 
bined in a tetrad. The rater is required to say 
which item in the tetrad is most like the in- 
dividual and which is least like the individual. 
Gordon (1951) gives this example from a per- 
sonality questionnaire: (a) a good social mixer, 
(b) gives in readily to other people's wishes, 
(c) thorough in any work undertaken, (d) 
worries about possible misfortune. "The major 
difficulty with this technique is that the four 
items in a tetrad may not be comparable. Of 
course, if all items refer to the same underlying 
dimension, this criticism is not valid. 

The forced-choice technique is one attempt 
to overcome the halo effect. It is well known 
that some sets of specific items are not much 
better than an over-all rating scale, since all 
the items can be rated on the basis of a general 
impression of the individual. By forcing a de- 
cision between items that would otherwise both 
be checked, the effect of halo can be reduced. 
Whenever possible, however, items should be 
constructed which are sufficiently objective SÉ 
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that halo becomes a minor factor, In this case, 
the forced-choice technique would not be par- 
ticularly useful. 

The rating methods introduce a new aspect 
of reliability, since different judges will un- 
doubtedly give different ratings. We can de- 
termine stability for one judge, as well as 
equivalence and homogeneity, if specific items 
are used, and we can also determine rater re- 
liability by correlating the ratings of different 
judges. (In the mental test literature, the prob- 
lem is called reader reliability; see Gulliksen 
1950b.) To determine rater reliability, at least 
two judges must rate each individual. If rater 
reliability is found to be high, then in future 
studies it is reasonable to allow different raters 
to rate different individuals. Ebel (1951) has 
presented a method for determining the aver- 
age rater reliability if several raters are used. 


COMPARISON OF SCALING METHODS 


Scale criteria. Each scaling method is based 
on a model which relates the data to a latent 
attitude continuum. Procedures are incorpo- 
rated for checking the agreement of the data 
and the model, 

"There are two sets of data to be evaluated in 
the judgment methods. In the first place, the 
scale of items depends on a model for the judg- 
ments about the items. In the method of suc- 
cessive intervals, the judgment distributions 
must be normal on the derived scale. In the 
method of paired comparisons, the difference 
in scale values should agree with the proportion 
of times one of each pair of items is preferred. 
Secondly, the criterion of relevance is used to 
check the agreement of the item responses with 
the assumptions about the item operating char- 
acteristics. 

The method of summated ratings requires a 
factor analysis of the items to determine 
whether a single common factor accounts for 
most of the inter-item correlation. Item-test 
correlations are sometimes used in lieu of a 
factor analysis. An alternative check is to de- 
termine the homogeneity of the items. 

Scalogram analysis includes several criteria 
that a scale must satisfy. "The reproducibility 
must be at least .90, the item marginals must 
have a large range but must not include ex- 
treme values, each response category must have 
more nonerror than error, and errors must be 
random. 

In latent structure analysis, the items must 
be independent (uncorrelated) for individuals 
with the same attitude score. No practical sta- 
tistical test is available to test this hypothesis. 
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It is apparent from this summary that most 
of the scale criteria are subjective. The deci- 
sion depends heavily on the judgment of the 
experimenter. It is doubtful that the decision 
could ever be completely automatic, but more 
objective criteria should be provided for the 
experimenter. Since, in theory, all the scaling 
methods can be subsumed under latent struc- 
ture analysis, a practical statistical test of the 
latent structure hypothesis could be used with 
any method. 

Unidimensionality. All the scaling models 
considered here, with the exception of latent 
class analysis, are unidimensional models. The 
data are referred to a single underlying dimen- 
sion. If the data fit the model, the scale is 
unidimensional by definition. If the data do 
not fit the model, then either a scale does not 
exist, or a multidimensional scale is necessary. 
Unidimensionality has no meaning apart from 
the context of a particular scaling model. 

One might think that latent structure theory 
has solved the problem of unidimensionality in 
general, by requiring the items to be inde- 
pendent for a fixed score on the latent attribute. 
However, we have seen that the general latent 
structure theory is completely abstract, Addi- 
tional assumptions must be made about the 
operating characteristics or the attitude distri- 
bution in order to make use of the model. 
Whether or not a set of data is unidimensional 
will depend on what assumptions are made. 
By assuming sufficiently complex item charac- 
teristics, any set of item response data can be 
referred to a single continuum. Dimensionality 
cannot be divorced from the particular model. 

There is an interesting difference between 
the judgment methods and the response meth- 
ods concerning unidimensionality. In the re- 
sponse methods, we must have a unidimensional 
set of items. The response to each item must 
be a function of the single latent variable, plus 
some factor unique to the item. But in the 
judgment methods, the items can be multidi- 
mensional stimuli that are judged with respect 
to a single dimension. Perfect consistent 
judgments of brightness can be obtained for 
colors which differ in saturation and hue as 
well as brightness. This implies that in some 
circumstances we could get two different uni- 
dimensional scales from the same set of items, 
using the judgment methods. This apparent 
paradox is not a logical inconsistency; in fact, 
it is a useful property of the judgment methods. 

Items. The judgment methods require non- 
monotone items, while scalogram analysis re- 
quires monotone items. Mosteller (1949) has 
shown that this is not a fundamental dif- 
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ference. He presented a modified scalogram 
technique for nonmonotone items, and sug- 
gested that it would be equally possible to 
modify the judgment methods so that they 
could handle monotone items. 

Typically, the judgment methods and the 
method of summated ratings deal with large 
numbers of items, while scalogram analysis sel- 
dom employs more than 10 or 15 items. The 
single common factor model implies that a large 
number of items is desirable with the method 
of summated ratings, in order that the specific 
item factors will be greatly outweighed by the 
common factor. Scalogram analysis permits 
very little error in any item, and hence appears 
to need fewer items. This is a happy circum- 
stance, since it is very difficult to find the almost 
perfect item required for a Guttman scale. The 
importance of the single factor theory is its 
demonstration that a large number of mediocre 
items can be just as good as a small number of 
precise items, 

The scaling methods include techniques for 
discarding items which, for one reason or an- 
other, do not meet the requirements of the 
scale, This is a perfectly logical procedure, but 
it poses some problems for the description of 
the attitude. universe. We must not only in- 
clude all the possible items similar to those 
retained in the questionnaire, but we must also 
exclude all items similar to those rejected from 
the scale. In other words, by rejecting items 
we may restrict or change the nature of the 
variable being measured. This consideration 
does not hold, of course, for items which are 
rejected simply because they are of the wrong 
form for some particular scaling method. 

Metric. The methods of paired comparisons 
and successive intervals appear to yield interval 
scales, as does the method of equal-appearing 
intervals if we make the assumption that equal- 
appearing intervals are equal. In a narrow 
sense, the method of summated ratings can be 
said to provide an interval scale, although it 
has interval properties only in terms of the 
population attitude distribution. The Gutt- 
man scale, as well as the latent distance scale, 
is primarily an ordinal scale, although a metric 
can be provided by the maximum variance 
method. Latent classes form a nominal scale. 

An interval scale should have the property 
that the score distribution is invariant except 
for a linear transformation, with respect to the 
particular set of items used. It should not be 
possible to manipulate the score distribution 
by selecting items with special characteristics. 
According to this criterion probably none of 
the scales has equal intervals. 


In theory, the judgment scales should be in- 
variant with respect to item parameters. If a 
biased set of items is used, the number of items 
endorsed by an individual may be changed 
markedly, but his scale position should not be 
affected. Actually, it is doubtful that the in- 
variance could be demonstrated. The scale of 
items could probably be manipulated by a 
biased selection of items to be judged. Studies 
of the anchoring effect, summarized by Mc- 
Garvey (1943) , suggest that the distribution of 
judgments is influenced by the set of items, 
However, this is not likely to alter the final 
scale appreciably, since the scale depends on 
the relative placement of items. That is, in the 
method of successive intervals, a biased selec- 
tion of items could result in large shifts in the 
positions of the category boundaries, but the 
relative shifts of item scale value would prob- 
ably be small. Thus we shall not be far amiss in 
treating the judgment scales as interval scales. 

Not so with the method of summated ratings, 
or the maximum variance scales. It is well 
known that the distribution of scores on these 
scales is very sensitive to the parameters of the 
particular items, such as item-total score corre- 
lation and item popularity. Skewed, flat, or 
normal distributions can be obtained by ap- 
propriate item selection. The units of the scale 
have no meaning apart from the particular set 
of items, and the associated score distribution. 

Although the distribution of scores on a 
Guttman scale is of little importance in most 
cases, it is equally susceptible to manipulation. 
Smith (1950) pointed out that the successive 
item popularities correspond to the cumulative 
score distribution for a perfect scale. 

Experimental comparisons, A number of 
studies have compared the various scaling 
methods. Hevner (1930) compared the meth- 
ods of rank order, equal-appearing intervals, 
and paired comparisons. In general, she found 
very close linear relationships among scales 
constructed by the three methods. However, 
she found an end effect in the equal-appearing 
interval scale, both extremes of the scale being 
compressed, Saffir (1937) compared the methods 
of rank order, successive intervals, and paired 
comparisons. He found linear relationships 
throughout. 

Several authors have attempted to compare 
the judgment methods and the method of sum- 
mated ratings. Likert, Roslow, and Murphy 
(1934) used the five-category response with a 
Thurstone scale. After eliminating the neutral 
items, they were able to score the Thurstone 
scale by the Likert method of summated rat- 
ings. They found this score to be more reliable 
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than the Thurstone-type score, although the 
two scores were highly correlated. The items 
originally used by Thurstone and Chave in 
constructing their scale of attitude toward the 
church were employed by Edwards and Ken- 
ney (1946) to compare the Thurstone and 
Likert techniques. They constructed two 
equivalent 20-item Thurstone scales, which cor- 
related .88, and one 25-item Likert scale that 
had a reliability of .94. The Likert scale corre- 
lated .74 and .92, respectively, with the two 
Thurstone scales. Only five items selected for 
the Likert scale were also selected for the 
Thurstone scales, although this restriction was 
not imposed. Eysenck and Crown (1949) con- 
structed a 24-item scale of anti-Semitism by the 
method of equal-appearing intervals. They 
scored the scale in the usual Thurstone way, 
obtaining a reliability of .83. If the scale was 
scored by the Likert method, the reliability was 
.90. They also tried a “scale product” score 
that was obtained by multiplying the item scale 
value by the response score (I, 2, 3, 4, 5) and 
summing over all items. This hybrid score had 
a reliability of .94. 

These authors seem to have been unaware of 
the logically incompatible requirements of the 
two methods concerning item operating charac- 
teristics. Theoretically, a nonmonotone type of 
item could not be used on a Likert-type scale, 
and a Likert-type item could not be used on a 
Thurstone scale. In practice, Thurstone items 
with scale values near the extremes tend to have 
monotonic operating characteristics. Thus if 
“troublesome” items with scale values near the 
center are rejected, the two methods are com- 
patible in practice. Nevertheless, these studies 
cannot be regarded as evidence of the compara- 
bility of Thurstone and Likert scales. The 
scale product score is logically inconsistent with 
the nonmonotone item philosophy. 
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Clark and Kriedt (1948) compared scalo- 
gram analysis with the Likert approach. They 
used the Rundquist-Sletto 22-item scale of 
economic conservatism, a Likert-type scale. 
They found marked unreliability in the manner 
in which response categories were combined. 
The reproducibilities for two samples were .79 
and .80, although the odd-even reliabilities were 
.85 and .88, and the median item-total score 
correlations were .58 and .65. Thus there seems 
to be evidence of a strong common factor, but 
the scale does not meet Guttman's require- 
ments. Eysenck and Crown (1949), in their 
study of an anti-Semitism scale, compared the 
Likert scoring of the scale with a scalogram 
analysis. They computed a factor analysis of 
item tetrachoric correlations, and found a 
strong common factor, with a very weak second 
factor. The scalogram analysis, however, re- 
sulted in a reproducibility of only .85. Item 
reproducibilities were highly related to com- 
mon factor loadings with a curvilinear correla- 
tion (eta) of .84. The authors asserted that 
the common factor rather than the reproduci- 
bility is the proper criterion of scalability. 

Niven (1953) has compared scalogram analy- 
sis with the method of reciprocal averages, the 
latter yielding an approximation to maximum 
variance scores. From 29 items he selected 10 
for the Guttman scale and 12 for the reciprocal 
averages scale, using objective criteria. The 
two selected scales had 9 items in common. 
He found a correlation of .93 between the two 
scales in the original sample, and a correlation 
of .89 in a cross-validation sample. This result 
is consistent with the mathematical analysis 
that demonstrates that the maximum variance 
scores, or first principal component, is a mono- 
tonic function of the scalogram scores for a 
perfect Guttman scale. 


CONCLUSION 


It is clear that there are several procedures 
available to the investigator who wishes to 
measure attitudes. The judgment methods, the 
method of summated ratings, scalogram analy- 
sis, and the rating methods are all useful tech- 
niques. At present the latent structure models 
and the unfolding technique must be classed as 
scaling models rather than as scaling methods, 


since satisfactory practical methods of appli- , 


cation are not currently available. (It seems 
likely that straightforward procedures for ap- 
plying the latent class model and the latent 
distance model may soon be available.) 

There is still a great deal to be learned about 


the scaling methods. Currently, an investigator 
who must choose one of the methods can find 
very little information on which to base his 
choice. More experimental and analytical com- 
parisons of the methods are needed. Latent 
structure analysis seems to provide a very prom- 
ising framework for future development of 
scaling techniques. Multidimensional scaling 
methods also seem promising for attitude meas- 
urement. 

In this forest of methods, the investigator 
should not lose sight of the fact that a scale is 
made from items. The initial and basic prob- 
lem of attitude measurement is to assemble a 


D 
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set of carefully worded, insightful items that 
cover the area in question. A hodgepodge of 
carelessly contrived items all thrown into the 
hopper of any scaling technique will invariably 
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Systematic Observational Techniques 
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The psychologists of Wundt's day were among 
the first to be concerned with the relationship 
between the subjective reaction and report of 
the human organism and the stimulus situation. 
In the psychophysical experiments which dom- 
inated the last century, the physical character- 
istics of the stimulus were known by independ- 
ent techniques. "The task was to discover their 
correlates in experience. In these cases, the sub- 
ject did not serve the function of determining 
"what is out there." 

The student of behavior moved from this con- 
cern to an interest in more complex behavior 
and to aspects of the organism other than sen- 
sory function. Paralleling this shift in interest 
has been a change in the role of at least some of 
the human organisms in the experimental proc- 
ess. They became the measuring instruments, 
analogous to those independent techniques 
which the earlier experimenter used to deter- 
mine the physical characteristics of the stimulus 
situation. As the scientist became interested in 
problems such as social interaction in a play- 
ground situation or the way in which a collec- 
tion of people solved a common problem, he 
found it necessary to pay attention to variables 
for which no physical measuring instruments 
were available. If the development of sympa- 
thetic behavior is an important concern, how 
can sympathetic behavior be recorded and meas- 
ured? It is in this connection that the human 
observer emerges as a measuring instrument. 
Not only does he define the stimulus situation, 
but he also records the responses. 

This use of the human being as a source of 
information not about himself but about some- 
one or something else is now widespread, There 


A Brier History OF 


The informal history of the social observer 
goes back to the beginning of time. The 
novelist and the poet have long been concerned 
with the precise description of social behavior 
and with deriving propositions about motives 
and feelings from observed behavior. The av- 
erage citizen in Nis day-to-day interactions makes 


is hardly an experiment on group process or a 
study of social behavior which does not call 
heavily upon his services. In addition, the 
variety of situations in which he is used has 
increased considerably and now ranges from 
carefully controlled laboratory experiments to 
diverse field situations. 

While it is by no means a new approach to 
the problem of the human observer, it is useful 
to keep in mind the essential identity between 
the human observer and the thermometer or 
other instrument for measuring the stimulus sit- 
uation, If we do this we shall see the problems 
of calibration, of agreement, and of validity as 
essentially the same as in the case of other meas- 
uring instruments. While there are undoubt- 
edly some unique problems in the case of the 
human measuring instrument, the objective is 
the same as with any other instrument, namely, 
accurate and reliable information, 

Although the use of the human observer has 
become extensive, there is a relative paucity of 
information concerning factors which affect the 
accuracy and reliability of observer data. 
Strictly methodological studies involving the 
use of the human observer are rare. Many of 
the methodological problems have been left un- 
touched largely because most of the experiment- 
ers who have used the human instrument have 
been concerned with using him effectively in 
their particular situation. No one has made it 
his concern to study the scope and limits of the 
human observer per se. As a result, the present 
chapter offers tentative conclusions largely 
based on the experiences of experimenters 
rather than on the results of precise methodo- 
logical investigation. 


Onserver METHODS 


precise observations, especially in areas which 
are of critical importance to him. The inse 
cure employee makes inferences concerning the 
feelings of his boss and these inferences are 
based upon precise descriptions of the way in 
which the boss opens the door and closes it, his 
facial expressions, and so on. For the socially 
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effective individual these observations and in- 
ferences from them are, by and large, shared. 
When they become notoriously inaccurate or 
generally unshared we regard the individual as 
a deviant. Part of the learning task of every 
child involves accurately noting and interpret- 
ing complex social cues. Observation of chil- 
dren indicates that the very small child learns 
rather quickly the significance of some very 
subtle cues. In fact, part of our theorizing 
about child development, especially in the area 
of parentchild relationships, is based on the 
proposition that the child responds in terms of 
valid inferences he makes concerning the mo- 
tives and feelings of those around him. If 
we accept this general argument, the exten- 
sive use of the trained observer is primarily an 
application in the scientific area of a general 
skill which most humans have to some degree. 
The application of this ability to the pursuit 
of knowledge imposes some refinements and 
some more rigorous requirements, but no new 
or occult skill. 

'The modern, or scientific, history of the hu- 
man observer, as far as the social sciences are 
concerned, probably begins with such research- 
ers as Bühler, Stern, Dorothy Swaine Thomas, 
Florence Goodenough, and Olson. The origins 
of the trained observer of behavior in medicine 
go back much further, of course. The impetus 
for quantification of observations was first ap- 
parent in the work of Thomas and her associ- 
ates (Thomas, 1933; Thomas, Loomis, and Ar- 
rington, 1933). As a group, these early investi- 
gators used the human observer in certain ways 


OCCASIONS FOR THE 


We have specified the general condition un- 
der which observers have been found to be use- 
ful, namely, when other measurement devices 
are unavailable or inappropriate. We shall turn 
in this section to a more precise description of 
the circumstances in which the human measur- 
ing instrument is necessary or useful. 

When other techniques of measurement 
would disrupt the process of social interaction. 
It might be perfectly possible to ask the par- 
ticipants in a group discussion at various points 
during the meeting how satisfied they are with 
the behavior of the leader. However, this pro- 
cedure might disrupt the process under study 
in two ways. The first would be particularly 
important when the investigator was concerned 
with the occurrence or nonoccurrence of cer- 
tain events later on in the group discussion. 


The fact that the group is stopped and has to 
resume may have an effect on the time at which 


which might be contrasted with much of the 
more recent use. The early emphasis tended to 
be on exhaustive observational categories. The 
major focus was on the formal, rather than func- 
tional, character of behavior. The categories 
involved little interpretation on the part of the 
observers. These experimenters were primarily 
concerned with physical acts such as pushing, 
touching. Another central concern, particu- 
larly in the case of the Thomas group, was with 
reliability as measured by agreement between 
observers. This concern with reliability led to 
progressive modification of the categories of de- 
scription until the inter-rater agreement was at 
a maximum. As a result, the categories of ob- 
servation became more and more removed from 
any theoretical basis and their predictive effi- 
ciency was minimal. Recent studies using the 
observer differ from the pioneering efforts as, 
more and more, the theory underlying the re- 
search, rather than reliability considerations, 
has come to dictate what is to be observed. The 
movement has been away from exhaustive re- 
cording of all the behavior which occurs toward 
the selection of particular aspects of behavior in 
order to answer specific questions. This trend 
has resulted in the development of different ob- 
server systems, depending on the purpose of 
the experimenter and the framework he is using. 
The newer systems also differ from the earlier 
ones in that they more frequently require the 
observer to make inferences concerning motives 
and feelings from the behavior. We shall dis- 
cuss in a later section examples of observer sys- 
tems which exemplify these trends. 


USE or OBSERVERS 


these events occur, and might, in certain cir- 
cumstances, prevent their occurrence. Here the 
effect of the interruption is not on the variable 
under observation but on some other dimension 
of group process. The second kind of disrup- 
tion concerns whether or not the measurement 
itself produces changes in the variable with 
which the investigator is concerned. The meas- 
uring instrument calls the attention of group 
members to a particular aspect of their be- 
havior or their reactions. This particular char- 
acteristic may, as a result, become salient for 
them, and the knowledge of the experimenter's 
interest in a particular area may have an influ- 
ence on the reactions which they later report. 


"This would be particularly true when the meas- 


uring instrument calls attention to aspects of 
behavior which might lack social approval. In 
these cases the members of the group might in- 
hibit such behavior. Consequently, the inter- 
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ruption for measurement produces a change in 
the variable being measured. 

When the actor himself is inadequate as a 
direct source of information. At the risk of 
oversimplification, we may suggest that there 
are basically only two ways in which we investi- 
gate human behavior. We watch the human 
organism or we ask him questions. There are 
a variety of ways in which we ask questions, 
some of them quite direct, as in a self-rating 
scale, and some of them very indirect, as ina 
projective test. Sometimes the individual scores 
himself, as with an attitude scale; or his per- 
formance is scored by someone else, as with a 
projective protocol. There are, however, a va- 
riety of circumstances in which asking the actor 
is unsatisfactory, no matter what the method. 

One of the most obvious of these is when the 
score or measure in which the experimenter is 
interested cannot be derived from individual 
scores, An illustration of this type of score is 
rate of interaction. In this case the experi- 
menter might be concerned with the pacing of 
the meeting and use as indices the extent to 
which there is a delay between the participation 
of one member and the next participation, or 
the frequency with which more than one per- 
son is speaking. 

Using the actor as an informant may also be 
inadequate because the phenomenon with 
which the experimenter is concerned is ephe- 
meral. If it is embedded in a prolonged and 
complex series of interactions it is likely that 
the memory of the informant is sorely strained 
or totally inadequate for the task. It may be a 
temporally brief event likely to be overlooked 
by the group member who is intent on other 
objectives. 

The actor as a source of information fre- 


Tue Two Basic OBSERVER SYSTEMS: 


THE TWO OBSERVER TASKS 


There are essentially two tasks which can be 
given to the observer. He can be asked to clas- 
sify the behavior he sees into categories or he 
can be asked to assign some numerical index to 
that behavior. Fundamentally, these two tasks 
are probably the same in the sense that when 
the observer assigns a number to a behavior, 
he is putting that behavior in a class of be- 
haviors which have that number. In the ideal 
rating scale, productive of maximum reliability, 
there is a clear definition of the dimension of 
observation and a specification of the proper- 
ties which behavior must have in order to be 
appropriately assigned a number on the scale. 
It is as though subcategory definitions and illus- 
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quently falls short because he lacks the sensi- 
tivity or the vocabulary which the particular 
observation requires. Most category systems— 
indeed, most theoretical systems—which are be- 
ing applied to complex social interaction situa- 
tions call attention to dimensions which are 
not the ones in which the typical group member 
reports social interaction. Typically, when one 
asks an untrained observer what went on at a 
meeting, he reports in terms of the content of 
the discussion. To make the actor useful as a 
source of information on some of the less ob- 
vious dimensions of group process he must be 
sensitized to the existence of these dimensions. 
The circumstances may be such as to make it 
unlikely that a valid report would be made by 
the actor. In a study of seventy-two conference 
groups in business and industry, the vast ma- 
jority of the participants indicated that they 
were highly satisfied with the leader's perform- 
ance. Since in most cases the leader was also 
the superior in the organization to the mem- 
bers, there is reason to suspect that there were 
pressures against reporting dissatisfaction with 
the leader. A variant of this would be the case 
where the experimenter was concerned with the 
incidence of behavior which was unconsciously 
motivated. "There are situations in which per- 
sonality theory would predict that the actor's 
report of his own behavior would be distorted. 
Similar distortion might occur in the actor's 
report of the behavior of other people toward 
him. There is ample evidence that perception 
is selective and in part determined by person- 
ality needs. When the investigator is inter- 
ested in behavior which might trigger off anxi- 
ety responses in the participants, it is hardly 
likely that their description of this behavior 
will correspond to what actually occurred. 


CATEGORY SETS AND RATING SCALES 


trations are erected perpendicular to the scale, 
defining the characteristics of behaviors to 
which each scale number should be applied. 
When we apply a category system to the same 
task, we decide the zero point; that is, we decide 
the conditions which must be met for the act to 
be classified in that category. If an act does not 
meet the conditions, we classify it into some 
other category; in the simplest case this other 
category includes all behaviors not suitably clas- 
sified in the critical category. Typically, the 
system is more complicated than this, providing 
other categories into which the behavior might 
conceivably be placed. Usually, these other cate 
gories are qualitatively different from cach 
other. They are on different dimensions and 
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have no quantifiable relationship to each other. 
When, however, they do have a quantifiable 
relationship to each other, as with a continuous 
category system described below, we have met 
the conditions of the ideal rating scale. 

Thus, while it may appear that rating and 
classifying are quite different, they are, in a 
fundamental sense, the same. Both rating and 
categorizing require careful specification of the 
dimension or category. If discriminations 
within a category are necessary or desirable, and 
these discriminations should be numerically 
ordered with respect to each other, the rating 
scale is one solution; a continuous category 
system is another. If it is necessary or desirable 
to discriminate among residual behaviors, a set 
of rating scale items or a category system with 
qualitative differences between categories are 
alternative solutions. When an act can be rated 
simultaneously on several scales we approxi- 
mate the condition of a category system which 
permits multiple coding. This is the case where 
the categories in the set are not mutually ex- 
clusive and hence have no logical necessary 
qualitative or quantitative relationship to each 
other. 

Carter and his associates made a careful study 
of the relationship between ratings and category 
scores (Carter et al., 1951a). After his observers 
had coded a set of behaviors, they were asked to 
rate, on a seven-point scale, each of the partici- 
pants on fifteen personal traits and four group- 
oriented traits. The ratings were correlated 
with scores arrived at by combining the cate- 
gories which might be included in the global 
trait characteristic. Thus, for example, the rat- 
ing on authoritarianism was correlated with the 
total incidence in the following categories: dis- 
agrees or is skeptical, argues with others, vigor- 
ously argues with others, deflates others, gives 
bald commands or prohibits (in disagreeable 
fashion). The correlations of five trait ratings 
with appropriate combined category scores are 
all positive and range from .30 to .86. The 
agreements tend to be highest where the relia- 
bilities of the two types of scores are highest. 
The data Carter reports support the contention 
that if the two variables are assessed with high 
reliability there will probably be a strong re- 
lationship between ratings of behavior and ap- 
propriate category scores. 

In spite of the essential similarity of categor- 
izing and rating tasks for observers and the fact 
that under certain circumstances the rating job 
and the categorizing jobs can be identical, there 
are considerations which have led experiment- 
ers to assign one rather than the other of these 


tasks. 
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The usual assumption in the case of category 
systems is that all behaviors classified in a given 
category have equal weight. Thus, an expres- 
sion of gratification, a joke, and laughter are 
each entered into the category of Tension Re- 
leasing. They are also assumed to have as much 
weight as any entry into any other category. 
However, the experimenter might feel, or his 
theory might dictate, that while all of these be- 
haviors had the same properties, they do differ 
in significance. He could take this into account 
by asking the observers to enter the category 
more than once for the more significant behav- 
iors. As indicated above, the problem could 
also be handled by a category system along a 
single dimension whose categories had pre- 
scribed numerical relationship to each other. 
When observations are necessary along a num- 
ber of dimensions it is often too time-consum- 
ing for the observer to use a continuous cate- 
gory system along each one. The alternative 
is to ask the observer to assign a numerical 
weight to each of the behaviors, as in a rating 
scale. 

It sometimes happens that the behavior oc- 
curs with such rapidity that it is impossible to 
note each occurrence. It may be desirable un- 
der these conditions to have the observer rate a 
large unit of interaction along the prescribed 
dimension, rather than have frequency of oc- 
currence be the score, as would be the case with 
a category system. 

By and large, observer ratings have been used 
primarily because they are easier to obtain and 
more aspects of social interaction can be cov- 
ered by a single observer. This is particularly 
true when large units of interaction are given a 
single rating. Observer ratings have, however, 
all of the general deficiencies of ratings. Most 
serious are those which result from statistical 
manipulations involving assumptions which the 
rating data cannot meet. Rating scale data ap- 
pear more likely to seduce the experimenter 
into using statistical indices demanding unwar- 
ranted assumptions than are category system 
data. 

Finally, rating scales are more often useful in 
the early stages of research and theory develop- 
ment in a particular area. It is sometimes 
easier to obtain better agreement among ob- 
servers with ratings when the dimension of ob- 
servation is not precisely defined, and the cues 
are multiple and not clearly set down, than 


*when category systems are used. This lack of 


precision is, of course, not a desirable state of 
affairs, and ultimately the point should be 
reached where clear definitions are available. 
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THE DIMENSIONS OF OBSERVER SYSTEMS 


There are many differences among existing 
category systems or sets of rating scale items. 
They differ in the frame of reference the ob- 
server is to use, the aspect of social behavior 
toward which they are directed, the extent to 
which they call for special training of observ- 
ers, and many other respects. This section pre- 
sents a listing of some of the more important 
formal properties of observer systems. While 
the subsequent paragraphs are written with 
category systems uppermost in mind, virtually 
all of these methodological dimensions apply 
as well to rating systems. By a rating system 
we mean not only the specific items in the scale 
but definitions of the items and the instructions 
given to raters. 

The dimension of exhaustiveness. In the case 
of some category systems all the behavior can 
be classified into one of the defined categories in 
the set. As we shall see, Bales’ (1949) interac- 
tion categories make up such a system. His 
objective was to develop categories capable of 
enveloping all the behavior in a small face-to- 
face group. A contrasting system was that used 
by Jack (1984) in her study of ascendance and 
submission in a play situation, Her set of cate- 
gories refers to forms of ascendant behavior; be- 
havior not in these categories was not coded. In 
a sense, of course, all less-than-exhaustive cate- 
gory systems are really exhaustive systems, since 
behavior not codable into one of the categories 
is implicitly in a category labeled “not in the 
system.” The distinction is nevertheless a real 
one and essentially consists of determining how 
much of the total observable behavior is to be 
classified into a positively defined set of cate- 
gories. Typically, in a less-than-exhaustive sys- 
tem, there is no record of the amount of be- 
havior which occurred which was not in the 
defined categories. This may be a serious defi- 
ciency, since there are occasions when scores are 
meaningless without an estimate of the total 
activity which has taken place. For example, in 
a study of social influence in a camp situation 
by Polansky, Lippitt, and Redl (1949), it was 
necessary to correct the influence scores of the 
individual camper for total amount of activity 
he engaged in with other campers. 

The dimension of inference. Observer sys- 
tems differ in the amount of inference they re- 
quire from observers. Let us suppose that an 
experimenter is concerned with the influence of 
leader behavior on various aspects of the pat- 
tern of verbal participation in a problem-solv- 
ing group. He may ask his observer to make a 
notation each time a. member talks. This is es- 
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sentially a two-category system: talking and 
being silent. It requires practically no infer- 
ences on the part of observers, and requires only 
that the cues indicating talking and those indi- 
cating silence be specified. If this same experi- 
menter were concerned with motivation to par- 
ticipate verbally, he might set up categories 
such as “eager to talk” and “reluctant to talk.” 
This would require that the observers scrutin- 
ize not only the verbal behavior but also the 
nonyerbal behavior for cues suggesting one or 
the other of these tendencies. The observer de- 
duces from the behavior which he sees the un- 
derlying orientation toward participation on 
the part of the members. 

In every case, the theory dictates the level of 
conceptualization and the kinds of observations 
which must be made. In some cases, reports of 
directly observable data are sufficient to test 
hypotheses. In others, inferences must be made 
from behavior and the hypotheses tested by in- 
ferred data. In the latter case, the question still 
remains as to who is given the task of inference. 
The experimenter faces the choice of specify- 
ing low-inference categories from which he can 
later make inferences or asking the observers to 
make inferences on the spot. In both cases, 
there is the necessity for defining the properties 
of behavior which permit classification into the 
high-inference categories. In other words, if 
the theorizing which permits the experimenter 
to proceed from low-inference categories to an- 
other set of categories of higher inference is 
sound, it is also sound enough to specify the 
kinds of behavior which permit the observer 
to make certain inferences and not others. 

While the kinds of things observers are asked 
to make inferences about vary enormously, the 
most common are concerned with motives, in- 
tentions, or feeling states of either the actor or 
the recipient of the action. 

The number of aspects of social behavior un- 
der scrutiny. 1t is obvious that in virtually all 
social situations there are a number of processes 
operating simultaneously. Thus we can view à 
group discussion in terms of the number and 
kinds of influence attempts that are being made. 
We can choose rather to observe the changes in 
close interpersonal relationships over time. Or 
a single category system, in the sense of a set 
of categories given to a single observer, might 
be directed at both of these processes. In the 
Bales category system, which will be described 
in detail later, there are six such dimensions 
group interaction being observed. 

Discrete versus continuous categories. The re- 
lationships between the categories in any SIE 
tem may be of many different sorts. We are 
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particularly concerned here with whether or not 
the categories have a quantifiable relationship 
to each other, such that one category might be 
said to have less of a specified property than 
any category above it. For purposes of illustra- 
tion, we might take the following category sys- 
tem, applied to the social behavior of children: 
Category 1: Frowns and grimaces. Category 2: 
Uses words indicating unhappiness. Category 
3: Cries bitterly, Here we might argue that 
Category 3 contained behaviors which showed 
more extreme evidences of distress and pain 
than did Category 2 or 1. There remains the 
real problem of the distance between categories, 
but the system, to the extent that our assump- 
tions are correct, does have the properties of a 
continuous relationship between categories. 
Very few of the category systems which have 
been used have this continuous property. Most 
of them consist of discrete categories. 

Size of unit. Observer systems differ widely 
in the definition given for the unit of behavior 
which is to be classified or rated. The range 
goes from classifying or rating sentences or parts 
of sentences to classifying total meetings. In 
the case of systems directed toward quantifying 
physical acts, the range is typically from single 
movements to total behavior sequences. 

Certain systems specify what might be called 
natural units: an act, a sentence. In others, the 
unit of observation is imposed, as with a time 
unit. While often the natural unit is a small 
one, this need not be the case. An agenda 
item or an activity sequence is an illustration of 
a natural but macroscopic unit. Typically, the 
unit to be categorized is defined in part at least 
by the category system being used, That is, the 
unit is usually defined as that amount of be- 
havior which is classifiable into a single cate- 
gory. 

Range of applicability. Some observer sys- 
tems were established with the deliberate inten- 
tion of making them useful in a wide variety of 
situations. Others were developed with specific 
purposes dictated by a particular theoretical 
orientation or research design. There is no nec- 
essary reason why systems of the latter sort 
might not be useful to other experimenters. 
Very frequently, however, since they were estab- 


lished with a particular situation in mind, the 
definitions of the categories are not appropriate 
to another situation. Thus, for example, a cate- 
gory system set up to describe the behavior of a 
teacher in a school situation might not be im- 
mediately appropriate to describing a leader 
in a group discussion. This may not be, how- 
ever, because the dimensions of teacher be- 
havior are necessarily inappropriate to describ- 
ing leader behavior, but rather that the kinds 
of behavior in the leader situation which should 
be classified in each of the categories might be 
different from those in the teacher situation. 
That is, the categories might be the same but 
the description of the specific behaviors which 
belonged in each category would differ. 

This distinction is actually an important one. 
The current state of affairs, so far as observer 
methodology is concerned, is that new observer 
systems are created whenever existing systems 
seem inadequate. They are often regarded as 
inadequate for the wrong reasons; they are dis- 
carded not because the categories in an existing 
system are inappropriate, but because the pre- 
cise behaviors which are used to define the cate- 
gories are not appropriate. The net result is 
that we are gathering very little data which are 
commensurate from one experiment to another. 
Often the experimenter need only change the 
behavioral referents; however, he usually also 
changes the categories or the dimensions in 
terms of which observation takes place. 

If we look at the dimension of applicability 
of a system in these terms it becomes clear that 
some systems are more generally applicable 
than others in the sense that they define appro- 
priate behaviors at a level of generality which 
permits their application in a variety of situa- 
tions. Others are much more limited in their 
applicability because the behavioral referents 
are limited to the setting in which they were 
developed. When the conceptual meaning of 
the categories is clear, the behavioral manifes- 
tations in other situations often can be speci- 
fied. Thus the question of the applicability of 
a category system to a variety of situations de- 
pends on the clarity of the conceptualization of 
the categories. 


ILLUSTRATIVE OBSERVER SYSTEMS 


The principal objective of this section is to , 
present a representative set of techniques which 
have been used, a sampling of the types of prob- 
lems which have been attacked with the use of 
observers, and the settings in which observers 
have functioned. The classification scheme in 


terms of which the various systems are presented 

is admittedly a rough one. The major headings 

are not mutually exclusive; a given observa- 

tional procedure might well be classified under 

several headings. This is a problem, however, 

which arises no matter what classification 
€ 


376 HANDBOOK OF SOCIAL PSYCHOLOGY 


scheme is proposed. The main justification for 
the form decided upon here is that typically the 
experimenter, in search for a method, thinks 
first about the kind of phenomenon he is con- 
cerned with. This, in a general way, corresponds 
to the classification scheme proposed in this 
section. The reader is asked to keep in mind 
the dimensions along which observer systems 
differ that were suggested in the preceding sec- 
tion. 

Most of the systems described in this section 
are category systems, in spite of the fact that 
observer rating schemes are probably more fre- 
quently used. Our justification for this empha- 
sis is based on two propositions. First, the kinds 
of rating scales and their limitations are treated 
in several areas, notably in‘the industrial and 
personality literature (Ghiselli and Brown, 
1948; Allport, 1937). Social observation has 
made no unique contribution to rating method- 
ology. Second, although the basic problems are 
the same for both kinds of observer tasks, the 
two approaches have developed their own ter- 
minology; as a result, a discussion which alter- 
nates from one type of task to the other is 
unduly complicated. 


OBSERVATION OF FACE-TO-FACE 
GROUP INTERACTION 


Much of the current interest in observer 
methodology springs from theoretical and prac- 
tical interest in small-group functioning. The 
largest single category of observer systems is 
concerned with describing aspects of small- 
group interaction. Systems of this sort run the 
gamut from those which attempt to categorize 
virtually everything that goes on to those which 
describe particular dimensions of a single in- 
dividual, the leader, for example. The follow- 
ing section describes five systems which are 
fairly representative of the work that has been 
done in this area. 

Bales’ category system. (1) Purpose and 
theoretical framework. Bales is interested in 
utilizing the study of small groups to contribute 
to a body of theory that will be relevant not 
only to small face-to-face groups but to larger 
social systems (Bales, 1949). He deliberately 
set out to develop a standard set of categories 
of observation which had general utility. While 
Bales does not object to specific systems, closely 
tied to the intention of the investigator and 
the specific kind of group with which he is 

working, he feels that a general theory of face- 
to-face group interaction can be developed more 
readily by the systematic collection of compar- 
able data on a wide variety of groups. This is 
facilitated by a general category system. Bales 
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also feels that the widespread use of such a 
general category system will promote the de- 
velopment of norms on significant dimensions 
of group process. 

To achieve this objective of a set of categories 
which would have wide applicability over 
groups with different histories, functions, and 
composition, Bales tried a variety of ap- 
proaches. He searched the literature for existing 
category systems. He utilized free observation 
situations to note the kinds of categories which 
emerged spontaneously. He deliberately at- 
tempted to observe a wide variety of groups, 
eg. teaching seminars, leadership training 
groups. All in all, about a dozen revisions of 
the original category system took place. At one 
time there were as many as 85 categories, and 
at a later time as few as 5. The final system 
consists of 12 categories. It represents a com- 
promise between the conflicting pressures of 
theoretical adequacy on the one hand, and 
such practical considerations as the number 
and kinds of distinctions which the observer 
can make, and ease of analysis and interpreta- 
tion of the resulting data. The category system, 
as Bales says, consists essentially of a set of 
hypotheses about the basic dimensions of inter- 
action in face-to-face groups. As such, the possi- 
bility exists that it will be modified because of 
information about groups which the instrument 
itself provides, 

Table 1 shows the categories in Bales' system 
and the major interrelationships among them. 
"There are six primary dimensions of group in- 
teraction under scrutiny: the process of orienta- 
tion, evaluation, control over others, decision, 
tension management, and integration. On each 
of these dimensions, acts can be positive or 
negative. 

Put very briefly, an interacting group faced 
with a common task faces problems in six areas: 
problems involved in (1) arriving at a common 
definition of the situation (orientation); (2) 
developing a common value system in terms of 
which alternative solutions are to be evaluated 
(evaluation); (3) attempts of members to in- 
fluence each other (control); (4) arriving at a 
final decision. "To meet and solve problems in 
each of these areas, the group must deal with 
problems of its own organization. It must (5) 
deal with the tensions which arise in the group 
situation (tension management) and (6) work 
at the task of maintaining an integrated group 
(integration) . 

The group process is characterized by oscilla- 
tion from one of these problems to another. 
Attempts to deal with difficulties.in one of these 
areas give rise to problems in others. In suc- 
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THE SYSTEM OF CATEGORIES USED IN OBSERVATION AND THEIR MAJOR RELATIONS* 


1 gives help, reward: 


Social- 
Emotional 
Area: 
Positive 


shows satisfaction: 


3 


4 


Task 
Area: 
Neutral 


cy] 8 
9 


Social- 
Emotional 
Area: 
Negative 


D4 1 


out of field: 


12 


KEY: 


Problems of Communication 
Problems of Evaluation 
Problems of Control 

Problems of Decision 
Problems of Tension Reduction 
Problems of Reintegration 


*From Bales, 1951, p. 59. 
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cessful groups, the final solution represents a 
consistent set of solutions to problems in all 
of these areas, 

There is another classification scheme which 
can also be applied to the categories. The 
middle categories (4-9) are concerned prin- 
cipally with the problem-solving tasks; the 
behaviors falling in these categories are emo- 
tionally neutral, They are instrumental to per- 
formance of the task, whether successful or not. 
The extreme categories are appropriate for 
behaviors which are essentially social-emotional 
in direction; they are concerned with problems 
of group organization and integration and have 
emotional loading. Rather than being instru- 
mental to task performance, they are expressive 
behaviors. 


Shows solidarity, raises other’s status, 
2 Shows tension release, jokes, laughs, 
Agrees, shows passive acceptance, un- 


derstands, concurs, complies: 


Gives suggestion, direction, implying 
autonomy for other: 


ion, evaluation, analysis, 
expresses feeling, wish: 


, information, repeats, 
clarifies, confirms: 


1 Asks for orientation, information, 


repetition, confirmation: 


Asks for opinion, evaluation, analysis, 


expression of feeling: 


Asks for suggestion, direction, pos- 
sible ways of action: 


Disagrees, shows passive rejection, 
10 formality, withholds help: 


Shows tension, asks for help, withdraws 


Shows antagonism, deflates other's 
status, defends or asserts self: 


A Positive Reactions 
B Attempted Answers 
C Questions 

D Negative Reactions 


This over-all framework for the category sys- 
tem is important because it means that the 
ordering of the category is not capricious. To a 
large extent the definition of a category, and 
hence the behavior which is to be classified into 
the category, rests on its position in the entire 
system. If the total set of categories is not un- 
derstood, the observer is likely to modify the 
definition of the categories he does use. This 
means, practically, that it is as important for 
the observer to understand and remember the 
total system and the theoretical relationships 
among the categories as it is for him to retain 
in detail the specific definitions for each one. 

(2) Orientation of the observer. In this sys- 
tem the observer takes the point of view of the 
"generalized other." He tries to think of him- 
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self as a member of the group or, if possible, 
as a specific member toward whom the action 
is directed. His job is to classify the behavior 
of the actor in terms of its significance for the 
recipient or viewer of the act. “In other words, 
the observer attempts to put himself in the 
shoes of the person the actor is acting toward 
and then asks himself: ‘If this fellow (the actor) 
were acting toward me (a group member) in 
this way, what would his act mean to me? That 
is, what is he trying to do, either for himself or 
for us jointly (ie, what is the instrumental 
significance of the act) or what does his act 
reveal to me about him, or his present emo- 
tional or psychological state (i.e, what is the 
expressive significance of his act)?” (Bales, 
1949, p. 39). The observer differs from the 
participant only in that he has a special set of 
categories which direct his observation and in 
terms of which he gives his descriptions. 

The categorizing task requires a minimum 
of inference from the observer concerning the 
motivation of the actor. He works at and re- 
acts to the surface meaning of the behavior. 
The categories are, however, process categories; 
that is, they are not suitable for categorizing 
topical or substantive material. Hence it does 
require that the observer abstract out of a topi- 
cal contribution the significance of the act in 
terms of the problems confronting the group. 
In this sense, the observer is required to move 
at least one step away from the substantive 
meaning of the behavior. 

(3) Unit of observation. As is typical of 
most unit designations, the unit of behavior to 
be classified is defined in this system in part 
circularly; that is, by the category system itself. 
“This unit to be scored is the smallest dis- 
criminable segment of verbal or nonverbal be- 
havior to which the observer, using the present 
set of categories after proper training, can as- 
sign a classification under conditions of con- 
tinuous serial scoring" (Bales, 1949, p. 37). 
The unit is a single interaction capable of ap- 
propriate classification into a single category. 
"Typically, a simple sentence is a single unit; 
complex sentences always contain more than 
one. Compound sentences are broken up into 
simple ones and each receives a score. 

Nonverbal behavior is also scored, but since 
it is continuous and not readily broken up into 
units the procedure calls for assessments of the 
total group by the observer at one-minute in- 
tervals. 

In a real sense, then, Bales' system purports 
to be an exhaustive one, Every act, verbal or 
nonverbal, is susceptible to classification into 
one of the 12 categories. 
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(4) Context. One of the persistent sources of 
observer disagreement results from differences 
between observers in the amount of the context 
of the act which they keep in mind. Bales' an- 
swer to this problem is to have the observers 
consider only the immediately preceding act 
in their classification of the present one. This 
kind of decision, it should be noted paren- 
thetically, is dictated in part by decisions made 
in other areas, as is so often the case. To permit 
continuous observation and categorizing of 
small interacts, which was dictated as desirable 
by the theory, it is necessary to limit the de- 
mands on the observer to follow the logical 
argument. This was also imposed by the de- 
cision to have process rather than topical 
content categories. These decisions, of course, 
also bear on decisions concerning the amount 
of inference to be demanded of observers. 

(5) Types of scores. 'The category system 
provides a variety of scores. The distribution 
(profile) of the number of acts which fall in 
each category can be computed for individuals 
as well as for groups. In addition, ratio scores, 
for both individuals and groups, can be ob- 
tained. Thus, for example, it is possible to 
compute the ratio of the number of acts in cate- 
gory 7, Asks for orientation, to the number of 
acts in this category plus those in category 6, 
Gives orientation. This index Bales suggests as 
a rough measure of orientation difficulty. 

(6) Reliability. Bales has worked inten- 
sively on training procedures. He has also de- 
veloped a manual which defines the categories 
and makes the system communicable. With 
intensive training, Bales reports agreement co- 
efficients ranging from .75 to .95. 

Conference research problem-solving category 
system. (1) Purpose and theoretical frame- 
work. This category system was developed as 
part of the methodology for studying the de- 
cision-making conference; the specific objective 
was that of describing the problem-solving 
process (Heyns, 1948). The categories are de- 
fined in terms of intellective functions which 
must be performed in order for a group to solve 
a problem and arrive at a decision. The cate- 
gories and their definitions are as follows: 


Goal setting: "These contributions have the 
function of establishing or of suggesting 
goals or objectives, both of procedure and. 
of content. They are concerned with ends 
to be attained. These objectives, goals, oF 
ends may be those of the individual, which 
he is trying to have the group attain; they 
may consist of statements of accepted goals 
of the group or part of the group. 
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Problem proposals: These contributions serve 
the function of presenting a problem, 
either in content or procedure. These con- 
tributions are concerned with means to 
ends or goals. 

Information seeking: These contributions 
have the function of seeking to obtain in- 
formation of an objective, factual, or tech- 
nical nature. The information sought is 
from the area of fact on which the group 
decision is to be based. Contributions seek- 
ing factual, objective, or technical informa- 
tion concerning the procedure of the group 
or an individual are also classified here. 

Information giving: These contributions 
have the function of providing objective, 
factual, or technical information, either in 
the subject area or with respect to pro- 
cedure. The category includes the citing 
of examples or illustrations, 

Solution proposals: These contributions serve 
the function of indicating solutions to 
problems. They are suggested means to 
ends. Modifications of or additions to the 
solution proposals previously offered are 
classified in this category if the context 
gives the contribution a solution proposing 
function. 

Development seeking: These contributions 
serve the function of attempting to obtain 
clarification of previous contributions. 
They seek to determine what was intended 
by a previous contribution, what its impli- 
cations are, what inferences are permis- 
sible. These frequently take the form of an 
inference stated as a question. Also in- 
cluded here are contributions which facili- 
tate the procedure of the group by asking 
the group as a whole (or individuals) to 
comment, indications to individuals that 
they have the floor, etc. 

Development giving: Contributions here 
elaborate, make explicit, enlarge on con- 
tributions. Included here are: inferences 
from previous contributions, self-repeti- 
tions or restatements by others of previous 
contributions; reflecting types of contri- 
butions which are distillations of previous 
contributions without attempt at clarifica- 
tion. Finally, this category includes con- 
tributions which provide the rationale, 
reasons, or arguments for the individual's 
position. They give his reasons for saying 
what he does. 

Opposing: These contributions are charac- 
terized by an opposition, resistance to, or 
disagreement with a suggestion, solution, 
interpretation, etc. Responses which point 
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out obstacles, difficulties, or objections are 
included here. 

Supporting: These contributions serve the 
function of indicating agreement or ap- 
proval of a suggestion or solution proposal. 
Included here are indications of approval 
of the fact that another has contributed, 
whether approval of content is present or 
not. This is a supporting comment in pro- 
cedure, 

Summarizing seeking: These contributions 
ask, in effect, for a summary, e.g., "I'm lost, 
where are we now?" 

Summarizing giving: These contributions 
summarize the group, or part of the group's 
progress to date. "They refer either to sub- 
stantive material discussed over a period 
of time, conclusions reached, or to the 
group's procedure over a period of time. 
Summary statements of individual partici- 
pations are not included here. 

Nonproblem directed: This includes state- 
ments which have no reference to the sub- 
ject matter of the conference or to the 
group procedure. 


'The assumption underlying the system is that 
there are, in fact, two kinds of problems which 
confront any problem-solving group. The first 
is that of the substantive problem, the agenda 
item. 'The second is that of organizing and 
mobilizing the group to deal with the substan- 
tive matter. The system makes the further 
assumption that the intellective functions 
which must be performed are the same for both 
Kinds of problems. 

(2) Orientations of the observer. The ob- 
server is instructed to code the contributions 
of participants in terms of the theoretical func- 
tion of the contribution, That is, the observer 
is told not to make judgments about the intent 
of the actor, nor to appraise the effect of the 
contribution on the subsequent process. The 
observer is outside the process and views each 
contribution in terms of its theoretical proper- 
ties as a problem-solving function. Thus, a 
particular contribution is categorized as solu- 
tion proposal because it is theoretically a solu- 
tion to a problem or a means to a goal, not 
because it necessarily functions as such later 
in the meeting. The categories involve no in- 
ference on the part of the observer concerning 
motives of the actors or feeling states of re- 
cipients of acts. Only verbal behavior is clas- 
sified. 

(3) Unit of observation. As with Bales, the 
unit of behavior to be categorized is defined in 
terms of the category system itself. The precise 
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statement of the unit is “any subject-predicate 
unit which is classifiable into a single cate- 
gory.” This entity is referred to as a functional 
unit, There may be more than one functional 
unit in a single participation of a group mem- 
ber. 

(4) Context. The same procedure is used as 
with the Bales system. The observer classifies 
on the basis of the immediate context of the 
behavior. Thus, while a particular contribu- 
tion might be classified as opposing a solution 
proposal made earlier in the meeting, if in the 
immediate context of the contribution it has 
the function of supporting the solution pro- 
posal just made it is so classified. 

(5) Types of scores. The basic data for a 
given group meeting consist of a tabulation of 
the number of contributions falling into each 
category for each participant. These scores may 
be analyzed on the level of the individual or 
they may be combined to yield an over-all pic- 
ture of the group. Both levels of analysis have 
been found to be valuable. 

For describing the group as a whole, the most 
obvious score is the percent of the total contri- 
butions ' falling into each category. On the 
individual level, two scores are possible: scores 
which describe the individual's performance in 
relation to self, and those which describe his 
performance in relation to that of the total 
group. Hence, for each group member there 
are two scores: the “% own" score which ex- 
presses the concentration of verbal behavior in 
each category as a fraction of the participant's 
own total behavior, and the “% category" score 
which indicates what part of the total contri- 
butions of the group in each category is ac- 
counted for by a given group member. These 
two scores reflect behavior patterns which may 
be quite different conceptually. If a participant 
accounted for most of the opposing contribu- 
tions during the meeting, his % category score 
on opposing would be correspondingly high, 
but his % own score for opposing need not be 
the highest of all his % own scores. He may 
actually stress supporting more than opposing 
in his verbal behavior, so that the former cate- 
gory has a higher % own score for that indi- 
vidual. 

It is readily apparent that both % category 
and % own scores may be used in designating 
the roles played by the various group members. 
The above-mentioned individual who contrib- 
uted most of the opposing done in the group 
could be described, relative to the other mem- 
bers, as the "critic." However, in terms of the 
comparative incidence of functions performed 
by that individual, he may also be referred to 


as a "supporter." Two different sets of hypoth- 
eses may thus become necessary for dealing with 
the different role designations arrived at 
through the use of the two scores. 

(6) Reliability. In several studies in which 
it was possible to determine the extent of agree- 
ment between trained observers, the correlation 
coefficients ranged from .64 to .97, for both types 
of individual scores. Rank order coefficients 
for group scores in a series of studies ranged 
from .71 to .98. 

Steinzor's social interaction category system. 
(1) Purpose and theoretical framework. Stein- 
zors purpose, like Bales’, was to develop a 
method for studying the basic processes in face- 
to-face group interaction (Steinzor, 1949). His 
more specific objective was to develop a system 
for describing group atmosphere. He starts his 
pursuit of such a system by noting that behavior 
is motivated and hence cannot be understood 
without taking into account motivational fac- 
tors. Verbal behavior is a highly important 
aspect of adult social relations and motivational 
components are present in verbal behavior. 
Steinzor believes that our understanding of 
group functioning will be greatest when the 
inferences that are made concerning the motives 
of actors in a social situation most closely ap- 
proximate the way in which the individual him- 
self would define his purpose. In addition, he 
attempts to develop a system that will be ap- 
plicable to a wide variety of verbal interaction 
situations and one whose categories are as [ree 
as possible from value connotations. His final 
system consists of two category sets. The major 
one is comprised of 18 intent categories and one 
residual category: 


Activate and originate Offer solution 
Structure and delimit Conciliate 

Diagnose by labeling ` Understand and reflect 
Evaluate Give support 

Analyze and explore Seek support 

Seek information to Oppose and attack 


learn Show deference 
Express and give infor- Conform 

mation Entertain 
Clarify conclusion Miscellaneous 
Defend 


The labels listed above roughly describe the 
categories, but each intent category is accom- 
panied by a description. Thus, for example, 
the giving of factual data is classified in the 
"Intent to Structure" category. The second 
category system describes the direction of the 
intent. Three loci of intent are specified: self, 
group, or the issue, Thus, for example, one 
can ask questions of oneself, ask for personal 
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information from other group members, or ask 
about group procedures, and, finally, one can 
ask for information about the issue. This sec- 
ond system can be conceived as a subcategory 
system within each of the larger ones. Steinzor 
regards these categories as points on a con- 
tinuum of self-involvement. 

It should be noted that Steinzor's system does 
not purport to describe the total motivational 
picture in the group at a given moment. It fo- 
cuses on the intent of the actor, ignoring the 
motivational consequences of the act in the 
other members of the group. By the same token, 
any group score represents only the motiva- 
tional characteristics of the actors. 

(2) Orientation of the observer. By defini- 
tion, the observer using this system is required 
to make inferences concerning the intent or 
motivation of the actor. This judgment about 
intent determines the category into which the 
behavior is to be placed, not the effect of the 
behavior on subsequent group processes. In the 
use which has been made of the system to date, 
observers have worked exclusively from type- 
scripts. Thus the inferences had to be based on 
the actual content of the remarks and not on 
nonverbal cues. 

(3) Unit of observation. Steinzor defined 
the unit to be categorized as the entire state- 
ment a person made that occurred between the 
statements of individuals immediately preced- 
ing and following the person’s expression. If 
the individual stopped talking for five or more 
seconds and then continued, without anyone 
else participating, his statement is counted as 
two units. In the published research using this 
system, the observer may assign more than one 
code to longer statements, but this is left to his 
discretion. 

(4) Context. Steinzor, in his description of 
his system, does not deal explicitly with the 
question of how much of the total interaction 
is to be taken into account in defining the in- 
tent of a particular verbal act. He indicates 
(Steinzor, 1949, p. 119) that the judgment of 
intent is more difficult the more restricted the 
behavior on which the judgment is based. This 
suggests that the observers are to use as much 
of what has already transpired as they can 
retain. 

(5) Types of scores. As with the two cate- 
gory systems just described, it is possible to 
obtain both individual and group scores as well, 
as ratio scores. 

(6) Reliability. The reliability figures which 
Steinzor reports are quite low. There are a 
number of factors which probably combined to 
produce a low degree of agreement, such as 
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relatively brief training of coders, poor moti- 
vation of coders, the loose definition of the unit 
to be categorized. Finally, the judgment of 
intent was made on the basis of transcribed 
material. This leaves out many of the non- 
verbal cues of motivation on which we depend 
in everyday life. The presence of these inter- 
ferences makes it unwarranted to say that the 
low reliabilities indicate the impossibility of 
inferring intent. Indeed, there is evidence that 
a well-trained coder using this system is con- 
sistent with himself when the coding job is 
resumed after a period of two months. The 
greater success of the category system described 
below, which requires much the same skill from 
observers, also indicates that inferences of this 
sort can be made reliably. At the present time, 
the over-all evidence concerning the reliability 
of scores indicates that the Steinzor system re- 
quires considerable training of observers and 
probably some combining or redefinition of 
categories. 

Self-oriented need observational system. (1) 
Purpose and theoretical framework. This ob- 
servational system was developed as part of the 
methodological arsenal used to observe de- 
cision-making conferences in business and in- 
dustry (Fouriezos, Hutt, and Guetzkow, 1950) . 
In contrast to Bales', the focus of this procedure 
is on a particular aspect of group interaction, 
namely, the motivation of the participants. Ac- 
cording to the developers of this system, the 
behavior of individuals in a group may be re- 
garded as being generated by either or both of 
two sources. The behavior may be dictated by 
the requirements of the group situation or pri- 
marily by the individual's own personality 
needs. The latter is labeled self-oriented need 
behavior and is directed principally to the 
satisfaction of the need itself, regardless of the 
effect the behavior has on the attainment of 
the group goal. The experimenters recognized 
that both sets of determinants may be operating 
simultaneously; the principal distinction is that 
of the degree to which the behavior is a func- 
tion of one or the other. 

Five need categories were selected which ac 
cording to pilot studies provided the greatest 
amount of agreement between observers. These 
are: Dependency, Status, Dominance, Aggres- 
sion, and Catharsis. The categories are defined 
fairly precisely by the authors and behavioral 
cues are suggested for each. The main function 
of the categories in the system, however, is to 
provide a framework for the observer. The 
final score is not the number of entries in each 
category but rather a rating made by the ob- 
servers. During the group session itself, the 
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observer tallies particular instances of the ex- 
pression of the types of needs. At the conclu- 
sion of the meeting, he makes a subjective 
appraisal of his tallies in the form of a single, 
over-all rating on an eleven-point scale. The 
scale runs from 0 (nonexpression of self-ori- 
ented need) to 10 (all behavior of the self-ori- 
ented type). 

There is no clear-cut evidence that the device 
of classification during the meeting, prior to 
the final rating, is essential to the method. Its 
principal value is probably that of keeping the 
observer alerted to the dimensions. 

(2) Orientation of the observer. As indicated 
above, the category system provides a “cue 
manual” for the observers. The observer takes 
into account the total situation in which the 
behavior takes place. He is concerned not only 
with the content of the contribution but its 
affect loading. This involves sensitivity to the 
nuances, the inflections, and timing of com- 
ments. The system does not assume that the 
individuals showing self-oriented need be- 
havior will do so in all situations; as a result 
there is no pressure on the observer to make a 
hard and fast personality assessment of each 
individual which ascribes to him some enduring 
properties. It is interesting to note, however, 
that in one study the ratings of an observer 
who saw the participants only in the group 
situation correlated .82 with those of observers 
who saw the individual in the group situation 
and had had four hours of personal interaction 
with the participants and knew the results of a 
clinical appraisal which included Rorschach 
scores, TAT scores, sentence completion test re- 
sults, and the results of an intensive interview. 
This clinical battery was aimed at assessing the 
same needs as were specified in the category 
system. 

It is clear that this system requires a great deal 
of inference on the part of observers. Each par- 
ticipation is scrutinized in terms of the possible 
motivational forces which are operating. The 
whole process requires making deductions from 
the behavior of the individual as to what the 
motivational factors are. 

(3) Unit of observation. This is the single 
participation, in the sense that each participa- 
tion is categorized. The final score for each 
verson is based upon the over-all impact of the 
individual's performance and hence is based on 
the entire meeting. 

(4) Types of scores. Again it is possible to 
derive both individual and group scores. The 
researchers have used two procedures to arrive 
at a group score, The first of these consists of 
an average of all the individual ratings. The 
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second is an assessment of the extent to which 
the group process as a whole was characterized 
by self-oriented need behavior. The two group 
scores correlate .67. 

(5) Reliability. In two studies of the degree 
of agreement between observers in the ratings 
given to individual participants, the correla- 
tions reported are .67 and .73. In another, the 
rank orders of two groups of 16 participants 
rated by two observers correlated .86 and .96. 
The average intercorrelation of three observers 
of the total group score on a sample of 30 groups 
was .81. The indications are that this procedure 
can provide reliable scores with a relatively 
small amount of training. 

Carter's observational procedures. (1) Pur- 
pose and theoretical framework. Carter and his 
associates have been working particularly on 
the problem of leadership in small face-to-face 
groups (Carter, Haythorn, Meirowitz, and Lan- 
zetta, 1951a, b). This orientation has dictated 
the dimensions of group process which are ob- 
served. The procedures and the categories of 
observation were developed in settings where 
small groups worked on three different tasks in 
a laboratory situation. One of these tasks was 
that of assembling objects from available ma- 
terials. As a result of this kind of observational 
situation, the Carter system includes many more 
categories that are applicable to nonverbal be- 
havior than do the systems already described. 

Table 2 lists the categories in one of the sys- 
tems which Carter and his associates have 
developed. The system contains seven princi- 
pal dimensions such as showing personal feel- 
ing, proposing and initiating action, leader 
roles in carrying out action, etc., with subcate- 
gories under each. 

By and large, all the categories are low-in- 
ference categories in the sense that the observer 
need not make judgments concerning the mo- 
tives or feelings of the individuals. This is less 
true of the first 12 categories than the others, 
but even here the observer relies on commonly 
accepted meanings of behavior and words 
rather than upon subtle cues. 

As was mentioned earlier, Carter and his 
group have also explored the use of ratings. 
At the end of the observation session, after 
coding the behavior in the categories described 
above, each observer rated each individual on 
15 personal traits and 4 group-oriented traits. 
These ratings were made on a seven-point scale 
and included such traits as authoritarianism, 
leadership, initiative, submissiveness, and in- 
sight. 

(2) Orientation of the observer. Carter's 
observers work behind one-way screens. They 
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TABLE 2 


LIST OF CATEGORIES: 


Major — M 

. Showsa personal feeling of 

Aggressiveness or anger 

. Anxiety or insecurity 

Attention or readiness 

Confusion 

Cooperativeness 

. Deference 

. Dissatisfaction 

. Formality or reserve 

. Friendliness 

10. Negativism or rebelliousness 

11. Satisfaction or accomplishment 

12. Status (superior) 

. Proposes and initiates action 

21. Calls for attention 

. Asks for information or facts 

. Diagnoses situation — makes interpretation 

. Asks for expression of feeling or opinion 

. Proposes course of action for self 

. Proposes course of action for others 

27. Supports or gives information regarding his 
proposal 

. Defends self (or his proposals) from attack 

. Initiates action toward problem solving 

which is continued or followed 

Supports proposal of another 

31. Agrees or approves 

. Gives information 

33. Gets insight 

34. General discussion re task 
Expression of opinion 

. Disagrees and argues — with a somewhat 
negative connotation 

. Disagrees or skeptical 

41. Argues with others 

. Vigorously argues with others 

. Deflates others 

. Gives bald commands or prohibits (in dis- 
agreeable fashion) 
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note the originator of the action, the type of 
behavior, and, if appropriate, the recipient of 
the behavior, Their task is to note what the 
individual is doing, looked at from the stand- 
point of the outsider and in terms of the func- 
tion of that behavior in the total situation. 
That is, the observer does not concern himself 
with the intent of the actor and typically not, 
with the effect of the act. 

(3) Unit of observation. This has been a 
particular problem to this research group be- 
cause of the presence in their category system 
of nonverbal categories and nonaction cate- 
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Minor = L 

50-59. Leader roles in carrying out action 

50. Gives information on how to carry out action 

51. Praises, commends, rewards 
. Expresses a desire that something be done 
58. Asks for assistance for others 
54. Asks for assistance for self 
55. Integrates group behavior 
. Follower and “worker” roles in carrying out 

action 
. Follows suggestion or directions 
61. Offers to help or helps 
62. Imitates others 
63. Asks for permission 
64. Collaborates with others 
65. Answers questions 
66. Performance of simple work unit (group 
oriented) 
67. Performance of simple work unit — an inde- 
pendent effort 


68. Passively helps 
70-79. Abortive or nonproductive behavior, re 
problem 
70. Initiates action which is not followed or con- 


tinued 
71. Verbal interplay without outcome 
Listens, but doesn't express self or enter in 
90-99. Miscellaneous 
. Stands around doing nothing 
91. Engages in “out of field” activity 
. Engages in incidental conversation while 
working 


gories. For example, category 72 is “Listens but 
doesn't express self or enter in.” How fre- 
quently should this category be checked in ten 
minutes of listening? An early attempt to deal 
with this problem was to call for the assessment 
of each subject every half-minute on expressions 
of feeling or behavior which involved no 
change. The interaction proved too rapid for 
this solution to be practical. The resolution 
finally decided upon was to categorize at least 
one unit of behavior for every change in be- 
havior. Accompanying this decision was that 
of defining a change in behavior quite narrowly. 
€ 
e 
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Thus, for example, each repetition of a routine 
task is regarded as a change in behavior. The 
research group using this procedure feels that 
the net result of this unit designation is that 
virtually all action and changes in action are 
recorded, but that there is a less adequate pic- 
ture with respect to feelings and inactions. 

(4) Context. While Carter does not discuss 
this explicitly, it is probable that the acts are 
coded in the context of immediately preceding 
acts and in terms of the immediate task situa- 
tion, rather than using any long-range body of 
impressions which the observer might accumu- 
late. 

(5) Reliability. Carter reports an average 
correlation between two observers’ (postmeet- 
ing) ratings for all traits as .68. For individual 
traits the correlations run from zero to .94. 
The highest reliability coefficients were ob- 
tained on the "leadership" ratings and those 
for "orientation. towards group acceptance" 
were lowest. These were .84 and .34 respectively. 
These coefficients are corrected for double 
length since, in most calculations, the actual 
score used is the average of the two observer 
ratings. 

Carter and his associates compute the reliabil- 
ity of category scores by correlating the number 
of units of behavior for particular categories 
awarded each individual subject by independ- 
ent observers over a designated time period. 
The resulting figures are also corrected for 
double length because of the way the final 
score is computed. The reliability coefficients 
reported are those of combined category scores, 
where a number of categories have been com- 
bined to provide a particular score. For ex- 
ample, Categories 25, 26, and 29 are combined 
to provide an initiative score. Coefficients 
for five such combined category scores range 
from .20 to .89. Carter's data indicate that the 
reliability figures vary depending on what task 
was being performed when the observations 
were made. This is principally a function of 
the incidence of behavior in the category in- 
volved. Thus, for example, the insight cate- 
gory score has low or indeterminate reliability 
in the group discussion situation because of the 
low incidence of this sort of behavior in the 
task. There is little variation in reliability co- 
efficients when the data are computed separ- 
ately for 4-man and 8-man groups. In general, 
the evidence indicates that the system provides 
reliable scores in the hands of trained observers. 


OBSERVATION IN A TWO-PERSON SITUATION 


In reality, the use of observers in a two- 


person situation presents no new problems; in 
e 
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fact, the two-person situation is often less com- 
plex and permits greater exploitation of the 
potentialities of the observer as a measuring 
device. However, some of the newer observa- 
tional procedures have been developed in a 
two-person setting and we shall discuss two of 
these in this section. 

Chapple’s procedure for measurement of in- 
teraction. (1) Purpose and theoretical frame- 
work. The total theory, together with its impli- 
cations for formal and informal organizations, 
which underlies the Chapple procedure is too 
extensive to be discussed here. Chapple is in- 
terested in objective description of interaction, 
and was dissatisfied with interview material as 
a source of information about the dynamics of 
interpersonal relationships. As a result of a 
variety of experiences, Chapple developed the 
notion that there is a relationship between such 
objective events as the amount and duration 
of contacts between individuals on the one 
hand and emotional relationships between 
people and their attitudes toward one another 
on the other (Chapple, 1940). 

Proceeding from this point, Chapple and his 
associates have developed an observation pro- 
cedure which is concerned exclusively with the 
measurement of the time aspects of interper- 
sonal relationships. The observer's task is to 
press a key when an action begins, record how 
long it continues, and when it terminates. This 
kind of record is kept for both persons in the 
situation, The observer makes no inferences 
about emotional states or motives. Nor does 
he attempt to code the content of what is said 
or to describe the act. Chapple deals quite ex- 
plicitly with these decisions. He believes that 
feeling states and content categories are actu- 
ally described by his quantitative measures of 
the time characteristics of the interaction of two 
persons. While the following quote does not 
do justice to the support Chapple marshalls for 
his position, it does give the reader the flavor of 
his position: 

If the reader sharpens his powers of observa- 
tion, he will see that in many cases people whom 
he does not like or cannot get along with say ex- 
actly the same things that people he does like 
say. So actors frequently take a short play, play 
it first as a tragedy, and then, using the same 
words, play it as a comedy. Here the language is 
seen as unimportant, and the timing is the factor 
which makes the difference in its effect on the 
audience. (Chapple, 1940, p. 33) 


Actually, the time record provides a great 
deal more information about the interaction 
process than our brief description of what is 
recorded would suggest. It is possible to note 
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not only how much each person talked or acted, 
but also characteristics of their interaction: 
whether they interrupted each other, whether 
there were or were not delays between the ac- 
tion of one and the action of the other, and 
so on. These various indices have been con- 
ceptualized to a certain extent (Chapple, 
1949). Chapple has described and labeled such 
scores as the following: tempo (how often a 
person starts to act); activity or energy (how 
much longer he talks or responds than is silent 
and unresponsive); adjustment (the length of his 
interruptions against his failures to respond) ; 
initiative (the frequency with which one person 
takes the initiative as against the other) ; domi- 
nance (the frequency with which one person 
outtalks or out-acts the other when there has 
been an interruption); synchronization (the 
frequency with which one person interrupts 
or fails to respond). In addition to these, there 
are indices of flexibility, ability to listen, and 
others which are arithmetic combinations of 
the ones listed above. 

Chapple’s entire procedure is a marked de- 
parture from the general trend toward using 
the observer to make delicate judgments and 
to code behavior into highly conceptualized 
categories. Those who have been in this main 
stream find it hard to believe that an essentially 
contentless observational system, concerned 
with the mechanics of interaction, can have 
much predictive efficiency. As Chapple himself 
suggests, whether his system of measurement is 
complete enough to describe interaction suc- 
cessfully will depend on what it is able to 
predict and how well it does that prediction 
job (Chapple, 1949, p. 296). On the basis of 
work completed, Chapple reports that vari- 
ous of his indices discriminate between effective 
and ineffective sales personnel, between occu- 
pants of different job groups in a company, and 
between normal and psychiatric patients 
(Chapple and Lindemann, 1942). On the basis 
of evidence available, Chapple's results indi- 
cate that it would be unwise to move too far in 
the methodology of observation from the more 
formal properties of interaction. In a sense, 
Chapple's work constitutes a return, with more 
sophistication and more theory, to some of the 
earlier work with observers. 

(2) Unit of observation. It is apparent from 
the description already given of the procedure 
that the observer records more than verbal acts. 
He notes the occurrence of physical acts such 
as gestures and nods of the head. Chapple deals 
more explicitly with the definition of the unit 
than do many developers of observer systems. 
In his system the unit of observation is the unit 
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of action. He defines a unit of action as that 
period of activity recorded from the initial 
change in muscular state until a second change 
brings the activity to an end (Chapple, 1940, 
p. 24). There are undoubtedly instances in 
which this definition brings small comfort to 
an observer faced with deciding whether an 
action is a new one or part of an old one. How- 
ever, in the situation in which the procedure 
has been most extensively used, namely, the 
interview, where actions are primarily verbal 
and movement possibilities are limited, prob- 
lems of confusion about the beginning and end 
of an act are undoubtedly minimal. 

(3) Reliability. Data bearing on the reliabil- 
ity of the observer record are limited. Chapple 
indicates that on a half-dozen records done by 
two observers on two tape recorders, there were 
"no differences in the means obtained or the 
number of units recorded" (Chapple, 1949, p. 
297) . 

Description of interaction in a therapy situa- 
tion. (l) Purpose and theoretical framework. 
The basic procedures of observation, developed 
in other settings, are currently being employed 
in the description of the therapeutic process 
(Keet, 1948; Bergman, 1951; Osburn, 1951; Ditt- 
man, 1952). Most of the work of this sort has 
been done by analysis of typescripts or of re- 
cordings rather than with direct observers. 
However, this limitation to typescript or record 
analysis seems to us to have been a result of 
characteristics of the situation in which thera- 
peutic treatment takes place and not a function 
of the dimensions of interaction which have 
been measured. Since there is good reason to 
think such considerations as the need for pri- 
vacy can be dealt with satisfactorily and still 
permit direct observation, it seems useful to 
discuss one of the techniques that has been 
developed for describing an aspect of the ther- 
apy process. 

The main impetus toward description of this 
special kind of social situation has come from 
Rogers and his associates. One of the pioneer- 
ing efforts in this group was made by Snyder, 
and while the Rogers group and others (Berg: 
man, Dittman) have elaborated and refined 
the procedures, the essential characteristics of 
this sort of application are well illustrated in 
this study (Snyder, 1945) . 

The theoretical approach which dictated the 
category system which Snyder developed is that 
which underlies client-centered therapy (Rog- 
ers, 1942). The system finally arrived at actu- 
ally consists of two category sets: one developed 
for describing the responses of the counselor 
and the other for describing the client. The 
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counselor category system contains 16 content 
categories and a residual category for unclassi- 
fiable statements. Within this total set are 
major groupings of categories. An abbreviated 
statement of the counselor categories, indicat- 
ing the major groupings, follows: 


Lead-taking categories 
Structuring 
Forcing client to choose and develop 
topic 
Directive questions; specific type of ques- 
tions 
Nondirective leads and questions 


Nondirective response-to-feeling categories 
Simple acceptance 
Restatement of content or problem 
Clarification or recognition of feeling 


Semidirective response-to-feeling category 
Interpretation 


Directive-counseling categories 
Approval and encouragement 
Giving information or explanation 
Proposing client activity 
Persuasion 
Disapproval and criticism 


Minor categories 
Ending of contact 
Ending of series 
Friendly discussion 
Unclassifiable 


Analyses of client responses indicated that 
many of them had an important second dimen- 
sion. Some clear content statements were also 
accompanied by explicit or implicit feeling or 
emotion. Accordingly, the final set for classify- 
ing client responses includes a group of cate- 
gories having to do with the feeling tone or 
attitudes of the client. The resulting set, also 
abbreviated, consists of the following cate- 
gories: 


Problem category 
Any statement of problems of symptoms 


Simple response categories 
Asking for advice or information 
Answer to a question 
Simple acceptance 
Rejection of a clarification or rejection 


Understanding or action-taking categories 
Understanding or insight 
Discussion of plans and decisions 


Minor categories 
Ending of contact 
Ending of series 
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Not related to problem 
Friendly discussion 
Unclassifiable 


Client feeling categories 

Positive attitude toward self 

Positive attitude toward counselor or coun- 
seling situation 

Positive attitude toward other persons 

Negative attitude toward self 

Negative attitude toward counselor or 
counseling situation 

Negative attitude toward other persons 

Ambivalent attitude toward self 

Ambivalent attitude toward counselor or 
counseling situation 

Ambivalent attitude toward other persons 


In the study in which this system was de- 
veloped and applied, 48 interviews were ana- 
lyzed; altogether slightly under 10,000 responses 
were classified. The unit of classification was 
n “idea,” defined as a clearly indicated change 
in subject matter or attitude. 

With the exception of the feeling categories, 
and these only when the feeling is implicit 
rather than explicitly stated, the categories re- 
quire very little inference. Their definitions are 
clearly stated and illustrated, Thorough study 
of them, and the framework from which they 
come, makes the categories quite clearly under- 
stood. 

No attempt was made in the original study 
(Snyder, 1945) to determine the relationship 
between either client or counselor behavior and 
a success criterion. Snyder's data did provide 
rich descriptive material on the process itself, 
the relationship between counselor responses 
and client responses, and changes in the inci- 
dence of certain content categories over time. 
The findings make good sense and indicate that 
the system is a very useful one. 

(2) Reliability. Data on reliability comes 
from two sources. Snyder himself repeated the 
classification of the responses in four interviews 
more than a month after the original classifica- 
tion. Eighty percent of the items were classified 
into the identical categories; eighty-five percent 
were classified in the same major group. Total 
scores in each category correlated between .76 
and .87. 

In the second reliability check, a person un- 
trained in psychotherapeutic interviews was 
given the category definitions. He studied these 
and the classifications which had been made of 
the responses in two sample interviews. His 
scores correlated from .52 to .78 with those of 
Snyder. These results indicate that the system 
can be quite readily communicated; with à 
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minimum amount of training moderate relia- 
bility was attained. Undoubtedly more training 
would raise this figure considerably. 


OBSERVATION OF SOCIAL BEHAVIOR 
IN A FIELD SETTING 


Up to this point we have been discussing ob- 
servational procedures developed and applied 
to small group or two-person interaction under 
more or less controlled conditions. The subjects 
are localized geographically and the activities in 
which they engage are typically limited and pre- 
scribed. When the social scientist's interests 
take him into less controlled situations per- 
mitting a greater variety of behaviors, new prob- 
lems of observations arise. Among other things, 
the observer must be mobile, and this has impli- 
cations for the kinds of equipment he can use. 
Problems of unit definition become more acute, 
as do problems of identification of actors. The 
definitions of the dimensions of observation 
must be clear enough either to make them ap- 
plicable in a wide variety of activity situations 
or to indicate their nonapplicability under spe- 
cified conditions. For example, the investigator 
may be interested in the influence process. The 
problem then becomes that of conceptualizing 
the influence process in such a way as to make 
influence categories useful and appropriate 
whether the actors are deciding to go swimming 
or actually playing ball. A brief review of some 
of the observation systems which have been de- 
veloped for use in field situations will illustrate 
these and other problems. 

Study of behavioral contagion in a camp set- 
ting. (1) Purpose and theoretical framework. 
Polansky, Lippitt, and Redl (1949, 1950) were 
interested in the spread of mood, behavior, or 
attitude from one person to another through an 
entire group. This is essentially a problem of 
influence, and necessitated a procedure for de- 
scribing the influence proces. They define 
social influence as that phenomenon which oc- 
curs when a change in behavior of one individ- 
ual appears to produce a change in the be- 
havior of a second individual. The first person 
in this chain is referred to as the actor and the 
second as the recipient. The investigators were 
particularly interested in contagion, by which 
they mean the spread of behavior or mood that 
does not result from direct influence attempts. 
An instance of contagion is defined as follows: 
an event in which a recipient's behavior has 
changed to become more like that of the actor 
or initiator, and when the change occurs in a 
social interaction situation in which the actor 
has not communicated intent to evoke such a 


change. In the principal study reported, the 
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relationship between group status and influence 
was investigated. 

"The observer team in this study used the fol- 
lowing categories: 


l. Direct attempts at influencing (indicating 
actor and recipient) 

a. Manner, including: Directs, Demands, 
Uses force or threat of force, Requests- 
suggests, Pleads 

b. Goal, including: Personal (actor's 
own) goal, Group (or shared) goal, 
Recipient’s own goal, Some third per- 
son’s goal. ` 

c. Response of the Recipient, including: 
Accepts and complies, Accepts in modi- 
fied form, Acquiesces but does not 
comply, Ignores, Rejects, Could not be 
determined. 

2. Status indicators 

a. Asks permission 

b. Implies superior knowledge or skill in 
recipient 

c. Ridicules 

3. Incidents of contagion 

a. Initiator 

b. Recipients (in order) 

c. Content of the contagion 


(2) Orientation of the observer. The camp- 
ers were divided into groups and, except for 
periods in which reliability checks were made, 
one observer followed each group, usually for 
an entire day. He made his observations 
through all the activities in which it was a dis- 
tinguishable group. "Three observers were used 
and were rotated through the groups. Observa- 
tions were continuous for intervals of 15 min- 
utes. By and large, the categories require a 
minimum of inference. 

(3) The unit of observation. This is not 
clearly specified as to boundaries, but it is an 
interaction, especially in the case of direct influ- 
ence attempts. Typically, in these cases, the 
unit is a verbal interaction. This particular 
category system illustrates more clearly than do 
some others the interdependence between cate- 
gory definition and unit designation. In the 
case of behavior entering the contagion cate- 
gories, any act of a group member is potentially 
a contagion behavior. It becomes one in actu- 
ality, however, only when there is a change in 
the behavior of another person in the direction 


pf the actor's behavior. 


(4) Reliability. Reliability checks indicated 
the need for grouping some of the categories 
that required differentiations the observers were 
unable to make in the actual situation. Thus, 
the broad category "manner of influence at- 


tempts” was reduced from five subcategories to 
two: “directive,” which included all the in- 
stances in the first three categories; and "non: 
directive,” which included the last two cate- 
gories of the former system. This is a particu- 
larly happy instance of combining categories 
into broader ones in which the resulting broad 
categories retain some conceptual clarity. The 
"response-of-recipient" categories were also re- 
duced from five to two: "successful," including 
"accepts" and "complies," and "accepts in modi- 
fied form;" and "unsuccessful," including "ac- 
quiesces but does not comply," "ignores," and 
“rejects.” 

When the rank order of both actors and re- 
cipients of one observer is compared with that 
of another observer, rank order correlation co- 
efficients rank from .00 to .87, depending on the 
particular score. Twenty-three of the thirty- 
four correlations reported are above .60. In 
general, actor scores are more reliable than re- 
cipient scores. This is partly due to less diffi- 
culty in identification of actors, especially since 
in many of the situations of contagion there are 
many recipients. These data also reflect the 
fact that two observers may differ in the number 
of interactions observed. A separate reliability 
check, when identical interactions were ob- 
served, had very high percent agreement figures 
in categorization, indicating that the final cate- 
gory system itself can be used quite reliably. 
The difference between the correlations of ob- 
server scores assigned to actual children and the 
percent agreement scores, where the reliability 
of the observer system itself is under test, illus- 
trates that there can be a real difference between 
the adequacy with which categorizing is done 
and the reliability of the scores actually as- 
signed individuals. In this case, the difference 
is in favor of the system itself, which is contrary 
to usual experience. It is due in this case to 
differences between observers in the extent to 
which the system is actually applied to people 
and not to differences between them in the way 
they apply it. 

As was mentioned earlier, it was necessary to 
correct individual scores both for total activity 
and for the amount of time each person was in 
the presence of others. 

Observation methods in psychological ecol- 
ogy. (1) Purpose and theoretical framework. 
Wright and Barker and their associates have 
embarked on an extensive investigation, the 
objective of which is to describe as completely 
as possible the conditions of life and the be- 
havior of children in a particular small, mid- 
western town (Wright and Barker, 1950). The 
procedures of the total study and the theoreti- 
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cal framework guiding this work are too com- 
plex to be covered in detail in a chapter such 
as this. Their study raises some interesting 
problems and in many ways is a marked depart- 
ure from the prevailing usage of observers, and 
for these reasons is included in this survey of 
existing observer systems. 

In the sense that we have been using the term 
system, the observer in this study does not have 
a system. His job is to describe the behavior of 
the child being observed and the situation in 
which the behavior takes place. A brief, quite 
typical sample of a single record of the behavior 
of a seven-year-old boy named Raymond and 
of what happened to him is as follows: 

Stewart said pleasantly, “It will somersault if you 
let it go now.” Stewart seemed to sense Ray- 
mond's discontent and sought to placate him. 

Raymond agreed immediately that it probably 
would but listlessly let it go anyway. 

He watched closely as the car rolled over and over 
and stopped at the bottom. 

Raymond absently scanned the landscape. 

Noticing a large crate in another corner of the 
basement, he proposed, “Let's go over and play 
with that crate.” 

Stewart said flatly, “No.” 


(The wobbly old crate was constructed of sev- 
eral loosely connected. boards and measured about 
five feet by thirty by thirty inches. It was open 
completely at one end and the other sides had 
openings where the boards crossed. The openings 
were large enough for the children to climb 
through. The crate was resting on its open end.) 
Raymond hopped down from the bank into the 

excavation and walked purposefully over to 

the crate. 

Strenuously he pulled on the crate. He evidently 
intended to drag it to the place where the three 
boys had been playing together. 

The crate caught; he had to yank strenuously to 
move it at all. 


He looked at me and said, "I'm a big strong man." 
He grinned at me as he said this, making a joke 
of his ineffectiveness. 

Raymond absently chanted "Dum, Dum, Dum" 
as he tugged energetically on the crate. 

It stuck again on something in the weeds. 

He jerked quickly, leaning all his weight back- 
ward as he tried to pull the crate. 

At the crate came loose suddenly, he stumbled 
back a few steps. 

He looked at me just briefly. 

Raymond grunted as he tugged it with both 
hands. 

(Barker and Wright, 1951, pp. 344-345) 
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(2) Orientation of the observer. It is appar- 
c.t from the record above that no special ob- 
server skills are particularly demanded by the 
procedure; certainly none which are outside the 
repertoire of adults capable of getting along 
effectively in social situations. One of the ob- 
jectives, indeed, is to keep theorizing about be- 
havior to a minimum and out of the record. 
Occasionally, inferences about intentions are 
recorded but these do not spring from a pro- 
found analysis of behavior. The record pro- 
vides a running description of what a person is 
doing, what the situation is, and, by inference, 
what the situation, or parts of it, means to the 
actor. It also contains, as the example above in- 
dicates, relatively minor interpretations and 
descriptions of the way in which the action was 
accomplished, 

There are several other characteristics of the 
record which should be noted, There is a de- 
scription of the situation in which the behavior 
takes place and aspects of the physical environ- 
ment—the crate in the sample above. A single 
sentence covers only one action. In addition, 
every sentence which describes a particular ac- 
tion is placed alone in a paragraph. If more 
than one sentence is used to describe the same 
action, these are grouped into a paragraph. Sen- 
tences referring to different actions are never 
placed in the same paragraph. 

The purpose of these specifications concern- 
ing the written form of the record is to permit 
later designation of larger and smaller units of 
activity. The specified form of the record im- 
poses no limitation on this later task. “. . . its 
organization will not prejudice any attempt to 
discover where different units and sequences of 
action started and stopped, where interruptions 
occurred, where actions overlapped. ...” (Bar- 
ker and Wright, 1951, p. 10). 

The unit of observation, at least as it is rep- 
resented on the form, is a molar act. The con- 
text of each act is recorded rather than assumed 
as in the other systems we have discussed. The 
observer records as completely as possible every 
action together with the setting in which it 
occurs. In the current procedures, to ensure 
completeness and reduce ambiguity in the rec- 
ord, each observer dictates his observations into 
a wire recorder in the presence of another mem- 
ber of the research staff. This staff member asks 
questions about omissions or incomplete reports. 
A report, edited by the observer, is derived from 
the original report and the results of the inter- 
rogations. This report is, in turn, reviewed. by 
a second staff member for eliminating duplica- 
tions, and correcting meager accounts. A very 
detailed final report emerges from this process. 


(3) Subsequent analysis. It i» beyond the 
scope of this chapter to describe in detail the 
kinds of analyses to which these observer data 
are put. It is clear from the above description 
that in thís procedure the human observer pro- 
vides the raw data record. Translation of these 
observations into concepts which have meaning 
in terms of behavioral theory is done by others. 
The analysts working on the project have a 
complex and long set of categories for describ- 
ing the situation and the behaviors, The 
amount of agreement between analysts in their 
descriptions of situations and events in theoreti- 
cal terms and the extent to which tests of hy- 
potheses are possible on the basis of their cod- 
ing, do bear on our general topic of controlled 
observation. The results available are prelim- 
inary but indicate that fruitful conceptual de- 
scriptions are possible, with a high degree of 
agreement between coders, This conclusion sug- 
gests that often it may be highly desirable to 
work in this direction of total observation, with 
no categories of observation at all or only mini- 
mally conceptualized ones, leaving the coding 
into more abstract categories for later. This is 
particularly true when sophisticated observers 
are unavailable. It should be clear, however, 
that the success of the Barker-Wright procedure 
is no mandate for free and uncontrolled obser- 
vation. Their observers are very clear about 
what they are to record and how to record it. 

Description of social behavior in terms of 
interpersonal mechanisms. (1) Purpose and 
theoretical framework. Coffey and his associ- 
ates (Coffey, Freedman, Leary and Ossorio, 
1950) have developed a category system which 
has wide applicability. It has been used by ob- 
servers on the spot and by coders working from 
typescript (Freedman, Leary, Ossorio, and Cof- 
fey, 1951). The system differs markedly from 
several we have discussed, particularly because 
the resulting description of social behavior is 
in terms of interpersonal mechanisms. This is 
in contrast, for example, to the problem-solving 
system which yields descriptions of groups in 
terms of the intellective functions performed. 

The system itself developed out of a particu- 
lar approach to the problem of personality de- 
scription, This research group conceives of the 
total personality as consisting of three levels: 
the public, the conscious, and the private. The 
levels are defined by the sources of data which 
describe them. The public level of personality 
‘data consists of ratings or categorizations of 
an individuals behavior and its effects on 
others. These must be made by objective ob- 
servers and are independent of the individual's 
own statements, The conscious level of person- 
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A 


3 


3 


BOAST 
resist 
(actively) 
establish 
independ- 


take forcibly 
compete 


3 PUNISH 
challenge 
mock 
press 
threaten 


HATE 
condemn 
criticize 
disapprove 
disaffiliate 
attack 


COMPLAIN 
resist (passively) 
nag, sulk, 
rebel, 

disobey 


3 


DISTRUST 
demand 
accuse 
suspect 


3 retreat defer 
G withdraw obey 
yield 
apologize 


3 


I 


Fic. 1. Level I variables: interpersonal mechanisms with illustrative verbs. 
ratings: 3, intense or extreme; 2, average or appropriate; 


man, et al., 1951, p. 151.) 


ality data consists of what the individual says 
about himself and others, at a level of conscious 
awareness. He may "say" things about himself 
by means of ratings as well as in unstructured 
situations. Finally, the private level of data 
comes from assessment of projective materials 
such as dreams and TAT stories. 

The system described here is that which is 
required for description of personality at the 
public level. “The chief concept employed for 
this purpose is the interpersonal mechanism, 
which is defined as the interpersonal function 
of a unit of social behavior. The rater or ob- 
server asks the following question: What is the 
subject of the activity, i.e., the individual whose 
behavior is being rated, doing to the object or 
objects of the activity? The answer, e.g., he is 


DOMINATE 
direct 
command 


SUBMIT 


HANDBOOK OF SOCIAL PSYCHOLOGY 


3 


TEACH 
give opinion 
summarize 
clarify 


offer suggestion 
interpret 


SUPPORT ° 


pity 
sympathize 
reassure 


LOVE 
affiliate 
approve 
praise 


3 


COOPERATE 
agree, confide, 
conciliate, 
accept 
participant 3 


Intensity 
1, mild intensity. (From Freed- 


aggressing against them, or him, affiliating with 
them, teaching him, etc., is the interpersonal 
mechanism.” (Freedman, et al., 1951, p. 149) 

(2) Orientation of the observer. The frame 
of reference prescribed for the observer in this 
system is exactly that which the Bales system re- 
quires, “the role of the generalized other." The 
interpersonal mechanisms which comprise the 
final category system were arrived at after ex- 
tensive analyses of records, More than eight 
thousand units of verbal behavior were studied. 
The system is shown in Fig. 1. 

The fact that the categories are presented in 
the form of a circle is a significant part of the 
conceptual system. There are four nodal points: 
dominance, affiliation, submission, and hostil- 
ity (reading clockwise from the top). Each 
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category falls at the nodal point or is a com- 
bination of these nodal mechanisms. Thus, 
supporting is conceptualized as a blending of 
affiliation and dominance; rejecting is a com- 
bination of hostility and domination. 

In addition to categorizing the behavior, the 
observer is also required to make an intensity 
rating of the act on a three-point scale. A rat- 
ing of three indicates an extreme interaction, a 
rating of two an interaction of average or mod- 
erate intensity, and a rating of one is given to 
interactions of low or mild intensity. 

(3) Reliability. The reported reliability data 
do not permit evaluation of the degree of agree- 
ment, category by category. In terms of total 
scores in all categories, however, the degree of 
agreement is moderately high. When each of 
three trained judges’ scores was compared with 
the consensus of the other two, the percentage 
of complete agreement ranged from 68 to 77. 


OBSERVATION OF SOCIAL BEHAVIOR IN CHILDREN 


As we indicated in the brief history of sys- 
tematic observation, most of the pioneers in 
the use of observers worked with children. Sub- 
sequently a large number of observation systems 
have been developed applicable to the social be- 
havior of children. Only three of the many will 
be discussed here; the principal justification for 
the selection probably lies in their representa- 
tive character and their historical importance. 

Measurement of ascendance-submission in 
children. (1) Purpose and theoretical frame- 
work. This procedure was developed by L. M. 
Jack in connection with an attempt to modify 
the social behavior of preschool children, par- 
ticularly along the dimensions of ascendance- 
submission (Jack, 1934). To determine 
whether a particular training technique was 
effective in modifying the incidence of ascend- 
ant behavior, it was necessary to develop meas- 
ures of ascendance. After considerable prelim- 
inary work, eight categories of ascendant be- 
havior were specified: 


l. Verbal attempt to secure play materials. 

2. Forceful attempts to secure play mate- 

rials. 

Success in securing play materials. 

4. Defending, snatching back materials 

taken from his possession. 

Verbal attempts to direct behavior of 

companion. 

6. Companion complies with direction. 

Forbidding, criticizing, reproving com- 

panion. i 

8. Providing patterns of behavior which 
companion imitates. 
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These observational categories were used in 
an experimental situation where each child was 
paired with another child. The two children 
were together in a sand box which contained 
three sets of toys. 

By and large these categories are of the low- 
inference sort. The observer is called upon to 
note the occurrence of fairly clear-cut acts, de- 
scribed in terms of everyday language. The 
significant difference between this category sys- 
tem and earlier ones which also concerned 
themselves with such low-inference categories 
is that in this case the behavioral categories are 
conceptualized. Jack’s definition of ascendant 
behavior is represented in the eight categories. 

One other comment about the observer task 
is necessary. The observers using this system are 
selective in the observations. There are many 
behaviors in such a situation which fall outside 
the category system, e.g., singing to self, playing 
quietly with toys, striking toy, and so on. The 
observer is sensitized by the category system to 
look at certain behaviors and to ignore others. 

(2) The unit of observation. We mentioned 
in another connection that the definition of the 
unit of observation is usually circular; that is, 
the unit to be recorded is typically that behavior 
unit which is classifiable into a single category. 
This phenomenon of the category definition it- 
self defining the unit is especially clear in such 
a system as Jack's. An attempt to secure play 
materials is a unit, no matter how long it takes. 
The same sequence which involves both verbal 
and forceful attempts is two units. Finally, if 
the attempt is successful the same interaction 
period actually enters the system three times. 

We mentioned earlier also that an experi- 
menter often faces the choice of asking for low- 
inference observations, leaving conceptualiza- 
tion until later, or he can ask his observers to 
do the conceptualizing. In either case, before 
the total job is done satisfactorily, there must be 
a clear specification of behaviors which are to 
be conceptualized in a particular way. Jack's 
system illustrates this point nicely. Her cate- 
gory system can be looked at as a set of cues 
which another experimenter might have used 
to instruct his observers as to what he meant by 
ascendant behavior. The behavior descriptions 
specify cues which indicate that he should make 
an entry in the category “Ascendant Behavior." 
In case ratings were used, there would be a sub- 
fective weighting of these behaviors to arrive 
at a final rating. 

(3) Reliability and validity. The degree of 
agreement between observers on total scores 
assigned to children is quite high. Jack reports 
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a corrected Spearman Brown correlation coeffi- 
cient of .80. 

Jack reports three relationships which bear 
on the validity of the ascendance score. First 
of all, the total ascendance scores based on ob- 
servation correlated .81 with ascendance rat- 
ings made by teachers on a six-item rating scale. 
The rating scale items were in more general 
terms than were the behavior categories. Sec- 
ondly, the observation scores obtained in the 
experimental situation showed a strong rela- 
tionship to observational scores on a different 
set of categories in the playground situation. 
This second category system was oriented to- 
ward assessment of instances of insecurity and 
lack of confidence. It included such items as 
showing fear of companion's displeasure, ap- 
pealing to adult for help, appealing to adult for 
attention, showing fear of physical objects, and 
so on. These items appeared four times more 
frequently in the nonascendant group than they 
did in the ascendant group. Finally, the as- 
cendance score reflected the effects of a training 
program which was calculated to increase the 
amount of ascendant behavior. 

Domination and integration in children. 
(1) Purpose and theoretical framework. An- 
derson (1939) has done extensive work in the 
observation of the behavior of children and 
teachers. His research and the category system 
are concerned with two concepts: domination 
and integration. Dominative behavior he de- 
fines as a technique of response to others in 
which the individual resists differences, changes, 
and growth. The dominative person, according 
to this scheme, is rigid and inflexible, has his 
mind made up. He disregards the desires of 
others, uses commands, threats, or force to gain 
his objectives. Dynamically, this is the opposite 
of growth and is the expression of fear of im- 
pending change. Integrative behavior, on the 
other hand, represents maximal growth. Classi- 
fied as integrative behavior would be: respond- 
ing voluntarily and without coercion to differ- 
ences in others, finding common purposes 
among the differences. The person characterized 
by social integration achieves a change in struc- 
ture or function, in goals or purposes as a result 
of interacting with persons who are different 
from himself. One of the research. problems 
Anderson set for himself was that of determin- 
ing the relationship between teacher-child con- 
tacts along this dimension of domination-inte- 
gration, and the nature of child-child interac- 
tion along the same dimension. This required 
the development of an observational instru- 
ment for measuring dominative and integrative 
behavior in teachers’ behavior toward children. 
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A similar instrument was developed for observ- 
ing the interaction of children. 

In the case of the teacher-child observation 
procedures, the final system consists of 20 cate- 
gories; eight of the categories are classified as 
domination, nine as integrative, two as ambigu- 
ous, and one category for undetermined be- 
haviors. Each contact between the child and 
the teacher is classified into one of the twenty 
categories. The observation schedule for use 
in assessing children's responses to each other 
consisted of 12 categories, nine of which are 
classified as dominative and three as integrative 
behavior. Since seven of the domination cate- 
gories are identical to those which Jack classi- 
fied as ascendant behaviors, Table 3 contains 
both types of categories for the teacher-child 
instrument but only the integrative categories 
for the child-child set. 

(2) Reliability. Anderson reports a high de- 
gree of agreement between observers in the 
gross number of contacts in both systems, indi- 
cating that the problems of specifying the limits 
of a given contact are minimal. In the case of 
both the teacher and the child-child system, 
there is considerable agreement in the classifi- 
cation of dominative behavior. The domina- 
tion scores assigned by two observers correlate 
in the .80's. "This is also true for the integration 
score for children. "There is less agreement be- 
tween observers in the case of the integration 
score for teachers. Anderson reports correlation 
coefficients of about .50. He attributes this 
lower degree of agreement to the fact that judg- 
ments about integrative contacts are more in- 
fluenced by their context and hence less specific 
and more difficult to record. These reliabilities 
are also reduced in part by the fact that there 
was a relatively low incidence of entries into 
these categories. 

Anderson, in the monograph which describes 
his procedure, reports a large number of rela- 
tionships between integration and domination 
scores and other variables. His data indicate 
that the domination scores of two interacting 
children are highly correlated; domination be- 
gets domination. The same is true for integra- 
tion scores, These and other findings which 
make sense theoretically indicate that the pro- 
cedure is useful in the study of social behavior 
in children. 

Measurement of child behavior in terms of 
social stimulus value. (1) Purpose and theo- 
retical framework. Olson is one of the pioneers 
in the use of systematic observation in the study 
of children. This particular system developed 
out of an attempt to conceptualize personality 
in terms of the stimulus value of the individual 
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TABLE 3 


ANDERSON’S CATEGORIES OF DOMINATION AND 
INTEGRATION BEHAVIOR 


Teacher System 
Domination 

Determines a detail of activity or acts for the child 
in carrying out a detail 

Direct refusal 

Relocating, repeating, or placing children in dif- 
ferent relations tó each other or to property 

Postponing, slowing up the child 

Disapproval, blame, or obstruction 

Warning, threats, or conditional promises 

Calls to attention or group activity 

Rations material 

Lecture method (defining a problem or antici- 
pating a question) 

Questions: lecture method (one answer ques- 
tions) 

Perfunctory questions as statements (indifference) 


Integration 

Approval 

Accepts difference 

Extends invitations to activity 

Question or statement regarding child's expressed 
interest or activity 

Build up (helps child to better definition or solu- 
tion without giving final answer) 

Participates in joint activity with children 

Gives sympathy 

Gives permission 


Children System 
Integrative 

Shows common purpose by word or action (in- 
cludes attempts to cooperate and attempts to 
share or participate) 

Verbal request or suggestion to direct companions’ 
behavior or secure materials 

Complies with request or suggestion 

Sets pattern including gestures which companion 
imitates 


(Olson and Wilkinson, 1932). The emphasis 
is that an individual's personality is not com- 
pletely described in terms of the responses he 
makes to others but also by the responses others 
make to him as a stimulus. This approach led 
to the proposition that important aspects of 
individual A could be derived from recording 
the behavior of individual B toward A. 

In the specific study reported here, observa- 
tions of behavior were made in a classroom 
situation. The ultimate objective was to arrive 
at scores for the children in the room. What 
was observed, however, was the behavior of the 
children and the teacher. The unit of behavior 
to be recorded was: "such marked restlessness 
or inattention that the teacher feels the need of 
speaking to the child or recalling his attention 
by direct word, look, or gesture." This specifi- 
cation required that the observer single out 
specific teacher behaviors of the sort described 
in the definition and record only under certain 
stimulus conditions, i.e., certain behaviors on 
the part of the child. During the course of the 
year 200 observations were made, at various 
times throughout the day. The total score for 
each child consisted of the number of occasions, 
in the 200 observation periods, in which he 
stimulated the teacher to respond in the way 
described above. 

'The system requires a minimum of sophisti- 
cation from observers, at least as far as the con- 
ceptual definition of the behavior to be re- 
corded is concerned. There is little inference 
required. Since the observer records what is 
essentially an interaction of a specific sort, there 
is strong emphasis on the context of the act of 
the teacher, The significant context is the ante- 
cedent behavior of the child. The unit to be 
recorded is clearly specified. 

(2) Reliability. As a result of the low degrec 
of inference and the clear specification of the 
unit, there is no problem in getting agreement 
between observers. Total contact scores ob- 
tained by two observers correlate .98. 


ILLUSTRATIVE EQUIPMENT Devices 


Along with the efforts to develop observa- 
tional systems, there has been considerable 
experimentation with apparatus to aid in one 
or another aspect of the total observational pro- 
cedure, In general, equipment devices have 
been used to deal with three types of problems: 
improving the efficiency of recording on-the- 
spot observations, collecting data during the 
course of the social interaction, and preserving 
a record of the behavior for later analysis. The 
principal difference between the last two is that 


the objective of the devices aimed at data col- 
lection during the interaction is typically that 
of dispensing with the observer. In the other 
cases, the main purpose is to increase observer 
efficiency, extend his areas of observation, and 

romote as much recall of the situation as pos- 
sible. This is admittedly a rough classification, 
and some of the devices which have been used 
on one problem can be used to deal with an- 
other; particularly is this true of devices which 
are here classified in the last category. 


~ 
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DEVICES FOR RECORDING 
ON-THE-SPOT OBSERVATIONS 

There have been a large number of different 
solutions to the problem of helping the observer 
record his observations, ranging from simple 
data sheets listing the category labels or rating 
scale items to machines which provide the data 
sheet and a consecutive time record of the inter- 
action process. 

Bales’ and Gerbrands’ interaction recorder. 
This recorder was developed for use with the 
Bales category system. It consists of a case con- 
taining a machine which drives a wide paper 
tape (Bales and Gerbrands, 1948). The list of 
categories fits on the top of the case. On the 
right side of the list, the moving tape is exposed. 
When a score is put down, the tape moves 
under the checklist and disappears, leaving the 
tape free for another scoring notation. There is 
a marker inside the case which indicates the end 
of each minute. In addition, a counter indi- 
cates the total elapsed time. The panel of the 
case also contains a ruby light which flashes on 
at one-minute intervals. This is the signal for 
the observer to observe all the members for 
expressive tension behavior which might other- 
wise be missed in the observer's concern with 
more overt interaction. Mechanical details and 
directions for construction have been published 
by Bales (Bales, 1948) . 

Bales and his associates have also experi- 
mented with dictating the category and the 
who-to-whom designation into a sound record 
which is synchronized with the record of the 
group’s verbal interaction. This procedure is 
feasible, but it requires a soundproof room for 
the observers and it sometimes requires the ob- 
server to speak while he is listening, There is 
the additional disadvantage that an hour of 
recording requires an additional hour of tabu- 
lation time. 

Stenotype recording. Carter and his associ- 
ates have worked extensively on techniques for 
improving the efficiency of recording on-the-spot 
observations. This group started with the prop- 
osition that any form which requires the ob- 
server to write necessitates looking away from 
the group and hence creates the possibility of 
missing important interaction. This is espe- 
cially a problem with the Carter system since 
it requires categorization of nonverbal behavior. 
One of the devices they have used to meet this 
and other difficulties has been the Stenograph 
or Stenotype machine (Carter, 1951b). This is 
a small manual printing machine used by court 

stenographers and others for taking dictation. 
Its keyboard permits the printing, either singly 
or in combination, of the numbers from 0 to 9 
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and a number of the letters of the alphabet. As 
a key is depressed, a two-and-one-half-inch paper 
tape moves forward one line. This basic unit 
has been modified to print a time line auto- 
matically every half-minute by depressing one 
of the keys. There have also been minor modi- 
fications of the keyboard. 

The basic unit and the modifications are rela- 
tively inexpensive and the recording tape is very 
cheap. The tape can be used immediately for 
tabulation purposes. For optimum use the 
observer must have memorized the category 
system and must have mastered the keyboard. 
The unit can then be used by "touch," as with 
a typewriter, and the observer can watch the 
group continuously. The machine can, of 
course, accommodate many other category Sys- 
tems besides the one with which it has been 
used. The unit is small, quiet, and easily port- 
able. Observers who have used the system re- 
port that it easier than techniques which require 
dictation, 

IBM cards. Lippitt and Zander (1943) de- 
veloped procedures whereby the observers re- 
corded their observations directly onto IBM 
cards. In the extensive investigation in which 
this was done, there were different cards for 
each area of observation: one for leader be- 
havior, another for total group behavior, and 
so on. Each observer assigned to a particular 
area was equipped with copies of the appropri- 
ate card. These were placed in a notebook 
along with code sheets which explained the 
abbreviations mimeographed on the card itself. 
Each abbreviation stood for a type of behavior 
to be recorded. The observer recorded the 
occurrence of the behavior with a special graph- 
ite pencil by drawing a line through the num- 
ber in parentheses which was opposite the ab- 
breviation on the card. 

This procedure has advantages, particularly 
in the tabulation and analysis phase of research 
operations. The cards can be fed into a "mark: 
sensing” machine to produce punched cards 
which can be used for analysis by other IBM 
equipment. These advantages, however, accrue 
only when the data collection job is a large one. 

Chapple's Interaction Chronograph. Chapple 
and his associates have developed an elaborate 
machine for recording the data required by his 
system (Chapple, 1949) . The machine provides 
a time record of interaction of two persons as it 
is observed by the operator. It also acts as a 
computer, providing summary data on the prin- 
cipal variables in the system. The present ma- 
chine has 10 columns, each produced by à 
printing counter, plus a signal counter. From 
the material fed into and summated by the ma- 
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chine it is possible to get complete scores for 
individuals separately, as well as scores which 
characterize their interaction over time. The 
Interaction Chronograph is not easily moved. 
Chapple has developed a portable recorder 
which provides a paper tape record which can 
be utilized by the computer. In contrast to 
some of the other devices which have been de- 
scribed, the Interaction Chronograph is de- 
signed specifically to provide the data required 
by the Chapple system and does not have gen- 
eral applicability as a recording device. It is, 
of course, a specific application of the basic 
chronograph recording device which does have 
general applicability. 


DEVICES FOR COLLECTING DATA DURING 
SOCIAL INTERACTION 


Strictly speaking, the kinds of equipment to 
be discussed here are not observational, since 
the human observer does not provide the data. 
They are instruments for obtaining data from 
the actors themselves during the social inter- 
action process. 

While there are many variations, with dif- 
ferent recording devices being utilized, the basic 
characteristic of this type of equipment is that 
the group member is given an opportunity, by 
pushing a button or pulling a lever, to register 
his reaction to some aspect of the social scene. 
Thelen and his associates, who have done the 
most thorough and extensive work in this gen- 
eral area, have developed a device to record the 
feeling states of group members which illus- 
trates this general type of mechanism (Thelen 
and Whitehall, 1949; Thelen, 1950) . The group 
members are trained to push a multiposition 
lever to indicate their feelings during the meet- 
ings. The buttons are connected with lights 
which flash on a board which may or may not 
be visible to the members themselves, depend- 
ing on the experimenter’s interest. The posi- 
tion of the lights can also be photographed at 
intervals during the meeting. Thelen has also 
hooked up the buttons with pens which provide 
a written record of member reactions. 

Experience with this sort of equipment indi- 
cates that considerable training is needed in 
order for the subjects to remember to use the 
buttons or levers. Practice is also necessary to 
reduce interference with the group process. 
When there are complex response categories in 
terms of which the member must respond, these 
too must be learned and practiced, At the pres- 
ent time, there is no real evidence concerning 
the extent to which the use of these devices 

-modifies the group process. Their maximal 


value would seem to accrue when used in con- 
nection with interviews of the participants and 
sound recordings. They are also valuable when 
the possible effect on the process itself is of 
minimal importance, as in training situations, 
for example. 

Thelen and his associates have also done some 
work using physiological measures during group 
interaction. They have obtained G.S.R. and 
pulse rate records along with recordings. There 
are technical as well as theoretical complica- 
tions in connection with collecting these kinds 
of data and, again, we have little evidence to 
indicate their usefulness. 


DEVICES FOR PRESERVING A RECORD 
OF THE BEHAVIOR 


We are concerned here with the fact that 
often it is desirable to obtain as complete a rec- 
ord of the behavior as possible. This may be 
necessary because observation is impossible. It 
may be desirable to check on the adequacy of 
on-the-spot observation by comparing it with 
categorizing or rating done later from the rec- 
ord. A permanent record also permits more 
analysis and study than is possible from a single 
exposure to the phenomenon. There is another 
occasion making a permanent record desirable. 
Sometimes it is impossible to make a judgment, 
or categorize a statement, until time has elapsed 
and the subsequent events have been noted. 
Thus, for example, a judgment of the relevance 
of a contribution might well have to wait until 
the final nature of the solution is known. 

The most common device used to meet all or 
one of these objectives is that of sound record- 
ing. There are many recording devices now 
available, ranging in price and varying in their 
characteristics. By and large, it is possible to 
get adequate recordings, at least in terms of 
understandability, without special equipment 
such as soundproof rooms or other acoustical 
devices. Sound records have undoubted value 
in the study of social behavior, but the value is 
not without limits. The limits become more 
and more apparent the closer one moves toward 
dimensions of social interaction which depend 
on facial expressions, postural changes, and in- 
flections of voice. The records often present 
identification problems when it is necessary to 
know who is speaking and to whom the remark 
is addressed. This difficulty can be overcome, in 


® part at least, by having a carefully timed abbre- 


viated stenographic record which contains iden- 
tifying statements as well as the who-towhom 
data. 

Closely related to the use of sound records is 
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the use of typescripts. These are verbatim 
typewritten records of verbal interaction taken 
from the sound record, typically with little in- 
terpretation on the part of the person making 
the typewritten record. These have had exten- 
sive use. Steinzor's category system, referred to 
above, was applied exclusively to typescripts. 
Experience has shown that typescripts are par- 
ticularly useful in preliminary training of ob- 
servers in the use of categories. There is unpub- 
lished evidence to indicate that when the cate- 
gory system involves very little inference, as in 
the case of the problem-solving categories above, 
there is no significant difference among the 
records obtained by observers who worked on 
the spot, by observers who worked directly from 
the recording, and by coders who worked from 
the typescripts. 

One of the main advantages of the typescript 
is that it facilitates total recall of the situation 
and hence promotes re-entry into the data. The 
principal disadvantage is the cost and time in- 
volved in the typing process. Thelen (1950) 
reports that it takes from 7 to 10 hours to type 
out an hour of discussion. 

Another possibility, although only rarely used 
up to now, is the use of photography, either 
motion picture or still. Thelen has used a 35mm 
camera which automatically photographed the 
actors at specified time intervals. This provides 
a periodic sampling of all overt behaviors such 
as locations of individuals, postures, facial ex- 
pressions, activity of the group, changes in work 
group composition, and so on. Photographs 
have also been used by cultural anthropologists, 
notably Mead, to record characteristic features 
of social interaction in a society (Mead and 
Macgregor, 1951; Bateson and Mead, 1942). 

Motion pictures have also been used to a 
limited extent. Lippitt (1940) photographed 
critical episodes in the life history of the 
groups exposed to different styles of leader- 
ship. Both still and motion pictures have pri- 


PROBLEMS IN THE DEVELOPMENT OF AN OBSERVER SYSTEM 


In this section we shall discuss some of the 
recurrent problems in the development of an 
observer system and its successful application to 
the data collection situation. This section is an 
extension of the earlier section concerned with 
dimensions of observer systems. 


RELIABILITY 


There is no need here to emphasize the impor- 
tance of reliability as an attribute of a measur- 
ing instrument. Indeed, in the development of 
most of the systems described earlier a great 
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marily been used by researchers to document 
their observations or supplement data collected 
in other ways rather than as sources of data on 
which observations are made. There are real 
possibilities here, however, especially with mo- 
tion pictures accompanied by a good sound 
track. Unfortunately, the expense involved in 
such a procedure makes it unlikely that these — 
hypothesized values will get much of a chance 
to demonstrate their presence or absence. 

Even a brief review of the devices that have 
been used in connection with observation of 
small groups indicates that their primary value | 
lies in the extent to which they increase the skill, 
efficiency, and thoroughness of the human ob- | 
server. In most cases they are aids for him, not 
substitutes. 

There are many unanswered questions con- 
cerning the use of instruments. This is true for 
instruments which require direct use by the 
subjects themselves as well as those which are 
merely present and visible, as in the case of 
recording or photographing equipment. We do. 
not yet know the extent to which they interfere 
or modify the nature of social interaction. More 
precisely, we do not know very clearly the con- 
ditions under which we should expect effects. 
We do know that there are circumstances, not- | 
ably when the subjects become thoroughly ac- 
customed to the equipment and its purpose is 
explained to them in a clear-cut, nonanxiety- 
producing way, under which the effects are 
apparently minimal. 

Unfortunately, purely methodological re- 
search of this sort is not very exciting and is 
not likely to be done systematically. It is pos- 
sible, however, to formulate methodological 
problems so that they deal with basic theoretical 
problems. This has been done partially in con- 
nection with the effect of observers on social 
interaction. When similar attempts are made in 
connection with devices, we may expect some 
answers to the questions which they pose. 


deal of time was spent in efforts to arrive at 
reliable scores. Typically, reliability is defined 
in terms of degree of agreement between inde- 
pendent observers. The statistical indices which 
have been used to represent amount of agree- 
ment have varied considerably, but the most 
common has been the correlation coefficient. 
There are many circumstances where other in- 
dices are superior, but consideration of these — 
questions falls outside the scope of this chapter. 
We propose here to discuss some general factors - 
which influence the degree of reliability at 
tained. 3 
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It seems useful to point out, at the outset, that 
there is only one score whose reliability need be 
assessed, and that is the score which is actually 
to be used in the analysis. Thus, for example, 
when group members need only be ordered in 
terms of the extent to which they showed a 
particular behavior during a meeting in order 
for a hypothesis to be tested, it is the reliability 
of the total scores for each individual which 
must be determined. It often happens that the 
reliability of this score is quite high, even 
though the degree of agreement between ob- 
servers in their categorization of single units of 
behavior is much lower. This typically happens 
when differences in coding of single units are 
randomly distributed among the categories. 

'The general evidence on reliability of ob- 
server scores indicates that the less inference 
required of observers, the higher the degree of 
agreement. With category or rating systems 
which require a good deal of inference, it is 
clear that the most reliable are those in which 
the dimensions are clearly defined and the cues 
to be used by observers are specified. The 
observational load is a factor affecting reliabil- 
ity. Other things being equal, the more the 
observer has to do, the lower the reliability. 
Some preliminary results of work done by stu- 
dents of the authors indicates that there is less 
of a decrement in reliability when additional 
observations require the same level of inference 
than when the same number of categories are 
added which require more inference than the 
original set. 

The reliability of some systems is low because 
of unclear definition of the unit. Even when 
the unit to be rated or classified may be fairly 
clearly stated, the observers may nevertheless 
disagree as to unit boundaries in actual observ- 
ing situations. It is obvious that degree of agree- 
ment as to coding can only be appropriately 
assessed when there is agreement first as to what 
is to be coded. 

Up to the present time, particularly with 
category or rating systems which require some 
inference, the degree of reliability attained is 
very much a function of the amount of training 
which observers have had. The thoroughly 
trained observer has become maximally familiar 
with the definitions of the categories or dimen- 
sions. He has gotten practice in designating 
the unit of observation. He has gained skill in 
adopting the frame of reference which the sys- 
tem demands, and has practiced his art in the 
situation in which the behavior takes place. 
His own private definitions have been elimin- 
ated or have come to be shared by others using 
the system. In short, virtually all the sources 
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of unreliability have been dealt with in the 
training proces. Lippitt and his associates 
have found that this sort of training is maxi- 
mally effective when (1) the observers assist 
in the development of the observer system, and 
(2) the training is concentrated on correct clas- 
sification of marginal or difficult units rather 
than upon items which clearly fall into a given 
category. 

Ideally, every observational system ought to 
be developed to the point that it can be com- 
municated to other social scientists and applied 
reliably by them. Only in this way can we really 
say that an instrument has been developed. This 
does not mean that the instructions, definitions, 
and procedures must be simple; it does mean 
that they should be readily shared, and with a 
minimum of ambiguity. Unfortunately, this 
state of affairs is not yet with us. The typical 
experience is for one experimenter to try the 
observational procedures of another and then 
give up the attempt because of a failure to attain 
sufficient reliability. This has not always been 
due to insufficient clarity of presentation. There 
is often a failure to communicate some of the 
more subtle aspects of the procedure which are 
a part of the technique of the originator. It is 
also quite likely that some of the failures to 
reproduce the success of the developer of the 
system have been due to insufficient training 
time. There are certain factors which affect the 
reliability of sources which no amount of speci- 
fication will enable the observer to handle. The 
observer must have the chance to develop sensi- 
tivity to the dimensions of group process under 
scrutiny. He must learn to cope with the speed 
of interaction which so frequently overwhelms 
the inexperienced. To deal successfully with 
the speed of the interaction process, the cate- 
gory system itself must no longer be strange. 

Because of the difficulties involved in obtain- 
ing satisfactory reliability and the responsive- 
ness of reliability scores to training, it is virtu- 
ally mandatory that reliability checks be run 
prior to securing the research data. Reliability 
figures obtained after considerable practice may 
have no bearing at all on the reliability of the 
original scores. 


VALIDITY 


It has been pointed out by Cronbach (1946), 
Guttman (1950), and others that the term 
*validity has a variety of meanings. In one sense, 
the validity question means: Does the observer 
score measure what it purports to measure? In 
another sense, the question of validity asks: 
Does the observer score predict anything? 1f 
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we accept the first meaning of the term, the 
question of validity of observer scores has been 
relatively ignored by those people most active 
in using observers, 

One of the reasons for the neglect has been 
that often the categories require very little in 
the way of inference. They have what has been 
called “internal validity" (Guttman, 1950). No 
one seriously doubts that the score in the cate- 
gory “asks question” represents what it is sup- 
posed to represent, least of all the people using 
the system. The definition of the category is 
universally shared and there is common accept- 
ance that the average person knows a question 
when he hears one. 

The problem is much more complex in the 
case of category systems which involve infer- 
ences as to intent or motive or feeling of the 
actor or recipient. It is a quite legitimate ques- 
tion to ask whether the entries in the “feels 
hostile” category really represent the degree of 
hostile feelings present. It is rare, however, that 
a really satisfactory independent measure of the 
same variable is available with which the ob- 
server scores can be compared. It is this diffi- 
culty which has prevented much work on the 
validity of observer scores. 

There is at least one solution to the problem 
of determining the validity of an observer score 
in the absence of an adequate external, more 
acceptable, criterion score. This solution lies 
in determining the extent to which the scores in 
question behave as they are supposed to behave 
as determined by a well-formulated and docu- 
mented theoretical position. To illustrate, let 
us take a study reported by Thibaut and Coules 
(1952). These experimenters were concerned 
with the effects of variation in opportunity to 
communicate hostile feeling on the amount of 
residual hostility. To determine the relation- 
ship, they coded the content of written docu- 
ments into categories having to do with the 
nature of the feeling and the referent of the 
feeling. Let us suppose for purposes of our 
illustration that the proposition that oppor- 
tunity to communicate directly to the instigator 
of frustration reduces the incidence of hostile 
responses had already been supported by other 

studies, using other assessment techniques. 
Then this same design could be used, because 
of the support independently provided for the 
theory, to test the validity of the content coding. 
This same thing could, of course, be done with 
observer scores. Unfortunately, it is difficult to 
find such well-articulated and documented the- 
ory which can be used to validate an observa- 
tional procedure. Nevertheless, this procedure 
is often more likely to prove useful than a 
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search for an adequate independent measure of 
the same variables. 

If we turn to the second meaning of the term 
validity, the picture is much brighter. There 
is no need to enumerate all of the instances in 
which observer scores have been found to cor- 
relate with other measures. It has been possible 
to achieve statistically significant relationships 
between participant satisfaction with meetings 
and observer ratings of the cohesiveness of the 
group, and the amount of self-oriented need 
behavior, for example (Guetzkow, 1950). Ob- 
server scores distinguish the behavior of group 
members subjected to different kinds of leader- 
ship, and so on. Indeed, it is the promise of 
these kinds of results which is responsible for 
the energy which is now being expended in the 
development of observer systems. In spite of 
this general picture of moderate usefulness of 
Observer scores, it cannot be said that there is 
evidence that all of the systems which have 
been used have such predictive efficiency. It 
seems fair to say that those which have such 
support tend to be of the higher-inference sort. 
The experience of Thomas (19332) and her 
associates indicates that it is possible to get 
extremely high reliability about unimportant 
matters. This would suggest that mere demon- 
stration that the system can be used reliably is 
not sufficient warrant for publishing it abroad. 
Evidence should be provided quickly that the 
procedure is useful in prediction of some sort. 


SAMPLING 


The question frequently arises as to whether 
the observer should be asked to record all of 
the behavior in a given period or whether some 
sampling should be specified. There is a wide 
variety of ways in which selection can be made 
from the total behavior period. The observer 
can systematically rotate his attention among 
all the members of the group, making each one 
the subject of observation for a specified period. 
Or he can select certain members and ignore 
others. Another possibility is to observe all 
members, but at specified times, rather than 
continuously. There are many others. 

All sampling problems, to be handled suc- 
cessfully, require a knowledge of the universe 
from which the sample is to be drawn. They 
also require, under ideal conditions, that the 
theory guiding the research be well formulated. 
At the very least, it is necessary to be clear about 
the assumptions one is making when one 
samples in any of the possible ways, and the 
effects on the data if these assumptions are not 
met, Unfortunately, we know so little about 
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social interaction that sampling procedures are 
virtually always risky. In fact, one of the prin- 
cipal objectives of developing procedures of 
social observation is to describe social inter- 
action. It would seem desirable to make these 
procedures as all-encompassing as possible at 
the outset and leave questions of sampling until 
we have more empirical data on the basis of 
which they can be settled definitively. 

This is not to say that there are not con- 
ditions under which the investigation can be 
carried out successfully with sampling of 
people or of time or both. Anderson (1939) re- 
ports, for example, that there is a high degree 
of consistency in teacher behavior toward chil- 
dren from one time period to another, indi- 
cating that sampling, using his system, is 
probably quite justified. In these cases, how- 
ever, it is the theory and the design which 
dictate whether sampling is sound in the cir- 
cumstances, and not propositions about sam- 
pling in general. 


PRE- AND POSTCATEGORIZATION 


Sometimes there is the possibility, in theory 
at least, of securing a complete record of the 
total behavior and doing the categorizing job 
on this recorded material. In this sense, post- 
categorization refers to coding (or rating) from 
a record rather than at the time the behavior 
takes place. Whether this is an adequate pro- 
cedure depends on the extent to which correct 
use of the categories necessitates cues which 
are not available because of the type of record 
used. Thus, for example, typescripts (type- 
written records of interaction, primarily verbal) 
typically do not provide information concern- 
ing facial expressions, voice inflections, postural 
changes, which often play an important role in 
determining judgments concerning emotional 
states, When the category system requires little 
use of such cues, the results obtained with on- 
the-spot coding and afterward from the type- 
script are very similar. 

There is another sense in which the term 
postcategorization is used. This second mean- 
ing refers to the practice of developing the cate- 
gory system in the process of studying some 
sort of total record. This seems to us to be 
almost entirely a historical question in the case 
of instruments which make some claim to being 
standardized. In these cases, there is an exten- 
sive history of total observation and total re- 
cording and the progressive refinement of the 
category system. The initial step is typically 
that of deriving the categories in part at least 
from the behavior itself. At various times in 
this sequence, categories are added or dropped 
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on the basis of what types of behavior appear 
and can be differentiated. There are, therefore, 
times in the life history of most observation 
schedules when they are essentially postcate- 
gorization systems. Indeed, any system which 
purports to be applicable to a wide variety of 
social situations must have that sort of history 
if its claim to such generality of use is to have 
any validity at all. 


THE EFFECT OF OBSERVERS 
ON THE BEHAVIOR ITSELF 


This is a problem which perturbs the non- 
user of observation techniques much more than 
it does those who have worked with them ex- 
tensively. There are undoubtedly many rea- 
sons for this differential interest in the issue, 
but certainly one of them is the common feeling 
among experienced users of observers that the 
observers have very little effect, if any. This 
belief is shared by experimenters who have 
worked in a wide variety of situations and with 
many different kinds of subjects. 

While this general belief is probably sufficient 
to encourage most people to go ahead con- 
structing observation schedules and using ob- 
servers, it is also true that our present state of 
knowledge on this issue is very inadequate. 
Whether or not the presence of the observer 
has an effect on the nature of the interaction 
depends undoubtedly on such factors as the 
nature of the group, the nature of the activity 
of the group, and the way the observer task has 
been structured to the group members. Un- 
fortunately, there has been little. systematic 
study of the relationship between these factors 
and the effect of observers. To supplement the 
following brief discussion the reader should 
turn to Chapter 21, where this problem is alsc 
treated. 

Deutsch (1949) reports that over a period of 
three weeks members of small groups became 
less and less aware of the presence of observers, 
until by the end of the period they were virtu- 
ally oblivious of them. In certain of these 
groups the activity involved cooperation 
among members, while in others the members 
were competing with each other. The com- 
petitive groups were more aware of the ob- 
servers than were the cooperative groups. This 
suggests, as indicated above, that the nature of 
Q4he activity is a factor affecting awareness of 
observers if not the social behavior itself. 

In contrast to Deutsch, Polansky et al. (1949) 
cite some experience which suggests that there 
are circumstances in which subjects being ob- 
served become more aware and less accepting 
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of observers over time. Their subjects were boys 
in a summer camp. At the outset, the boys 
accepted and tended to ignore the observers, 
who, they had been told, were trying to see how 
to improve the camp. By the third week, the 
observers were objects of attack by many of 
the boys. Polansky suggests that the ambiguity 
of the role of the observer made the boys 
anxious. When the observers became more 
friendly to the boys, the attacks were less fre- 
quent. This study indicates the importance of 
having the role of the observer clearly defined 
but defined in such a way as to have the mini- 
` mum impact on the behavior itself. Thus, for 
example, it seemed desirable to indicate to con- 
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ference groups in business organizations that the 
observers had no relationship to management, 
that no one was personally identified, and that 
the sole purpose of the observer team was to 
record what took place in the interest of find- 
ing out how business conferences were con- 
ducted. While this structuring was only part 
of the total procedure, it is interesting to note 
that almost all the members in 72 groups indi- 
cated that they felt the presence of the ob- 
servers had had no effect on what had taken 
place. Since these were groups which had a 
history of previous meetings, they were able 
to compare the meeting at which the observers 
were present with the earlier ones. 


Tue IDEAL CONDITIONS: ANALOGY TO PSYCHOPHYSICS 


Thurstone has explicitly applied traditional 
psychophysical methods. such as paired com- 
parisons and equalappearing intervals to the 
measurement of attitudes. As we mentioned at 
the outset, the relationship between human 
reactions and independently assessed stimulus 
properties has been most thoroughly investi- 
gated in psychophysics. It seems profitable for 
us to search this area for implications for the 
use of observers in social psychological research. 

By and large, experimental results in psycho- 
physics are very encouraging, since they show 
that the human being can make judgments 
about external events which bear an amazing 
correspondence to the relationships between 
these events as determined by independent 
techniques. The psychophysicists have also 
been able to isolate some of the conditions 
which affect this degree of correspondence. 
"These findings can be grouped under four ma- 
jor headings: 


1. When other conditions are optimal, the 
human being is more accurate when he makes 
ordinal judgments than when he makes judg- 
ments on an absolute subjective scale. Almost 
all psychophysical techniques require the 
judge to say whether one object is more or 
less of something than another. In general, 
human beings have shown themselves to be 
very sensitive to ordinal differences between 
objects which can be shown to correspond to 
ordinal differences determined by independ- 
ent techniques. 

2. Ordinal judgments are more accurate 
when there arc only two objects to be judged 
at a time. The greater the number of objects 
presented simultaneously for ordering, the 
more inaccurate the judgments. 

$. Ordinal judgments on two objects are 
most accurate when the objects of judgments 


are brought close together in space and time. 
With lifted weights, the longer the interval 
between the lifting of the standard and the 
comparison weight, the more inaccurate the 
judgment. In the judgment of the relative 
length of two lines, the smaller the space 
between them, the more accurate the ordinal 
judgment — up to a point at which they can 
no longer be distinguished as two lines. 

4. Ordinal judgments of two objects close 
together in space and time are most accurate 
when the objects are identical on all attri- 
butes except those being judged. Judgments 
of relative length are most accurate when the 
two lines are the same width. 


Psychophysicists are in the fortunate position 
of having an independent criterion for the ac- 
curacy of psychophysical judgments. For most 
of the attributes of interest to the social psy- 
chologist there is no such independent criterion. 
Nevertheless, it seems reasonable to assume that 
these same principles, if applied to social psy- 
chological judgments, should produce maximal 
correspondence to the objects of judgment. In 
any event, it is significant that these conditions 
are the very ones in which the human instru- 
ment experiences least frustration and uncer- 
tainty in performing his task. This should be 
an important consideration, since we do not 
want to damage this instrument any more than 
we have to. 

Let us suppose that we wish to apply these 
principles derived from psychophysics as di- 
rectly as possible to the gathering of data for 
the study of the decision-making conference. 
While the following discussion is written in 
terms of assigning scores to groups as units, it 
should be noted that it applies as well to con- 
ditions where individuals or periods of time 
are being judged. Let us also suppose that we 
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have unlimited time and money at our com- 
mand. With slavish adherence to these prin- 
ciples, what would the methodological pro- 
cedures be? 


Ordinal judgments are more accurate than 
absolute judgments, This would mean that 
we allow our observers only to rate “more” 
or “less,” never allowing them to assign num- 
bers to their ratings. 


The psychophysicists have found that such 
ordinal judgments are most accurate under 
optimal conditions. However, as one departs 
from these conditions, ordinal judgments be- 
come more and more inaccurate. Inter- and 
intraobserver variability rises. In the size 
constancy experiment, for example, if the two 
objects to be compared are so placed that one 
is in front of the observer and one behind, the 
observer will have to turn around for each 
judgment. This condition will produce a 
greater percentage of judgments which do not 
correspond to the “physical” relationship be- 
tween the two objects. 

Now the conditions of social psychological 
measurement in the field are often far from 
optimal in this respect. There may be a pro- 
longed period between observation sessions. 
When such conditions become bad enough, it 
may be that there comes a point at which ordi- 
nal judgments become more variable than abso- 
lute judgments, that is, conventional numerical 
ratings. It is conceivable that absolute judg- 
ments have a large but finite variability which 
is less than the variability of ordinal judgments 
when conditions of judgments depart far 
enough from the optimum. This is a matter 
for empirical demonstration, and if it turns out 
that there is such a point of diminishing re- 
turns, it will be a matter of decision in planning 
a study as to which method will be most accu- 
rate under a given set of conditions. 

When we consider what the task of the ob- 
server becomes when we depart from optimal 
psychophysical conditions, the rationale for 
this point of diminishing returns becomes ob- 
vious. The conditions are optimal when the 
observer can have both objects in his field at 
once so that he can quickly shift from one to 
the other in making his comparison. When 
the objects are moved far apart in time or space, 
it means that the observer must "remember" 
the first object while he is turning to the com- 
parison object. We are, in effect, asking the 
observer to compare a memory trace of the first 
object with the immediate experience of the 
second. With a longer interval, it is likely that 
this trace undergoes some changes; and there 
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is no reason to assume that these changes are 
in a constant direction. The traces are prob- 
ably affected to a large extent by what else 
happens in the interval, and the things which 
happen to one observer may be very different 
from the things which happen to another. 
Consequently, we would expect there to be 
more disagreement between observers as the in- 
terval between standard and comparison object 
is increased. Exactly the same argument holds 
for repeated comparisons of one observer. 

Now what is the situation when we ask an 
observer to make an absolute judgment, a rat- 
ing? Essentially, the task is the same as the one 
we have just described. We are asking him to 
compare an object in his immediate field with 
a trace which has been built up over a long 
period of time, a trace corresponding to 
"inches" or "brightness." If the memory trace 
of “inches” has been built up from a large 
number of experiences, one would expect that 
it will be affected relatively little by the addi- 
tional information given during the judgment. 
Jf the trace is a new one or one which has been 
established on the basis of relatively few ex- 
periences, we should expect it to be affected to 
a much greater degree by new experiences in 
the process of judging. The subjective scale for 
such an attribute would be much more vari- 
able, and consequently we should expect that 
we could depart from the optimal psychophysi- 
cal conditions a great deal more and still have 
ordinal judgments more accurate than absolute 
judgments. Again this is a matter for empirical 
determination. It would be easy to determine 
how variable a subjective scale is. In the light 
of this, one could have some criterion in terms 
of which to select the more appropriate method 
of judgment under a given set of conditions. 


Limit the objects of judgment to two. Ob- 
servers should make all their judgments with 
reference to one other group, person, or time 
period, and this other group, person, or time 
period should be the same for all observers. 


As a matter of fact, this is what is done ordi- 
narily, whether the observers are asked to give 
relative or absolute judgments. If the factors 
described above point to the advisability of 
using ordinal judgments, care should be exer- 
cised in the study design to make sure that the 
observers make their comparisons with refer- 
ence to one other meeting, and that this ref- 


* erence conference is the same for all observers 


at any one time of judgment. If the situation 
makes absolute ratings seem advisable, then 
care must be taken to make the subjective scales 
of the observers as congruent as possible and 
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to assure that the scales do not change with the 
passage of time. This is ordinarily done by 
defining a reference group, real or hypothetical, 
which is used as a base line for all ratings. 


Proximity of objects of judgment in space and 
time. The pairs of groups, persons, or time 
periods to be compared should be as close to- 
gether as possible. 


This is the condition which will probably be 
most difficult to fulfill, The judges cannot 
ordinarily schedule the groups to suit their 
needs. Obviously, the judges can’t observe two 
groups simultaneously, but the scheduling of 
groups to be observed should be as close as pos- 
sible. As has been pointed out above, this is one 
of the conditions which will determine which 
type of judgment will be most appropriate for 
a particular study. If long periods of time must 
intervene between observation sessions, then 
absolute judgments (conyentional numerical 
ratings) may contain less error than ordinal 
comparison judgments. If this is so, care must 
be taken to keep the reference norms for such 
ratings fresh in the judges’ minds. The object 
is to make the conditions approximate as closely 
as possible those of simultaneous comparison. 


Similarity of objects of judgment on all attri- 
butes not being judged. 


If accuracy of measurement is the main con- 
sideration, this rule has obvious implications 
for the selection of a sample of groups or per- 
sons to be observed. If the period of time 
elapsing between group sessions were not a 
limiting factor, then the way to achieve this 
condition would be by observing one confer- 
ence on successive meetings. This procedure 
would automatically keep many factors con- 
stant which would be almost impossible to 
control in any other way. With this technique 
we would have our observers make their com- 
parisons against the previous meeting, since 
this would be the one which was most “fresh” 
in their minds. Successive comparisons would 
be made on a sufficiently long series of meetings 
so that tests of significance could be made on 
the data so obtained. 

If it were found possible to do a study under 
these ideal psychophysical measuring conditions 
so that one had a series of ordinal judgments 
on successive meetings of one conference group, 
what sort of analysis could be done with the 
data so obtained? If more than one observer 
made judgments at each meeting, it would be 
possible to arrive at a conventional index of 
reliability on cach attribute. With such a series 
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of "more" or “less” judgments, we would be 
able to detect changes in an attribute which 
were monotonic, that is, changes which showed 
increase or decrease throughout the series of 
meetings. If the changes in an attribute were 
nonmonotonic, our data would show it, al- 
though we would not be able to specify the 
shape of this nonmonotonic function. The data 
would provide a basis for stating relationships 
between attributes. These statements would be 
functionally comparable to conventional cor- 
relation coefficients, but would not require 
either the more inaccurate methods of data 
collection, or the strenuous statistical assump- 
tions needed for the computation of a product 
moment coefficient. We could also compute a 
sampling error for the obtained relationships. 

These data, it is evident, offer about as many 
possibilities for analysis as do the conventional 
numerical ratings, although no index can be 
computed from them which requires the as- 
sumption of a distribution function. This is 
not a serious limitation, since there is a rapidly 
growing set of statistical devices which involve 
no assumptions about distribution functions. 
(See Chapter 8.) 

Undoubtedly this movement of a research 
design in the direction which makes observer 
data more satisfactory is bought at the cost of 
a reduced generality. The results of a study of 
a hundred different groups can be generalized 
more safely than the results obtained from a 
study of a single group one hundred times. 
This is unquestionably true. There are, how- 
ever, other ways in which this optimum con- 
dition of similarity on the other attributes can 
be approached without a material loss in gen- 
erality. In many cases, these dissimilarities can 
be eliminated in the final design by selection of 
subjects; in still others, these influences can be 
minimized by making their effects an explicit 
part of the training procedure. 

The implications of the results of psycho- 
physical experimentation have been developed 
in the above discussion in a setting of observers 
who were called upon to make judgments of 
group properties. Each of them can with equal 
merit be applied to observation of individuals. 
Indeed, as in the case of optimal conditions 2 
and 3, the task of implementing them is easier 
when the subject of observation and analysis is 
the individual. 

Finally, it should be noted that these con- 
ditions of optimal use do not eliminate the 
need for careful definition of the dimensions 
of observation and the equally thoughtful ex- 
amination of the other factors affecting suc- 
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cessful use of the observer which have already 
been mentioned. The discussion of the impli- 
cations of psychophysics calls attention to pos- 
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sible criteria in terms of which to evaluate 
possible procedures for the use of observers. 


CONCLUSION 


The observer is now an established member 
of the social science research team. His services 
have been called upon in a wide variety of 
situations. He has done everything from count- 
ing the number of times a person says "We" to 
making judgments about motivation. By and 
large, the observer has earned the attention and 
respect he is currently receiving. The data he 
provides are increasing our understanding of 
complex social psychological phenomena. In 
the instances where observation has failed to 
provide useful data, the difficulties are usually 
found not in the individual observer but rather 
in the design of the study, the failure to define 
clearly the area of observation, and inadequate 
training. 

There is a rapidly growing library of obser- 
vational systems applicable to many areas of 
social behavior. They are becoming increas- 
ingly sophisticated, methodologically. One of 
the most serious barriers to scientific advance 
has been the fact that these systems tend to be 
used only by their authors. Other researchers 
working in the same areas are prone to develop 


their own procedures. The result is that the 
problem of comparability of findings is compli- 
cated. It seems reasonable that some of the 
systems now available, with respectable his- 
tories, ought to be used extensively in much 
the same way that established tests and standard 
learning tasks are used in other areas of social 
science. 

Finally, there are many important methodo- 
logical questions still to be answered. This is 
primarily because almost no one who has 
worked in the area has been primarily a meas- 
urement methodologist. Most of the methodo- 
logical sophistication has resulted from being 
forced to confront technical problems en route 
to finding answers to theoretical questions. In 
spite of the lack of strictly methodological ex- 
perimentation, there is an increasing awareness 
of the critical questions which must be asked in 
the development of an observer system. There 
is, in the history of systems already developed, 
much to guide the scientist who feels obliged to 
develop his own. 
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CHAPTER 11 


Sociometric Measurement 


GARDNER LiNDZEY AND EDGAR F. Borcatra* 
Harvard University 


The web of interpersonal relations, the at- 
tractions, repulsions, and indifferences that 
characterize individuals in daily interaction, the 
informal organization of groups, the social 
status of individuals — all these are of interest 
to the social psychologist, and aspects of all 
may be measured effectively through the use of 
sociometric techniques. These instruments are 
designed specifically to provide a sensitive and 
objective picture of the interpersonal relations 
existing within a group. Consequently, they 
are of singular importance to the empirically 
oriented social psychologist, whether he is in- 
terested in theoretically derived or applied re- 
search, 

Use of the term “sociometry” has been sur- 
rounded by considerable disagreement and 
debate. Its similarity to “biometrics” and “psy- 
chometrics" suggests that it might refer to all 
measuring devices that deal with social be- 
havior. Consistently with this, Moreno (19432) 
has urged that sociometry should not be limited 
solely to instruments he has developed. More- 
over, Bain (1948) indicated that the term 
properly should refer to all attempts to measure 
social behavior, and Chapin (1943, 1940), 
Lundberg (1943), and Sanderson (1943) ex- 
press general agreement with this opinion. On 
the other hand, the introduction and persistent 
use of the term by Moreno has led to common 
association in the minds of most social scientists 
between "sociometry" and only those tech- 
niques introduced by Moreno. In modern texts 
of social psychology the term is used without 
exception in the more restricted sense. Thus, 
authorities argue one thing, common usage an- 
other. We shall bow to common convention in 
our discussion and consider the term sociometry 
appropriate to designate a limited range of 
measuring devices, all of which stem primarily 
from the work of Moreno. 3 

Moreno (1934) refers to the measures with 
which we are concerned in this chapter as so- 
ciometric tests, He reserves the more general 
term, sociometric techniques, to refer to à 
cluster of devices, including the spontaneity 
test, the various types of role tests, and the 


acquaintance test, in addition to the sociometric 
test. Although we use the terms test, technique, 
and measure interchangeably, our interest will 
be limited to those measures Moreno designates 
as sociometric tests. Further, the scope of this 
chapter prohibits consideration of the theoreti- 
cal formulations which have been commonly 
associated with sociometry and Moreno. 

The focus of our interest is upon a small 
number of closely related techniques that may 
be used to elicit responses from members of a 
given group concerning the positive, neutral, 
and negative relations existing within the 
group. In the following sections we shall con- 
sider the conditions under which such responses 
are secured, the primary ways in which the 
resultant data may be analyzed, some common 
properties found in the analyzed data, and the 
areas of investigation in which such data may be 
expected to be of significance. 

What qualities do sociometric measures pos- 
sess that make them of special interest or sig- 
nificance to the social psychologist? While the 
detailed answer to this question is contained in 
the body of this chapter, there are a number of 
general points that may be suggested at this 
juncture. First, it is evident that the variables 
these measures represent are about as purely 
"social" as is possible. The fact that these vari- 
ables describe the quality of interpersonal rela- 
tions within groups makes clear that they deal 
with the basic data with which the social 
psychologist has been most interested, both 
conceptually and empirically. Second, the in- 
terdisciplinary popularity and appropriateness 
of these devices makes them of particular rele- 
vance to a subdiscipline that has developed 


*Much of the work on this chapter was carried 
out as part of a program of research on interper- 
sonal perception at Harvard University, financed by 
the Office of Naval Research (Contract No. N5-ori- 
07646). We are deeply indebted to many colleagues 
whose comments have helped in shaping this chap- 
ter. Especially helpful were the suggestions of Joan 
H. Criswell, Mary L. Northway, and Charles Proc- 
tor. 
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largely in the void existing between two parent 
disciplines. A striking feature of social psy- 
chology during the past decade is the increase 
in interdisciplinary interest and multidiscipline 
research. Such an orientation may be expected 
to have a fruitful outcome to the extent that 
individuals in the different disciplines are able 
to agree upon the mutual relevance and sig- 
nificance of data. In this connection, it is 
interesting that from the very beginning socio- 
metric techniques have been widely employed 
by both sociologists and social psychologists, 
and to a lesser extent by social anthropologists 
and psychiatrists. The development of these 
measures is associated with no single discipline 
but, rather, appears to have occurred while 
many disciplines have had access to their use. 
Consequently, sociometric devices seem ideally 
equipped to function in a field where there is 
much interest in cross-disciplinary integration. 
Third, and this is related to the interdiscipli- 
nary popularity of these techniques, is their 
capacity to represent individuals in interaction 
within a miniature social system. Current 
theoretical formulations, in addition to the 
demands imposed by many empirical problems, 
make it necessary for the investigator to study 
the individual and his social environment si- 
multaneously, The sociometric measures pro- 
vide a means of representing an important 
part of the individual's social environment as 
it is perceived by the subject. Recent emphasis 
on the importance of the "subjective frame of 
reference" or the "psychological environment” 
is thus satisfied by these techniques. Further, 
they provide an externalized view of the same 
events through the responses of other members 
of the group. Fourth, ease and speed of ad- 
ministration and a related lack of expense are 
important qualities in an era where much in- 
vestigation has become so expensive that only 
the wealthy or well-endowed may hope to com- 
pete successfully. "These are devices that the 
independent researcher can use effectively with- 
out large-scale resources. Fifth, increasing em- 

hasis upon action research, investigation 
where the findings have direct implications for 
the concrete events under study, gives further 
merit to sociometric techniques, as they lend 
themselves to the introduction of social change 
in an efficient and compelling fashion without 
seriously adding to the labor of the investiga- 
tor. Sixth, sociometric measures can be used in 
such a way as to provide a much higher degree 
of interest and motivation on the part of par- 
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ticipant subjects than is typical of most psycho- 
logical measuring instruments. The possibility 
that his environment may be manipulated so as 
to comply with his wishes encourages a high de- 
gree of interest and cooperativeness on the part 
of the respondent. Seventh, as we shall attempt 
to document later, sociometric techniques pro- 
vide relatively acceptable indices for a large 
number of empirical concepts that the social 
psychologist conventionally employs. Such con- 
cepts as prejudice, morale, social status, rejec- 
tion, etc, can be measured effectively through 
the use of these techniques. 

What other instruments in the psychologists' 
measurement battery show similarities to so- 
ciometric measures? Although there are some 
correspondences between these measures and 
attitude scales as well as questionnaires or 
inventories, the most important congruence 
seems to exist in relation to rating scales (P. N. 
Pepinsky, 1949; Jennings, 1950). It is easy 
to conceive of the sociometric test as a variety 
of rating scale — the members of the group are 
asked to rate or order the other members of the 
group in terms of their attractiveness or de- 
sirability for sharing certain activities. 

In spite of an underlying similarity, there are 
also important differences between these two 
approaches. Sociometric measures are more 
limited in the variables they can be used to 
assess and more restricted in the settings where 
they can be employed. On the other hand, 
there is no need to train raters to engage in 
sociometric ratings. The difficult and time- 
consuming task of attempting to produce com- 
mon frames of reference and homogeneous 
criteria in terms of which ratings shall be as- 
signed is avoided. The rater is asked to apply 
exactly those particular, unique, and sometimes 
irrational criteria he has spent a lifetime de- 
veloping. Everyone is an experienced or expert 
rater when it comes to sociometric judgments. 
Each of us has a vast body of experience in de- 
ciding with whom we wish to interact and whom 
we wish to avoid. Liking and disliking, accept- 
ing and rejecting are part of the process of 
daily living. Given the past experience of these 
raters and the meaningfulness of the activity 
they are engaged in, it is small wonder socio- 
metric devices have proven a powerful measure- 
ment tool in the hands of the social scientist. 
One might say that the individual who uses 
these techniques is taking advantage of the 
largest pool of sensitive and experienced raters 
that is anywhere available. 
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ORIGINS OF SOCIOMETRY 


The origins and development of sociometry 
are intimately linked with one man — J. L. 
Moreno. There are few instances where a single 
individual has exerted so pervasive and crucial 
an influence on the evolution of a social sci- 
ence area. Although it is clear that similar 
devices were used by other investigators prior 
to Moreno and that Moreno himself had 
worked with these devices before they became 
widely known, the important first step in the 
development of sociometry was the publication 
of Who Shall Survive? (1934). This volume 
reports a series of group studies carried out in 
public school and institutional settings using a 
variety of data-gathering techniques, including 
sociometric measures. The book is not a simple 
report of research, for it contains a mixture 
of speculative theory and applications of meth- 
ods and their results to problems of broad social 
significance. This fusion of procedural skill, 
speculative theorizing, and heavy concern with 
evaluative issues has remained typical of much 
sociometric writing up to the present. Follow- 
ing this initial presentation of the method, the 
next important event was the publication of 
the short-lived Sociometric Review (1936). 
This was superseded by Sociometry: A Journal 
of Interpersonal Relations, the first volume of 
which appeared in 1937-1938. This periodical 
has played a crucial role in subsequent de- 


velopments. Although the journal has been 
generous in publishing a large number of 
studies that bear no intimate relationship to 
Moreno's work, the over-all impression is cer- 
tainly one of an official or home organ. This 
has led to more of an air of sectarianism than 
might otherwise have appeared. In spite of the 
fact that the first two editors of the journal 
were respectively a distinguished psychologist, 
Gardner Murphy, and a distinguished sociolo- 
gist, George Lundberg, the expansive, persua- 
sive influence of Moreno was so great that most 
psychologists and sociologists tended to look 
upon individuals concerned with sociometry as 
having some of the cultish qualities that were 
characteristic of Rorschachers and psychoana- 
lysts. This set of conditions may have served 
to slow down the interpenetration of concep- 
tion and method in the various areas, but at 
the same time it performed the important func- 
tion of preventing sociometric techniques from 
becoming seen as tools that were the primary 
property of either sociology or psychology. 

Whatever tendencies to set sociometrists off as 
a group by themselves may still remain, it is 
clear by now that sociometric measures are 
widely used by social psychologists and sociolo- 
gists, and that some knowledge of these tech- 
niques would be expected of any trained 
individual in either of these areas. 


Wuart Is A SOCIOMETRIC MEASURE? 


In simplest terms, a sociometric measure is 
a means of assessing the attractions, or attrac- 
tions and repulsions, within a given group. It 
usually involves each member of the group 
privately specifying a number of other persons 
in the group with whom he would like to en- 
gage in some particular activity and, further, a 
number of persons with whom he would not 
like to participate in the activity. 

The most important requirements of the 
sociometric test generally advocated by Moreno 
are: 

(1) The limits of the group should be in- 
dicated to the subjects. The sociometric test 
places no restrictions on the persons within the 
group who may be chosen or rejected, but sub- 
jects should clearly understand the nature of 
this group. : d 

(2) The subjects should be permitted an 
unlimited number of choices or rejections. 
They should be encouraged to choose as many 
or as few of the group members as they wish. : 

(3) The subjects should be asked to indi- 


cate the individuals they choose or reject in 
terms of specific criteria. Each sociometric 
choice or rejection should be made with a par- 
ticular activity in mind, and the activity should 
be meaningful to the subjects. 

(4) Results of the sociometric questions 
should be used to restructure the group. More- 
over, the subjects should be told that their 
choices and rejections will play a decisive role 
in determining with whom they will associate 
in this activity. 

(5) The subjects should be permitted to 
make their choices and rejections privately, 
without other members of the group being able 
to identify the responses. 

(6) The questions used should be gauged 
to the level of understanding of the members 
of the group. With older children and adults, 
the questionnaire form of presentation is usu- 
ally appropriate. With younger children who 
are not yet facile with writing, individual re- 
ports to the administrator of the sociometric 
test may be most appropriate. With the very 
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young and others who do not have adequate 
control of language, responses to a sociometric 
question may be meaningless, although other 
instruments may function effectively. 

Except in cases where special empirical in- 
terests prohibit it, members of the group should 
have been in association for a sufficient time to 
permit the formation of affective ties and repul- 
sions within the group. If stability of choice 
and rejection is important, it is evident that 
this cannot be expected until the members of 
the group have shared experiences over a period 
of time, Again, if it is desired that the members 
of the group should have had an opportunity 
to interact with all other members, the group 
should be small enough to permit this. Lastly, 
the group should be so constituted that it will 
be possible to restructure the group in the light 
of the sociometric findings. 


VARIATIONS IN PROCEDURE 


The requirements outlined above identify 
the sociometric measure in a more or less pure 
form, and are generally in agreement with Mo- 
reno's definition. However, relatively few 
studies in this area, perhaps not more than 
a quarter, meet all these requirements, For 
other studies, especially for those that do not 
permit restructuring of the group or do not 
ask for choice in terms of specific criteria, the 
terms “near-sociometric” and "quasisociomet- 
ric" have been used. We do not maintain this 
distinction here, although we emphasize the 
importance of reporting in detail all the rele- 
vant conditions of any study. There are many 
circumstances where modified sociometric tech- 
niques are the only ones that can be used, and 
others where modifications lead to more mean- 
ingful results. 

One of the most frequent modifications is in 
the limitation upon the number of choices the 
individual may make or, more accurately, the 
specification of the number of choices the in- 
dividual is required to make. This alteration 
has certain obvious disadvantages in that it 
reduces the range of interpersonal reactions 
that the measure assesses. Eng and French 
(1948) found that when they used a ranking 
derived from a paired comparison of each mem- 
ber in the group as a criterion, unlimited 
choices showed a closer correlation to this cri- 
terion than either three or five choices. In 
certain circumstances it is of considerable inter- 
est to examine the individual variability in 
number of choices the subjects make when there 
is no limit provided. Further, if we wish to dis- 
tinguish between the unchosen person and the 
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isolate, the isolate being the person who neither 
chooses nor is chosen, we need an unspecified 
number of choices. In spite of these advantages 
adhering to the unlimited choice, there are 
sufficient disadvantages in terms of subjects’ 
time required, rapport, and the eventual sta- 
tistical analysis of the data, so that under many 
circumstances it is desirable to specify the num- 
ber of choices to be made. When choices are 
limited or required it is roughly conventional 
to specify three. 

Frequently the technique is used without the 
promise of restructuring the group. However, 
even overlooking the social obligations of the 
investigator, it would appear wise to restructure 
the group, as this will increase the sensitivity or 
care with which subjects report choices and 
rejections. On the other hand, the impossibility 
of fulfilling such a requirement does not mean 
the data will have no value. In many cases, be- 
cause of the nature of the group or the nature of 
the criterion, it is impossible to introduce such 
changes. Subjects frequently give careful and 
accurate judgments in the absence of immedi- 
ate and personal benefit. Thus, we may view 
restructuring the group as desirable because 
of the added realism and motivation it provides 
for the subject but, at the same time, it is only 
one of many factors that determine the relation 
between subject and research worker and it 
should not be considered an indispensable re- 
quirement, 

The customary precondition that the groups 
be small enough to permit face-to-face inter- 
action is often altered to the needs of the study. 
Moreno’s original study (1934) was of a closed 
community, but it should be noted that it was 
large in size. ‘The more general use of the meas- 
ures has been in smaller closed groups. How- 
ever, there has been increasing need for and 
actual application of the techniques in large 
groups and open communities (Lundberg and 
Steele, 1938; I. A, Stewart, 1948; F. A. Stewart, 
1946, 1947a,b) . In these the problems of statis- 
tical analysis are greatly increased (Bassett, 
1948; F. A. Stewart, 1948) , but the fundamental 


"application remains the same. Subjects are still 


asked to select individuals they would like to 
have near them or participate with in particular 
activities, although in this case it is not known 
what the group is from which the subject actu- 
ally chooses—that is, we do not know exactly 
what the population of experienced individuals 
is for the subject. Similarly, the customary pre- 
condition that the groups have existed over a 
period of time is often altered. Barker's (1942) 
investigation of “first impressions" is an ex 
ample of a study where it was necessary to ad- 
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minister the sociometric techniques before any 
prolonged interaction. 


SOME RELATED DEVICES 


There are an appreciable number of tech- 
niques that fail to meet the definition pro- 
posed above but still remain sufficiently similar 
to the sociometric test to deserve consideration. 

Sociometric self-rating or relational analysis. 
This technique is an extension of the sociomet- 
ric test and is used in conjunction with it. It 
involves the use of the sociometric self-rating or 
prediction of sociometric choice, e.g., "Who do 
you think will choose you?” This method was 
first mentioned by Moreno (1942) in connec- 
tion with the training of subjects’ perception 
of others through repeated sociometric trials. 
Maucorps (1949) used a restricted form of this 
technique in a study of French Army officers, 
as did Lundberg and Dickson (1952) in their 
study of interethnic relations in a high school 
population. These investigators limited the pre- 
dictions to those individuals whom the subject 
had himself chosen. A similar application was 
made by Williams and Leavitt (1947) in a 
study of Marine Corps officer candidates. Tagi- 
uri (1951, 1952) has elaborated and developed 
this approach under the title of “relational 
analysis," and has indicated its usefulness in 
the study of social perception. There seems 
little doubt that the possibility of measuring 
the accuracy with which the individual per- 
ceives the choices and rejections that he secures 
from his associates provides an important ad- 
ditional dimension that may be expected to 
have significant psychological correlates. 

Group participation scale. Pepinsky, Siegel, 
and Vanatta (1952) used an approximation of 
the Thurstone Method of Equal Intervals in an 
attempt to develop a sociometric index that 
would measure “effectiveness of participation 
in group activity.” One hundred and seventy 
sociometric items were submitted to an under- 
graduate population for judging, and subse- 
quently a reduced list of 40 items was submitted 
to a group of professionally trained psycholo- 
gists. At the end of this process it was possible 
to select a group of 24 items that represented 
each of eight scale positions with three items 
all of which showed a reasonably small amount 
of variation among judges. In summarizing the 
results of their study they state: “The Group, 
Participation Scale, used as a sociometric test 
in four fraternity groups, yielded (1) almost 
the entire range of scores as defined operation- 
ally by our scale, (2) a more normalized dis- 
tribution of scores than seems to be provided by 
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most sociometric tests, and (3) a relatively high 
agreement . . . of group members in choosing 
given individuals” (pp. 418-419). 

Group preference record. This variation of 
the sociometric technique requires that mem- 
bers of the group respond in terms of like, dis- 
like, or indifference to every member of the 
group (Newstetter, Feldstein, and Newcomb, 
1938; Zeleny, 1939, 19402). Thus, it differs 
from the conventional sociometric test pri- 
marily in that it forces an explicit evaluation 
of all group members by the subject. It has 
frequently been used in combination with some 
system of weighting choices. 

Multirelational sociometric survey. An exten- 
sion of the customary sociometric test by sys- 
tematically combining it with the collection of 
additional data has been suggested in several 
recent publications (Weschler, Tannenbaum, 
and Talbot, 1952; Massarik, Tannenbaum, Ka- 
hane, and Weschler, 1953). In this adaptation 
two classes of questions (criteria) are utilized, 
i.e., organizational "goal directed" and “non 
goal directed.” For each goal-directed activity 
five relations are investigated: (l) the pre- 
scribed, defined by leaders of the organization 
or the table of organization; (2) the perceived, 
defined by the subjects' reports of the pre- 
scribed relations; (3) the actual relations which 
the subjects report; (4) the desired, relations 
normally ascertained by a positive sociometric 
test; and (5) the rejected, the relations nor- 
mally ascertained by a negative sociometric 
test. With these five sets of data, expandable 
by the number of criteria used, it is possible 
to establish a large number of meaningful in- 
dices for the classification of relational patterns 
in an organization. 

Estimates of time, An alternative procedure 
to the sociometric test was suggested by Moreno, 
Jennings, and Sargent (1940) in the use of esti- 
mates of time as a means of measuring the 
intensity of choice. In this procedure the inves- 
tigator asks the members of the group to esti- 
mate the proportion of time, given a finite limit, 
they would like to spend interacting with the 
other members of the group in a given activity. 
This technique has been further adapted by 
other researchers (Stogdill and Shartle, 1948; 
Stogdill, 1949) to require that the subjects esti- 
mate the actual amount of time spent in contact 
with various group members. A further modi- 
fication involves securing information about 
actual visiting patterns in a community setting 
(Loomis and Davidson, 1939a,b) . 

The Moreno and Jennings modification re- 
mains closely related to the conventional tech- 
nique, as the subjects are still requested to make 
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a choice at the level of wish or desire. The use 
of time is primarily to permit the weighting of 
choices made. The other modifications, how- 
ever, take the response out of the level of wish 
and ask for a factual report of interaction. 
These are thus perhaps closer to customary 
observational techniques than to a sociometric 
test. 

Guess who technique. Another technique re- 
lated to the sociometric test is the use of identi- 
fication or "guess who" questions. Actually the 
origins of this method are as early as those of 
the sociometric technique, since they go back at 
least to the character studies of Hartshorne, 
May, and Maller (1929). This technique in- 
volves presenting the subjects with various be- 
havior descriptions and asking them to "guess 
who" among the members of their group this 
description best fits. By varying the descriptions 
along various kinds of positive and negative 
continua it is possible to get, in terms of the 
frequency with which they are selected for the 
various descriptions, an ordering of the sub- 
jects that bears rough correspondence to socio- 
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metric data. In many cases this kind of data has 
been used to supplement conventional socio- 
metric techniques and provide a picture of the 
reputations or impressions created by the sub- 
jects on other members of the group (i.e., the 
manifest personality). This technique may pos- 
sess some advantage over the sociometric test in 
that it is slightly more indirect and the intent 
of the investigator is less apparent to the sub- 
jects. This would be an advantage only where 
there would be reason to expect that subjects 
would attempt to interfere with or disrupt the 
sociometrists' intent. Moreover, the "guess who" 
device is largely limited to "choice" or "reputa- 
tion" scores and does not provide much 
information about patterns of interaction or 
communication. 

As we have said, sociometric techniques 
merge into general methods of systematic ob- 
servation. No attempt will be made to discuss 
observational techniques here, as the reader 
may refer to Chapter 10 for a complete account 
of such methods. 


ANALYSIS AND REPRESENTATION OF SocioMETRIC DATA 


The general increase in quantitative sophisti- 
cation of social science investigators is nowhere 
better reflected than in the use of sociometric 
measures. Procedures and statistical techniques 
today are strikingly differentiated from those 
in common use some twenty years ago. During 
this period there has been a transition from 
descriptive, nonstatistical procedures of analy- 
sis to a state where virtually all investigators 
concern themselves with quantifying their data 
and most make some attempt at statistical analy- 
sis. Our discussion will consider techniques for 
the analysis and representation of data under 
the following headings: graphic methods, simple 
quantitative methods, statistical methods, ma- 
trix methods, and fractionation of groups. 


GRAPHIC METHODS 


In addition to introducing and popularizing 
the sociometric technique as a means of collect- 
ing valuable empirical data, Moreno also pre- 
sented a method for summarizing the results 
of this technique that has proven most influen- 
tial This method, the sociogram, was first 
described in Who Shall Survive? (1934) and 
during the following decade or more served as 
the primary method of presenting sociometric 
data. d 

The sociogram is a diagrammatic device for 
summarizing the choices and rejections among 
members of a group. It employs geometric fig- 


ures to represent members of the group and 
various kinds of lines joining the figures to 
represent choices and rejections. At this point 
there is no single convention for the drawing 
of diagrams but, rather, there are many alterna- 
tives available to the investigator. 

As a consequence of continued use of socio- 
grams in a variety of different settings, a num- 
ber of patterns of relationship that are fre- 
quently encountered have been described. The 
ultimate number of such patterns is virtually 
limitless. Considerable attention is paid to 
types of sociometric patterns by Moreno (1934) . 
Here we shall consider briefly a few of the pat- 
terns of psychological interest that are com- 
monly seen. 

The overchosen is the member of the group 
who receives a disproportionately large number 
of choices from other members of the group. 
Moreno (1934) has used the term “star” to refer 
to all varieties of popularity and leadership, 
whether or not they involve a large number of 
individuals centering their choices around the 
star. As the iconic qualities of the term seem 
to imply one figure surrounded by a number of 
admirers (Chapin, 1950; Moreno, 1950a), it is 


“perhaps best to reserve it for that specialized 


form of popularity or personal choice where 
one individual is chosen by a number of others 
who do not direct a large number of choices 
among themselves. The term overchosen seems 
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appropriate as a general term to refer to the 
individual who receives a large number of 
choices no matter what the pattern is in which 
they occur, Overchosen persons may be leaders 
in the group, but this is not necessarily the case, 
as has been indicated frequently (Chapin, 
1950, 1951; Criswell and Jennings, 1951; Jen- 
nings, 1947, 1950; Moreno, 1934, 1950). It is 
evident that an individual who is not over- 
chosen may occupy a position of leadership by 
virtue of receiving certain key choices. A term 
such as power figure may be appropriate for 
referring to such a person. 

The isolate is the individual who receives no 
choices and makes no choices (or receives no 
choices and rejections and makes none). The 
isolated person is essentially not in the group; 
other members are either indifferent to him or 
do not respond to him, and he does not display 
attraction to or rejection of the other members. 
In the literature the unchosen and the under- 
chosen have often been called isolates. 

The term rejected has ordinarily been re- 
served for persons who receive a large number 
of rejections. 

The mutual pair has received considerable 
attention, and the number of mutual pairs in 
the group has ben used as the basis of several 
sociometric indices. The term triangle is usu- 
ally reserved for the case where three individ- 
uals are connected by mutual choices, Larger 
units of persons connected by mutual choices 
are usually called chains, and closed chains or 
rings are possible, as well as many other varia- 
tions, according to the bonds involved. 

Since the variety of patterns which occurs is 
so great, the above terms have frequently been 
applied without precision. It is obvious that 
the researcher should specify the exact meaning 
that he gives the terms he uses in his descrip- 
tions and analyses. 

The sociogram has received criticism from a 
number of quarters because there have been 
many examples of “unreadable” sociograms. In 
many cases the fault lies in poor drawings, but 
more often the trouble has been that the per- 
sons drawing the sociograms have not had as 
their purpose the analysis of data, but rather 
the presentation of a persuasive visual device. 
Several investigators have made attempts to 
standardize the drawing of sociograms and 
thereby remove a part of their subjectivity and 
unreliability, as well as making the diagrams 
more readable. Some of these efforts have at-* 
tempted to introduce elements of quantifica- 
tion. 

Proctor and Loomis (1951) have suggested 
a set of conventions to be applied in determin- 


ing the distance separating individuals in socio- 
gram positions. Essentially, they develop an 
index of social distance based upon the recip- 
rocal relation existing between each pair of 
individuals in the group. They then propose 
that each point along this continum of social 
distance can be equated to some arbitrarily 
designated distance on the sociogram. These 
investigators assume that if x chooses y he is 
closer to y than he is to other individuals whom 
he did not choose. Further, if x rejects y he is 
further away from y than other individuals 
whom he did not reject. Finally, the combina- 
tion of mutual choice or mutual rejection is 
viewed as respectively closer and further away 
than the single choice or rejection. If we over- 
look the symmetrical duplications that are pos- 
sible, e.g, x choosing y and y rejecting x, as 
opposed to x rejecting y and y choosing x, there 
are only six different kinds of interpersonal 
relationship. It is possible to overlook the sym- 
metrical variations, as each variation implies 
the same distance or closeness. The types of 
interpersonal relations are: 

Type A: i chooses j, and j chooses i 

Type B: i chooses j, and j ignores i 

Type C: i chooses j, and j rejects i 

Type D: i ignores j, and j ignores i 

Type E: i ignores j, and j rejects i 

Type F: i rejects j, and j rejects i 


Making the further assumptions that types 
D and C are equidistant, Proctor and Loomis 
propose that it is now possible to array indi- 
vidual pairs on a continuum from close to dis- 
tant, or choice to rejection, with type A repre- 
senting the closest relationship and type F the 
most distant or aversive. Actually, if one ob- 
jects to the assumption that a choice and rejec- 
tion are equivalent to two "ignores," there is no 
reason why these two types could not be made 
separate points on the distance continuum, per- 
haps with the choice and rejection being placed 
closer than the mutual “ignorers.” The major 
intent of a sociogram drawn in this way is to 
cluster closely related persons and thus to make 
it easy to identify cliques. There is some danger, 
however, that this forced concentration may 
actually mask networks of communication, espe- 
cially if the group is large and there is a high 
incidence of choices. 

This procedure provides the investigator with 
a definite goal toward which he may aim, even 
though it will not always be attainable. At the 
same time it introduces a certain standardiza- 
tion into the practices of different individuals 
constructing sociograms. It is important to real- 
ize that the larger the group the less likely it 
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is that the conventions of this system can be 
satisfied. When a subject is placed in relation 
to a large number of individuals who are simul- 
taneously in relation to other individuals, and 
independent demands are made as to the dis- 
tance separating all these positions (persons) , 
these demands will frequently not be satisfiable 
on a single plane surface. 

Another attempt at standardization was made 
by Northway (1940) when she introduced the 
"target sociogram." In this diagram a series of 
concentric circles are used to designate areas 
in which individuals are to be placed according 
to their choice status. Usually, the field is 
divided by quartiles, and the center of the target 
is the location of the individuals in the quarter 
receiving most choices, the three bands around 
the center receiving the persons in the respec- 
tive quarters. Thus, the target sociogram identi- 
fies the relations among the group members in 
the usual sense, and further identifies the choice 
status of the individual at the same time. In 
this form the target diagram may represent only 
a slight improvement over the ordinary socio- 
gram in identifying individuals who are highly 
chosen, since in the sociogram such individuals 
may be identified by their spoked-wheel appear- 
ance on the diagram. However, if it is impor- 
tant to locate all highly chosen individuals at 
one time, this form of the target diagram is 
quicker to read. Northway and Quarrington 
(1946) further suggested the use of sectors to 
separate members of the group in the target 
sociogram by characteristics such as sex or ethnic 
background, in order to facilitate examining the 
data for cleavages. Bronfenbrenner (1944b) 
proposed the possibility of using the target socio- 
gram with the concentric areas designated ac- 
cording to the probability with which the 
number of choices received exceed or are less 
than the expected number. 

The Northway target diagram may be viewed 
as useful in indicating highly chosen and rela- 
tively unchosen persons, and quantification is 
relatively explicit, but this presentation may 
not be intrinsically superior to other varieties 
of the sociogram, e.g., one which uses identifi- 
cation symbols of different sizes to designate the 
relative number of choices received by the per- 
son. An example of the latter is seen in the 
sociograms used by Powell (1951). Further, 
the use of sectors in a target diagram to desig- 
nate sex or ethnic background distinctions can 
be duplicated in ordinary sociograms, e.g., 
Loomis, Beegle, and Longmore (1947) . 

Borgatta (1951) suggested that the primary 
principle in sociogramming should be that of 
minimizing the number of crossed lines. Socio- 
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gramming according to this procedure does not 
restrict the number of persons that can reason- 
ably be handled as severely as some techniques. 
It has the further advantage that the minimum 
number of crosses may be achieved only in a 
limited number of ways, and the alternate socio- 
grams are equally readable. 

The kind of sociogram used is limited only 
by the investigator's needs and imagination. 
There are many alternative methods of presen- 
tation of sociometric data, and these may even 
include three-dimensional sociograms (Chapin, 
1950; Weschler et al., 1952). The sociogram 
does not restrict the investigator to one vari- 
able, as multiple variable sociograms may be 
handled by using lines of different colors, over- 
lays, or other devices. 

4n appraisal of the sociogram must begin 
with a heavy credit to the method for its dra- 
matic and compelling qualities. The possibility 
of summarizing group relations in terms of a 
series of graphic conventions that appear di- 
rectly understandable and interesting to most 
audiences is an important asset. Particularly in 
circumstances where the information is being 
presented in an "action" context, the possibility 
of easily engaging an audience without the ne- 
cessity of presenting technical information or 
complex explanations may be of crucial impor- 
tance. In many industrial and military applica- 
tions, as well as in some educational settings, 
the "glamour" of the sociogram makes it almost 
a prerequisite no matter what additional tech- 
niques may be used in analyzing and presenting 
sociometric results. For small groups the diffi- 
culty of constructing sociograms is slight com- 
pared with the labor involved in some of the 
other analytic techniques to be discussed. A 
final advantage of the sociogram is that it 
presents results in context. The relations exist- 
ing in the entire group are presented at once 
rather than being abstracted into a number of 
indices or scores. 

In spite of these desirable qualities, there are 
negative attributes that inhere to the customary 
use of the sociogram. The representativeness 
of the sociogram has depended almost com- 
pletely upon the vigilance and sensitivity of the 
investigator, with few or no external checks 
upon the adequacy of the diagram he presents. 
Thus, depending upon the particular spatial 
conventions or groupings the investigator uses, 
he can create different impressions of the group 


‘being examined, This absence of external or 


formal checks upon the procedure has led gen- 
erally to relatively low inter-judge agreement 
using this technique. The suggestions of 
Northway, Proctor and Loomis, and Borgatta 
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can be seen as important steps in the direction 
of reducing or eliminating this particular short- 
coming. Where the group is very large, espe- 
cially if unlimited choices and rejections are 
permitted, construction of a sociogram may not 
be feasible. 


SIMPLE QUANTITATIVE METHODS 


Simultaneously with the development of 
graphic techniques, there have been important 
developments in the use of quantitative tech- 
niques for analyzing and summarizing socio- 
metric data. We shall begin our discussion of 
these developments by considering tabular 
methods, scores or indices, and profiles. In later 
sections we shall consider statistical and matrix 
methods, 

Tabular methods. Sociometric data are sum- 
marized completely in the sociogram, but an 
equally complete and natural way of summa- 
rization is the N X N table or matrix, in which 
N refers to the number of subjects. The sub- 
jects are placed along the left margin of the 
table in some arbitrary order, and then the 
order is reproduced along the top margin of the 
table. The responses of a given subject to the 
remaining N — 1 other subjects may then be 
placed in the proper cells. Ordinarily the main 
diagonal of the table or matrix is empty, since 
subjects are not asked to respond to themselves. 
This technique makes the summarization of 
the number of choices and rejections made and 
received by each subject simply the task of 
adding across a column or down a row. It serves 
as the basis for summarizing data into simple 
scores, and then by various operations into par- 
ticular indices, and also provides the matrix of 
raw data for the various matrix analyses. 

Scores and derived indices. Quite early in the 
use of sociometric measures some investigators, 
e.g., Moreno (1934), chose to summarize what 
appeared to be important aspects of the choices 
and rejections within a group in terms of simple 
scores or ratios. These methods have frequently 
been taken to task as oversimplifying the com- 
plexity of the group relationships. In cases 
where the score has been reasonably derived so 
as to get at an aspect of the choice and rejection 
pattern of general interest, this approach has 
led to fruitful results. Thus, although the indi- 
viduals who hold to an organismic or holistic 
view of the world may mourn the loss of the, 
complex interrelations that are to some extent 
present in the sociogram, or the N X N table, 
the use of indices and their derivatives has be- 
come increasingly popular. d 

There are many ways in which scores and in- 
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dices can be categorized. Some deal with or 
represent the individual, while others represent 
the group as a whole. Still others represent the 
individual in interaction with other members 
of the group. Our intention here is not to pre- 
sent a systematic summarization of indices, but 
rather to illustrate the breadth and variety of 
indices that have been used by different investi- 
gators. A 
Moreno’s Who Shall Survive? includes the 
explicit formulation of a number of indices and 
implicitly assumes a larger number. The identi- 
fication of the star, the isolate, and the rejected 
essentially provides the basis for a large part 
of the individual indices later developed. Mo- 
reno also proposed a number of group indices: 
emotional expansiveness refers to the amount of 
utilization of available choices. This index can 
be used only where the number of choices is not 
required, i.e., the subjects are permitted to make 
unlimited choices or as many as they wish up 
to a certain limit. Although the index is cus- 
tomarily used by Moreno in the description of 
groups, he also uses the concept in the individ- 
ual sense. The ratio of attraction is equal to 
the number of attractions divided by the total 
number of attractions and rejections, converted 
to a percentage. The ratio of repulsions is simi- 
larly computed. Moreno set the ground work 
for an index of "cleavage" in a group in his 
discussion of restriction of spheres of choice, 
and more directly in the use of the ratio of in- . 
terest for home group and for outside groups, 
which are computed on the basis of available 
choices utilized. "The index of relative popu- 
larity (of a group) refers to the proportion 
actually received of the possible choices the 
group could receive from the external groups. 
The number of choices received serves as the 
basis for a number of indices. Individuals may 
be classified arbitrarily, e.g., they may be strati- 
fied by quartiles, deciles, or other dividing 
points. An example of this is Jennings’ (1950) 
stratification of persons in terms of dividing 
points established on the base of the mean num- 
ber of choices received. She identifies one 
standard deviation above the mean number of 
choices received as overchosen, and those one 
standard deviation below the mean as under- 
chosen. Arbitrary classification (or fractioniza- 
tion) of this type extends the utility of the data 
in allowing the possibility of comparison to or 
combination with data from additional groups, 
since the size of the group is considered in the 
classification. However, it should be noted that 
being one standard deviation above the mean 
in one group may have different meaning than 
being one standard deviation above the mean 
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in another group. Still another classification is 
presented by Proctor and Loomis (1951) in 
their definition of choice status as the number 
of choices received divided by one less than the 
number of persons in the group, with a similar 
definition for rejection status. An important so- 
cial status index has recently been reported by 
Katz (1953). This measure employs vector con- 
cepts in order to take into consideration not 
only the number of choices but also the social 
position of those who choose. 

The sum of acceptances plus the sum of rejec- 
tions, divided by one less than the group size, 
is an index which McKinney (1948) identifies 
as social status index. Smucker (1949) identifies 
this index as social intensity. Choice rejection 
status is identified by Proctor and Loomis 
(1951) as the number of choices less the num- 
ber of rejections received. Such an index, how- 
ever, may assume negative values, and should 
be adjusted where necessary by addition of a 
constant factor if only positive values are 
desired. 

The number of mutual choices in a group 
serves as the basis of several indices. McKinney 
(1948) identifies a compatibility index as the 
number of mutual choices divided by one less 
than the group size. Proctor and Loomis (1951) 
identify group cohesion as the number of mu- 
tual pairs divided by the possible number of 
mutual pairs in the group (as restricted by in- 
structions for choosing). The same authors 
(1951) identify group integration as one di- 
vided by the number of persons receiving no 
choices. 

The number of indices introduced to handle 
sociometric data is so vast that it is nearly im- 
possible to enumerate them all and list the 
various dimensions with which they have been 
identified, Proctor and Loomis (1951) present 
an outline of indices that may serve as a begin- 
ning point for persons who wish to examine a 
range of the defined indices in sociometry. In 
addition to those we have already mentioned, 
Proctor and Loomis review such indices as 
in-group preference, in-group cleavage, in-group 
cohesion, and in-group climate. 

As the reader can see, the sociometric investi- 
gator faces many possibilities for devising in- 
dices. We have already indicated that persons 
may be ranked within a group on the basis of 
the raw number of choices made and received, 
rejections made and received, and combinations 
of these. Such indices are of immediate use 
where the sociometric status of individuals in 
a given group is to be examined in relation to 
other criteria. The problem of comparability 
of persons selected from two or more groups 
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may be handled in several ways. Assuming that 
choices and rejections have been made under 
the same directions, persons might be ordered 
by their relative positions in the group, as with 
Northway's quartiles or Jennings’ standard 
deviation classifications. In such cases the in- 
vestigator should be aware that groups vary in 
the distribution (and range) of choices made, 
and if he is willing to make certain assumptions, 
he is able to transform his data so as to increase 
its utility. An index based on the number of 
persons in the group will not necessarily be 
much of an improvement over the use of raw 
scores, since this merely divides all cases by a 
constant, and will not increase the comparabil- 
ity of persons in groups that differ greatly in 
size. Rather, in cases where persons are selected 
from a group of say 20 persons, and others from 
a group of say 200, comparability on the basis 
of such an index would probably be more ques- 
tionable than the use of raw scores as an index. 
Essentially, the problem of what type of index 
is proper in the situation is defined by the use 
to which the index is to be put, and further, 
the types of assumptions the investigator is will- 
ing to make. 

We have not yet discussed the problem of in- 
tensity of responses, that is, the differential 
meaning of first, second, third, etc., choices. 
Some investigators (Zeleny, 1939, 1947b) make 
extensive use of ranked choices in their analyses 
and incorporate them into their indices. This 
simply requires an adjustment in the computa- 
tion of the index, and researchers may adapt 
indices in this way without serious difficulty. 
However, very little work has been done at a 
theoretical level on the meaning of intensity in 
sociometric choices (Foa, 1950) or the value 
of assigning arbitrary weights. The researcher 
should consequently be cautious in weighting 
choices, especially if the choices are required. 
We shall discuss later Moreno’s (1934) report 
on the greater stability of first as against sec 
ond choices which might lend support to the 
weighting of choices, but since weighting is or- 
dinarily arbitrary, there is no evidence that 
equal weights (ignoring choice order) would 
not be as good as assigning weights by any 
other arbitrary technique. 

Similarly, we have not dealt with indices 
derived from choices made on several criteria. 
Converting these data to customary indices is 
a relatively simple matter, but as we shall indi- 


‘cate in our discussion of reliability, the investi- 


gator should be aware of the assumptions 
implicit in his operations. One method of deal- 
ing with responses to several criteria is through 
the use of profiles, 
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Profiles, The investigator may wish to ex- 
amine the choice status of the individual in 
terms of a number of indices or criteria at the 
same time. In this case, some unit of standardi- 
zation is necessary among the scores. Jennings 
(1948, 1950) has dealt with profiles of individ- 
uals based on (1) choices made, (2) choices 
received, (3) positive reciprocations, (4) rejec- 
tions made, (5) rejections received, and (6) 
negative reciprocations. The first three are 
given as the choice pattern of the individual, the 
last three as the rejection pattern, and the six 
together as the choice-and-rejection pattern. 
The pattern is identified by the ordered series 
of six pluses or minuses, a plus signifying that 
the person is above the mean in the particular 
item and a minus signifying that he is below 
the mean. Such presentation allows the re- 
searcher to examine the distribution of patterns 
and, essentially, the relationship between per- 
formances on the individual items, by examin- 
ing the distribution of types. Profiles are 
reported elsewhere (McKinney, 1948, Smucker, 
1949) in various contexts, but the contribu- 
tion here is in standardization on the same cri- 
terion to facilitate examination of types. 
Bjerstedt (1952) extends the work of Jennings 
and provides a systematic method of classifica- 
tion which is easily extended to at least two 
forms of sociographing. Bjerstedt’s innovation 
is especially interesting in the amount of infor- 
mation which is made available for immediate 
reading, as well as the fact that sociographing 
according to his specifications is reproducible 
by independent investigators. 


STATISTICAL METHODS 


Statistical techniques are widely used in the 
analyses of sociometric data both to test the sig- 
nificance of observed findings and also to pro- 
vide derived scores or indices. Several investi- 
gators who have used quantitative methods 
have developed scores with the intent of dis- 
tinguishing between differences or deviations 
that should properly be assigned to chance fac- 
tors as opposed to those that represent actual, 
replicable deviations from the group norms. 

Bronfenbrenner (1943, 1944a) developed the 
notion of the chance model, introducing the 
Pearson Type III Curve as the most adequate 
approximation to the binomial expansion for 


sociometric data, and elaborated on the concept. 


of statistical significance as applied to socio- 
metric data. Distribution of indices on a chance 
basis, given a level of significance, provides the 
upper and lower limits of variation beyond 
which phenomena may be judged to be statis- 
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tically different from chance. Thus, for ex- 
ample, given any distribution of choices 
received in actual data, using the limits estab- 
lished by the chance model, persons may be 
identified who receive significantly more or less 
choices than would be expected by chance. 
Bronfenbrenner's approach is intended to have 
the advantage that it provides a constant frame 
of reference for comparison or pooling of data; 
that is, for example, persons who receive sig- 
nificantly more choices than would be expected 
by chance may be recruited from several dif- 
ferent groups and pooled for comparison with 
a similarly selected group of underchosen per- 
sons. Or, persons who receive significantly more 
choices than would be expected by chance in 
one type of group may be compared with simi- 
larly selected persons in another type of group. 

The usefulness of a probability model is un- 
questioned, although there are many discussions 
in sociometric literature of misapplications or 
the adoption of incorrect (or incomplete) as- 
sumptions (Criswell, 1947; Proctor and Loomis, 
1951). One example of an inappropriate as- 
sumption is Bronfenbrenner's (1943, 1944a) 
suggestion that the distribution of mutual 
choices will be according to the binomial ex- 
pansion. Criswell (1946, 1950), Edwards 
(1948) , and Loomis and Pepinsky (1948) have 
all indicated the theoretical inadequacy of this 
formulation. Criswell's (1939) assumption that 
chi-square values may be ordered rather than 
treated as significant or not significant has 
been objected to by Loomis and Pepinsky 
(1948) . 

One special consideration in the use of Bron- 
fenbrenner's method should be noted. Separat- 
ing persons as significantly overchosen and sig- 
nificantly underchosen provides an aura of re- 
spectability in the handling of the data, but 
such a procedure may be unrealistic. Such 
classes are frequently very small and require a 
large number of samples, or large samples, to 
permit further statistical comparisons. On the 
other hand, there is good evidence that the sta- 
bility of a person's choice status is high, so that 
ordering of persons on choice status will have 
reasonably constant meaning. In using only 
classes of persons who are significantly above or 
below chance expectation, the researcher ap- 
pears to make the assumption that those who do 
not satisfy this condition of statistical signifi- 
cance receive their choice position on a basis 
of chance. This assumption does not seem nec- 
essary and results in the loss of large amounts 
of data. Thus, unless the investigator is genu- 
inely interested in the extremes of the distribu- 
tion, there seems good reason for him to use 
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in comparisons subjects who are selected arbi- 
trarily or in terms of some rational criterion 
rather than only those who deviate from the 
mean at some particular level of significance. 
Criswell (1950) has posed this criticism in a 
slightly different form in her detailed analysis 
of the constant frame of reference problem. 

In contrast to Bronfenbrenner's method, 
which uses probability values as scores, Cris- 
well's scoring procedures attempt to eliminate 
the influence of chance distributions. The first 
scoring method suggested by her (1939b) was 
for the measurement of race preference ex- 
pressed by a group as a whole. She devised a 
ratio in which a given racial group's preference 
for itself over another racial group was ex- 
pressed as: 


Group self-preference = peel 


in which § — number of choices given by a 
group to itself, St — number of choices it would 
give itself by chance, O — the number of choices 
the group gives to the other racial group in the 
experimental population, O! — the number of 
choices it would give by chance to the other 
racial group. Another method of probability 
scoring devised by Criswell (1946, 1947) defines 
group coherence as the obtained ratio of recip- 
rocated to unreciprocated choice divided by the 
corresponding chance distribution. The for- 
mula is then: 


Group coherence — 


Number of reciprocated choices X q 
Number of unreciprocated choices X p ' 


where p/q is the chance ratio of reciprocated to 
unreciprocated choices. 

Statistical methods have also been used to ex- 
amine the likelihood that a given finding is the 
result of chance factors. The first use of the 
chance model with sociometric data was made 
by Moreno and Jennings (1938) when they set 
up an empirical random selection procedure 
and generated sociometric groups. They then 
compared these groups in terms of the distribu- 
tion of choices with a probability model (pro- 
vided by Lazarsfeld) based on the binomial 
expansion and found no apparent differences. 
The chance model was then compared with 
actual data by means of the chi-square test, and 
significant differences between the obtained 
and chance distributions were immediately ap- 
parent, In the actual data, unchosen and 
highly chosen persons occurred more fre- 
quently, and mutual pairs also occurred more 
frequently than in the chance distribution. 


Several attempts have been made to apply 
statistical methods in testing differential choice 
patterns within a group. The first of these was 
suggested by Criswell (1939), who used a chi- 
square test in which: 


(Self-preference expressed — 
i z 2 
Chisquare dees self-preference) A 
Theoretical self-preference 
A critical discussion and extension of this 
approach is provided by Loomis and Pepinsky 
(1948). Seeman (1946) presents an alternate 
procedure in his study of intragroup Negro atti- 
tudes, in which he uses an analysis of variance 
design. It should be noted that Bronfenbren- 
ner's (1943, 1944a) approach, using the Pearson 
"Type III curve, may also be employed here. 


MATRIX APPROACHES 


In recent years there has been a rapidly ac- 
celerating interest in the analysis of sociometric 
data through methods of manipulating the 
N X N table on which sociometric responses 
are summarized. 'These methods of matrix 
analysis are still in the process of development, 
but for certain kinds of problems they show 
marked advantages over other methods. In par- 
ticular, they are explicit and repeatable, and 
lend themselves to statistical analysis; some of 
them deal with large amounts of data relatively 
efficiently, and certain graphic features are 
retained. 

The first of the matrix approaches to socio- 
metric data was developed by Forsyth and Katz 
(1946). In their method, the N X N table is 
taken as the raw matrix. The original N X N 
matrix is then changed through moving pairs 
of rows and columns (the order of columns 
must always remain identical to that of the 
rows) to reveal subgroups within the structure, 
stars, isolates, and other characteristic forms. 
The ordering is one of trial and error, in large 
part, with the main goal of forcing mutual 
choices to gravitate towards each other or, more 
specifically, to concentrate choices along the 
main diagonal. Presumably, if rejections are 
included in the data they will gravitate away 
from the main diagonal as the choices gravitate 
towards it, The recommended procedure is to 
pick two persons with mutual choices and place 
them adjacent to each other in the matrix. 
Then select another person who has mutual 


“choices with both, if such a person exists, or 


who is related by choice with one or both of 
the original two. Rearrange rows and columns 
as is necessary to maximize the concentration. 
When no persons can be added to this nucleus 
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of individuals, consider this a subgroup and 
begin the rearrangement of the remaining indi- 
viduals. Continue the procedure until all per- 
sons are related to a subgroup, or until only 
isolated or rejected individuals remain. Sub- 
groups should be rearranged so that the maxi- 
mum concentration of rejections (if any) is 
located in the corners of the matrix away from 
the main diagonal. Individuals who choose 
into a subgroup but are rejected or ignored 
are placed on the upper right boundary of the 
relevant subgroup, and persons who are rela- 
tively underchosen but apparently related to 
the subgroup may be placed at the lower left 
boundary of the subgroup; where both condi- 
tions are apparent, the second takes precedence 
in the arbitrary placement. Genuine isolates 
should appear near the center of the matrix. 
At this point the matrix may be re-examined to 
emphasize the concentration of subgroups, etc. 

The advantages of this matrix approach as 
suggested by the authors include the following: 
Different investigators will tend to produce the 
same or similar matrices. Subgroup structures 
are evident in the concentrations along the 
main diagonal. Potential conflict betwen two 
subgroups may be indicated by concentration 
of rejections in the two intersections of the 
subgroups, and the relative rejection of one 
subgroup for another may similarly be exam- 
ined. Relative permanence of the group struc- 
ture may be indicated by the extent to which 
subgroups are “well knit’ and have absorbed 
satellites. Isolates and the overchosen may be 
identified as before by column and row totals 
and, in addition, one may see whether the over- 
chosen are chosen primarily by one subgroup, 
or two, or from the group at large, and similarly, 
whether a rejected person is rejected by a group 
in particular or from the group as a whole. The 
major disadvantage of the technique is that the 
manipulation of rows and columns is cumber- 
some. 

Katz (1947) indicated as additional advan- 
tages of the matrix procedure that not only 
could positive and negative choices be included 
in the matrix but that, essentially, indifference 
is accounted for, and there is no necessary 
restriction that excludes the utilization of 
weighted choices. Moreover, choices may be 
unrestricted, limited, or required. The major 
contribution of this second paper, however, is 
in describing a canonical form for the matrix 
which would serve as the preferred ordering,’ 
namely, the maximization of concentration of 

ositive choice about the main diagonal. Essen- 
tially, the form may be stated as follows: Given 
N row vectors and N column vectors, and the 
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interval from one row (or column) to the adja- 
cent one taken as one, the square of the distance 
of the element in the intersection of the ith 
row and jth column from the main diagonal is 
Y (i — j) *. Thus, ignoring the constant, the 
least square solution of this form is defined by 
the minimum value of X, E, ey (i — j) *, where 
ey is the element of the ith row and jth 
column. Once in canonical form, the matrix is 
subject to various statistical operations. How- 
ever, because of the difficulty of bringing a 
matrix to canonical form, these statistical sug- 
gestions of Katz are primarily of academic inter- 
est at present. 

The utility of the matrix approach suggested 
by Forsyth and Katz is quite obvious, but the 
limitations should not be underestimated. 
Moreno (1946) commented on the presentation 
of Forsyth and Katz and suggested that he 
viewed the matrix approach as supplementing 
the sociographic approach rather than replac- 
ing it. He indicated that certain types of rela- 
tionships, such as chains of communication, are 
quite difficult to locate in the matrix, while 
they are quite simple to trace on the sociogram. 
The “cumbersome” aspect of the matrix manip- 
ulation should not be underestimated. The fact 
that a canonical form has been described does 
not make the task of manipulating the matrix 
any easier; it still remains a matter of sophisti- 
cated trial and error to get as close to the can- 
onical form as possible. In some cases this may 
be as time consuming as drawing and redrawing 
a number of sociograms. On the other hand, 
the Forsyth and Katz approach may have an 
advantage when the number of choices and 
rejections made by the subjects is very large. 
In sociograms of the Loomis variety the prob- 
lem of congestion would be evident, and in 
sociograms minimizing the number of crossing 
lines the number of such lines would be very 
great. In such circumstances the Forsyth and 
Katz matrix should make the subgroup struc- 
tures more evident. A final objection to this 
method is that it forces the assumption of line- 
arity upon the data examined when it may be 
that this assumption is not warranted. 

A matrix approach related to that of Forsyth 
and Katz is presented by Beum and Brundage 
(1950). Their method is one which is designed 
to maximize choices about the main diagonal. 
The method makes no restrictions on the data 
to be entered in the cells. The procedure is 
based on the demonstrable condition that: 


If weights are assigned to the rows of a socio- 
matrix (from 1 to N beginning with the bottom 
row) ...and the average product of the ele- 
ments in each column and the corresponding 
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weights is maximized for each column, the sum 
of the squares of the elements about the principal 
diagonal is minimized. (p. 142) 


This would produce a canonical form defined 
in a way similar to that of Katz (1947). The 
procedure for achieving the canonical form is 
not one of trial and error, as in the Forsyth 
and Katz approach, but one of a systematic 
iterative process. When iteration of the pro- 
cedure no longer produces changes in the 
structure (arrangement of rows and columns) , 
the canonical form is reached. When iteration 
produces recurrence of arrangements of rows 
and columns, alternate and equally "good" 
arrangements are displayed. The procedure has 
the advantage over the Forsyth and Katz method 
that it is more systematic (different investiga- 
tors get the same results). However, the au- 
thors point out that the iterative procedure 
converges slowly, and that consequently the 
labor involved may be excessive. 

Bock and Husain (1950) consider the For- 
syth and Katz (1946) matrix approach similar 
to identifying “clusters” of psychological tests. 
They introduce the notion of ranking choices 
in terms of the “relatedness,” that is, the degree 
of reciprocation (this assumes the subjects 
ranked their choices). One generates a new 
matrix by replacing the original values by the 
rankings in this dimension. Having made this 
conversion, the matrix may be analyzed using 
Holzinger's B-coefficient. The procedure is not 
unlike that of the Forsyth and Katz approach 
of beginning with two persons who are mutu- 
ally related, and then picking additional per- 
sons who are highly related to the initial nu- 
cleus. If the Bock and Husain technique offers 
any single advantage, it is that a reasonable but 
quite arbitrary decision point can often be es- 
tablished to determine whether the last person 
added to the subgroup actually belongs to the 
subgroup, but it is not certain that this is actu- 
ally better than what could be accomplished by 
inspection. 

Luce and Perry (1949) with Festinger (1949) 
have developed a second method for matrix 
analysis of sociometric data which is based on 
matrix multiplication. This method is immedi- 
ately differentiated from the Forsyth and Katz 
approach, as it places severe restrictions on the 
type of data which can be handled. Only 
choices or rejections on a single criterion can 


be treated and there can be no weighting of ` 


responses. 

The N x N table or matrix is again the be- 
ginning point of this method. The basic opera- 
tion involved is raising the raw matrix to 
various powers. The simplest problem for which 
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the method may be used is to identify chains of 
communication, and the easiest of these to find 
is that of the two-step chain, where individual i 
chooses individual j who in turn chooses some- 
one else. Given an N X N matrix, the two-step 
chains can be identified through squaring the 
matrix. The operations involved in raising 
the matrix to various powers are described by 
Luce and Perry (1949), Festinger (1949), and 
Festinger, Schachter, and Back (1950). Given 
the squared matrix, the entry in the ijth cell 
indicates the number of two-step paths from i 
to j. The cells on the main diagonal contain 
the number of mutual choices in which each 
individual is involved. The total number of 
mutual choices in the group is obtained by sum- 
ming the cells of the main diagonal and divid- 
ing by two, as each mutual choice appears twice. 
Through raising the matrix to higher powers, 
cubing, etc., it is possible to use the method to 
identify cliques. Ordinarily this method has 
defined cliques in a highly restrictive fashion, 
asking that each member of the subgroup select 
all other members of the subgroup. Luce 
(1950), however, extended the generality of 
this approach through indicating how it is pos- 
sible to alter the definition of the clique to 
include looser structures, i.e., identifying cliques 
in which there were indirect or incomplete pat- 
terns of choice among the members. 

It is important to realize that having identi- 
fied particular kinds of patterns or structures it 
is virtually always necessary to return to the 
original matrix in order to identify the persons 
who make up the pattern. One should also be 
aware of the fact that as the matrix is raised to 
higher powers redundancies will occur, that is, 
the same pattern may be counted more than 
once. A method for determining redundancies 
in five- and six-step chains is proposed by Ross 
and Harary (1952). 

The use of matrix multiplication in the 
analysis of sociometric data has the greatest 
potential for the development of standardized 
techniques for the identification of structures 
and channels of communication. However, the 
nonmathematically inclined may find some dif- 
ficulty in understanding the procedure. At 
present the techniques are not sufficiently well 
developed for proper assessment. Nevertheless, 
the following difficulties are evident: (1) The 
limitation to unweighted choices or rejections 
may be found restrictive. (2) The matrix oper- 
ations are laborious unless done on IBM equip- 
ment, and programming for this matrix 
procedure may be more effort than it is worth. 
(3) Even as extended by Luce, the definitions 
of structures which may be isolated are restric- 
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tive. Identification of patterns of a given defini- 
tion will still leave many other patterns 
unidentified, requiring either retreatment or 
neglect of the data. (4) The holistic view pos- 
sible with the Forsyth and Katz method and the 
sociogram approaches is not possible. In favor of 
the method, it is the only approach available 
which is relatively unrestricted as to the size of 
the group which may be examined, and this 
alone may be sufficient reason for giving it at- 
tention. In addition, treatment of the data is 
readily repeatable in the hands of different in- 
vestigators. Persons interested in reading rela- 
tively nonmathematical expositions of this 
approach should refer to Festinger (1949) and 
Chabot (1950). 

Machine computation. It is obvious that some 
of the analyses we have discussed, the matrix 
multiplication as an example, if attempted by 
hand necessitate a prohibitive amount of labor. 
Fortunately, many of the operations required 
can be programmed and done on IBM equip- 
ment, and the investigator should become fa- 
miliar with its use, especially if he is to deal 
with large samples. Beum and Criswell (1947) 
presented a method for machine tabulation of 
sociometric data. The technique has great ver- 
satility, and is unrestricted as to the size of the 
group to be considered. Most operations which 
can be reduced in form to simple equations can 
be done on IBM equipment either with the orig- 
inal cards or with decks generated in accord- 
ance with the programming of the operations. 

Katz (1950) presents an alternate procedure 
for IBM tabulation. All operations possible by 
the Beum and Criswell technique are equally 
possible with this method, and for relatively 
small samples it may be more expedient. How- 
ever, the technique has a limit of practical use- 
fulness, since the number of persons in the 
sample is restricted by the number of available 
columns in the deck. Katz presents some sample 
procedures for getting mutual choices, measur- 
ing cleavages, raising a matrix to a power, trial 
and error manipulation for using the Forsyth 
and Katz method of matrix analysis, etc. 

Persons planning analysis of large-scale socio- 
metric data will do well to investigate the feasi- 
bility of using machine techniques, whatever 
the actual method of analysis, However, if the 
sample size is not large, the investigator may 
spend more time in preparing for analysis than 
other methods might require to complete the 
analysis. 


FACTOR ANALYSIS 


The possible application of factor analysis to 
sociometric data has been suggested many times, 


although as yet there have been only a few such 
extensions reported. Cervinka (1948) explicitly 
suggests the application of factor analysis to 
sociometric and similar structures in terms of 
isolating factors related to the manifest data, 
and Katz (1950), in his treatment of machine 
techniques, makes similar suggestions. Bock 
and Husain (1950) have carried out a factor 
analysis of a particular sociometric group and 
were able to identify two factors, "sex" and 
“reputation for science ability.” Their presen- 
tation is primarily an exercise in which they 
demonstrate the possible utility of such an ap- 
proach, especially in terms of factor scores 
computed for the group members. Proctor 
(1953) applied factor analysis to the classifica- 
tion of multiple sociometric questions. 


FRACTIONATION OF GROUPS 


The problem of identifying subgroups is basi- 
cally a general one, and is encountered in most 
investigations. Much of our discussion of quan- 
titative, statistical, and matrix methods has 
direct implications for this problem. Here we 
wish simply to stress again the wide range of 
possibilities for arriving at principles of clas- 
sification and provide one example of a well- 
contrived solution to a fractionation problem. 

Given a particular set of operations, the re- 
searcher may feel confident that he has the 
"best" way of dividing or ordering his sample 
into classes for further statistical treatment. 
Basically, however, the operations involved are 
arbitrary and should be accepted as such. While 
logic and theory are always important considera- 
tions there are points in the analysis of data 
where expediency must enter in to some degree. 
Thus, if less “exact” fractionation is used, the 
only question the researcher must ask himself 
is whether or not this lack of exactness could 
account for the failure to observe expected rela- 
tions. Even if the classification is to be corre- 
lated with another measure, the “exactness” of 
the fractionation may not be nearly as impor- 
tant in the determination of the magnitude of 
the correlation as the number and size of the 
classes. Thus, as we have indicated before, sta- 
tistical operations and quantitative respecta- 
bility may not always lead to maximum utility. 
This point of view is consistent with Criswell's 
(1950) defense of the within-distribution frac- 
tionation, e.g., Jennings (1943), as against the 
use of classes of persons significantly above or 
below the chance expectancy. 

At this point, let us examine an example of 
intelligent compromise in the process of frac- 
tionating groups. In considering the problem 
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of differentiating status groups using multiple 
criteria (choices and rejections), Lemann and 
Solomon (1952), following Bronfenbrenner 
(1943), considered the possibility of making 
their high status group those individuals whose 
choice frequency exceeded the expected at the 
1% level of significance and whose rejections 
fell below the expected at the 1% level of sig- 
nificance. The low status group would be the 
converse—falling below the expected in choices 
and exceeding it in rejections. The subjects 
that did not fall into these two groups would 
be placed in the middle status group. Partly as 
a consequence of the fact that the choice and 
rejection distributions did not coincide when 
one was inverted, they found that their logically 
defensible principle of fractionation was not 
workable. The vast majority of the subjects 
fell in the middle status group and there were 
few cases in either the high or low status groups. 
In the face of this experience they reasoned: 
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Jet the high status group consist of all 
those who received significantly more choices than 
would be expected, and who at the same time did 
not receive more than the mean number of 
rejections; and let the low status group consist 
of those who received more rejections than would 
be expected, and who at the same time did not 
receive more than the mean number of choices; 
and let all others not included in these two ex- 
treme groups make up the middle status group. 
(p. 31) 

This method seems defensible, as it places in 
the high group all those who secure significantly 
more choices than would be expected by chance, 
unless they also receive a large number of re- 
jections. The low status group is conversely de- 
fined, and the middle group is determined 
residually. Most important is the pragmatic 
outcome of reasonable numerical divisions of 
subjects into the three groups, and this was an 
important factor in the decision to employ this 
classification. 


RELIABILITY 


Discussion of the reliability of sociometric 
measures is fraught with the same difficulties 
that beset discussion of all instruments pur- 
porting to measure variables that change with 
the passage of time. In dealing with test con- 
sistency, at least, the sociometric tester finds 
himself sharing sentiments with the projective 
tester. Just as the user of the TAT or Ror- 
schach protests, somewhat futilely, that he would 
be disastisfied with his test if it showed no 
change in an individual following the passage 
of time and concomitant changes in the indi- 
vidual, so the sociometric tester claims that his 
test must vary with the passage of time and 
changes in group structure. 

The issue appears to be primarily one of the 
imputed effect of environmental forces upon 
the variable being measured. The psycholo- 
gist’s consideration of reliability has been 
heavily influenced by the definitions and pro- 
cedures developed by "capacity testers” who 
for the most part were willing to assume that 
the variables of primary interest to them were 
not readily susceptible to environmental change. 
Such an assumption greatly simplifies the prob- 
lem of examining instrument consistency, as it 
assigns all variability to test unreliability, For 
most psychological variables, however, it is em- 
pirically so thoroughly unwarranted as to make 
its adoption impossible. The user of sociometric 
measures seems bound to accept the probability 
of change over time in the variables he is inter- 
ested in studying. Thus, the task of specifying 
the elements of change in test data that may be 


correctly attributed to the actual changes in the 
variable under study, as opposed to those ele- 
ments of change that must be attributed to 
unwanted or chance fluctuation in the test, is 
a complex and exacting task. 

Interpretive reliability. To begin with, it may 
be useful to distinguish between, questions of 
reliability having to do with interpretation of 
the data provided by the test and the reliability 
of the data itself. Interpretive reliability is a 
much simpler problem and ordinarily is as- 
sumed in studies of test reliability. We are 
interested here in the extent to which the de- 
scription of data by a given investigator, once 
the data are on hand, can be repeated by other 
investigators given the same data, Our discus- 
sion of the various kinds of analysis an investi- 
gator may employ has made clear that the 
interpretive reliability may be expected to vary 
with the particular technique chosen. Thus, 
if the data are analyzed in terms of an index 
the reliability of the procedure is apt to be very 
high. It is true that when the data are analyzed 
in terms of scores or indices the picture of the 
group that results may vary depending upon the 
particular quantitative choices the investigator 
makes. However, the problems are no different 
here than with any set of quantitative data that 
permits expression in terms of more than one 
set of scores. If the data are summarized in 
terms of a matrix and arranged according to 
procedures such as those suggested by Forsyth 
and Katz (1946), there should again be high 
agreement between two investigators treating 
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the same data independently. It is only in the 
purely graphic representation that a great deal 
of unreliability is apt to occur (Loomis, 1948) . 
Even here procedures such as those proposed by 
Proctor and Loomis (1951) and Borgatta 
(1951) reduce this variability considerably. 

Thus, the problems posed by interpretive 
unreliability are not of great magnitude. Only 
when sociograms are used to analyze sociometric 
data is there apt to be excessive unreliability, 
and even this can be reduced by agreeing upon 
certain conventions in advance of analysis. 

Test reliability. As we have implied, this is 
a more complex matter. In most situations it is 
not possible to provide definitive answers to 
the question of just how stable sociometric 
measures are. Better, one might say that it is 
very difficult to divorce the stability of the 
measure from the instability of the phenomenon 
being measured. 

The paiticular problems encountered vary 
with the kind of consistency in which the in- 
vestigator is interested. Thus, if we are con- 
cerned with the repeat reliability of the test, its 
capacity to produce consistent results over a 
period of time, appraisal of the reliability of 
the instrument is made difficult on the one hand 
by the effect of memory of original responses 
and on the other by the changes that the group 
may reasonably be expected to undergo. If we 
examine similarity of sociometric patterns over 
a short period of time we must account for a 
high degree of consistency not alone in terms 
of the consistency of the measurement but also 
in terms of the contribution of the subject's 
recall of his first test responses. On the other 
hand, if the two administrations are separated 
by a sufficient interval of time to make accurate 
recall of original choices unlikely, we are then 
faced with the dilemma that any lack of con- 
sistency between the two sets of responses may 
be at least partially attributed to actual changes 
in the structure of the group. 

A further difficulty is encountered in connec- 
with restructuring groups in compliance with 
sociometric choices (Gottheil, 1952). If the 
group is changed, as orthodox sociometric prac- 
tice demands, the retest is carried out on a new 
group and differences in the response data 
would, of course, be expected. If the group is 
not changed it might be argued that the ineffec- 
tiveness of the subjects’ responses in producing 
significant changes in their interpersonal world 
alters their attitudes toward the sociometric 
questions so that one would again expect differ- 
ences in response to the first and second admin- 
istration of the technique. 


We are thus partially opposing test reliabil- 
ity to test sensitivity. If we accept human groups 
as more or less constantly undergoing changes 
in their internal composition, only an insensi- 
tive test could give us a consistent measure from 
two administrations widely spaced in time. In 
such a circumstance changes in response may be 
evidence of the sensitivity of the test. Or, more 
convincingly, temporal consistency of the test, 
in these circumstances, would be evidence for 
the lack of sensitivity and representativeness of 
the data it provided. 

Instead of repeat reliability, one may inquire 
into the internal consistency of the sociometric 
measure. Here one might in some arbitrary 
fashion divide the group in half, so that for any 
index each individual is given two scores, one 
from each half of the group. As P. N. Pepinsky 
(1949) has pointed out, reliability here be- 
comes a measure of how consistently the indi- 
vidual is reacted to by the various members of 
the group, and no one has ever claimed that all 
members of a group ordinarily react to an indi- 
vidual in the same way. Thus, differences 
between the two halves are to be expected and 
are not evidence of the unreliability of the test. 
There are further difficulties involved in this 
approach because the average number of indi- 
vidual choices received in most groups is quite 
low. Dividing them in half leads to a large 
number of zeroes and small numbers, which in 
turn produces serious quantitative problems. 

Finally, where multiple criteria are used in 
the test one may inquire into the consistency of 
choice patterns occurring in connection with 
the various criteria. This method corresponds 
roughly to the use of alternate forms in estab- 
lishing the reliability of psychometric instru- 
ments. However, the consistent emphasis of 
sociometric investigators upon the crucial im- 
portance of the particular activity or criterion 
in terms of which the choice is made makes it 
difficult to equate the reliability of the test to 
the relation between different sociometric ques- 
tions. Unreliability here may be a function not 
of the test's fallibility but rather an empirical 
demonstration that personal choice patterns 
vary directly with the activities in terms of 
which the choice is made. 

The one circumstance when it may be appro- 
priate to use the relation betwen separate socio- 
metric questions as a measure of the internal 
consistency of the test is when the investigator 
is using a cumulative choice score which is made 
up of the combined total of choices received on 
two or more criteria. If in these circumstances 
it is assumed that the scores or choices on the 


422 


various criteria are functionally equatable, it 
is perfectly legitimate to ask that there be con- 
sistency from one criterion to another. 

To talk of the reliability of sociometric meas- 
ures in general is like talking of the reliability 
of projective tests, questionnaires, or rating 
scales, without noticing that the range of reli- 
ability coefficients within any one of these tech- 
niques (depending upon the particular variety 
examined, the group being studied, the vari- 
ables used, etc.) is much greater than the dif- 
ference between these techniques. In reporting 
sociometric reliabilities it is a matter of great 
importance to specify the nature of the socio- 
metric question used, the group being studied, 
the period between administrations (if a test- 
retest method is used), the particular score or 
index used, the number of choices required, etc. 

In spite of the difficulties involved in studying 
the reliability of this instrument, there have 
been a large number of studies that provide 
information bearing on this issue. The inter- 
ested reader should consult Bell and French, 
1950; Bonney, 1943a,b; Byrd, 1951; Criswell, 
1939, 1949; French, 1951; Gottheil, 1951; Hunt 
and Solomon, 1942; Jennings, 1950; Moreno, 
1934; Newstetter, Feldstein, and Newcomb, 
1938; Northway, 1952; and Zeleny, 1939. The 
results of these studies, and others, suggest the 
following tentative generalizations concerning 
the reliability of sociometric measures: 


1. Most investigators report a relatively high 
degree of consistency in the sociometric pattern 
or sociogram over time, even though individual 
choices and rejections may fluctuate consider- 
ably (Moreno, 1934). It is important to ap- 
praise this generalization in terms of the data 
upon which it rests. As the analysis of socio- 
grams is almost solely impressionistic (made by 
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inspection) it is clear that this report is based 
upon impressions that investigators received 
when they examined two or more sociograms 
representing data collected from the same group 
at different times. 


2. With somewhat better quantitative evi- 
dence, one might apply the same generalization 
to indices or scores derived from sociometric 
data (Criswell, 1939; Jennings 1950). Al- 
though the particular choices and rejections 
may show considerable variation, the scores, 
e.g., choice score, emotional expansiveness, etc., 
show considerable consistency when observed 
over a period of time. 


3. The reliability of the instrument seems 
somewhat greater when it is used with adults 
than when it is used with children. This is a 
rather ambiguous finding, as it is not clear 
whether the higher reliability in adults is medi- 
ated by greater stability in interpersonal rela- 
tions or superior memory on the part of adults. 
Either interpretation seems consistent with 
other evidence. 


4. There is some evidence that the least im- 
portant or salient choices show the largest 
amount of change or unreliability. "Thus, Mo- 
reno (1934) reports that following a three- 
month interval there was a shift in 8% of first 
choices and 18% of second choices, and this 
finding was in general replicated by Criswell 
(1939) . 


5. The stability of sociometric choices ap- 
pears to increase with the passage of time dur- 
ing which the group has been in existence 
(Hunt and Solomon, 1942). There is a much 
higher incidence of change in sociometric choice 
during the early stages of group formation. 


VALIDITY 


Just as psychometric notions of reliability are 
difficult to apply directly to sociometric tech- 
niques, so too the conception of validity as 
"whether the test measures what it purports to 
measure" is not readily applicable. This diffi- 
culty in application is related to the question of 
whether the sociometric measure may be used 
in such a way that it purports to measure noth- 
ing more than verbal choice behavior, which it 
transparently does, and no further demonstra- 
tion of which is needed. It is also related to the 
possibility that one may specify numerous "pur: 
ports," all of which go beyond choice behavior 
and show considerable variation in the hands 
of different investigators. 

Face validity. If the investigator limits his 


interest to a measure of interpersonal choice, it 
is clear that the somewhat dubious concept of 
“face validity” is applicable here. The socio- 
metric measure is made up of interpersonal 
choices, and to talk of external measures or cri- 
teria to demonstrate that this is the case is to 
overlook the operational base of sociometric 
scores. Thus, where the investigator is willing 
to accept this limitation upon what he is meas- 
uring, it is clear that his problems are quite dif- 
ferent from those of the test constructor who 
devises a set of items to measure some general 
disposition or underlying attribute. In the 
latter case there is a clear need for an inde- 
pendent demonstration of the test's sensitivity 
to the variable of interest. This difference be- 
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tween sociometric measures and many other 
psychological tests has been pointed to by both 
P. N. Pepinsky (1949) and Jennings (1950). 

Data appropriateness. Even where the investi- 
gator is willing to limit his concern to socio- 
metric measures as indicative of verbal choice 
behavior he does not altogether escape prob- 
lems of evaluating the sensitivity of his meas- 
ure. First of all, there is the question of 
systematic biases on the part of his subjects that 
may lead to deception in their responses. For 
reasons of dislike, resentment, or insecurity the 
subjects may be motivated to give an untruthful 
picture of the individuals they would actually 
like to interact with in particular activities. 
Second, although the subject may not be moti- 
vated to deceive he may be unmotivated to give 
careful responses. The sociometric answers may 
be insensitive either because the subjects did not 
fully understand the nature of the question or 
because they were not willing to devote the time 
and effort necessary to provide a thoughtful 
answer. This might result either from too long 
a questionnaire or from a general lack of inter- 
est in the investigation. 

In both of these contexts the question of the 
sociometric questionnaire as a "motive" for the 
subject is worth stress. Moreno (1934) has em- 
phasized the importance of this consideration 
and his demand that the sociometric test lead 
to restructuring the group is aimed at eliminat- 
ing just such possible failures of cooperativeness 
or truthfulness. 

Relation to independent measures. Sociomet- 
ric measures are not used alone as indications of 
interpersonal choice patterns. In many cases 
investigators may equate an index derived from 
sociometric responses to morale, prejudice, social 
adjustment, or some other variable. In this 
case the relation of this measure to other meas- 
ures of the same variable becomes a question of 
considerable interest. Failure to observe sub- 
stantial relations is not, however, necessarily an 
indictment of the instrument's validity. If one 
defines minority group prejudice in terms of 
the extent to which an individual selectively 
chooses or avoids members of various minority 
groups it would not necessarily be an indication 
of the invalidity of this measure of prejudice to 
show that it did not correspond closely to preju- 
dice as measured by one of the conventional 
paper-and-pencil attitude scales. Such a finding 
would represent an interesting question for 
formulation and further investigation, but it 
would not prevent any individual who so wished 
from continuing to define prejudice in terms of 
sociometric responses rather than attitude ques- 


tionnaires. A similar consideration applies to 
differences between choice behavior as revealed 
in sociometric questionnaires and actual choice 
behavior as revealed in a study of overt be- 
havior. There are important determinants or 
conditions which operate to determine overt be- 
havior (physical constraints, social obligations, 
perceived likelihood of reciprocation, etc) that 
do not operate so strongly in the anonymous 
verbal choice situation. Difference between be- 
havior in the two situations is an interesting 
empirical finding but does not necessarily indi- 
cate that the one measure is invalid or insensi- 
tive. Rather it is confirmation of the fact that 
these two kinds of responses are differently 
determined, although closely related. Most of 
the reported studies dealing with this problem 
show moderately high correlations between 
sociometric choice and “actual” behavior, e.g., 
Byrd, 1951; Jennings, 1950; Danielsson, 1949. 
On the other hand, Polansky, Lippitt, and 
Redl (1950) report little relation between ob- 
served interaction and choices of whom one 
likes to be around. 

The multidimensionality of the sociometric 
test makes the task of demonstrating its validity 
a rather unending one. Separate demonstrations 
would be necessitated or entailed for each vari- 
able which might conceivably be measured by 
the instrument. 

Research utility versus validity. Actually, a 
far more reasonable question than that of 
whether a test actually measures the variable or 
variables it sets out to measure is the more gen- 
eral question of the extent to which it controls 
or relates to significant independent measures 
that are of interest to the theorist and investiga- 
tor. Whatever it is eventually decided the test 
measures, if it relates successfully to a large 
number of pertinent, independent measures, it 
is clear that the findings will be of value and 
interest in developing knowledge in this area. 
In the following section we plan to deal at some 
length with this question and, so far as any 
meaningful estimate of the usefulness of the in- 
strument is concerned, this is where one should 
look, rather than under studies specifically 
labeled validity studies. 

In summary then, an examination of the 
validity of sociometric techniques is made com- 
plex by (1) the possibility of limiting one's 
measurement interest to verbal, interpersonal 
choice, in which case no validity demonstration 
is necessary; (2) the possibility of specifying a 
large number of variables which it is intended 
the test shall measure, in which case the job of 
relating sociometric response to independent 
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measures becomes relatively unending; (3) the 
difficulty of interpreting failure of sociometric 


RESEARCH 


The empirical value or usefulness of a meas- 
uring instrument can be appraised in many dif- 
ferent ways. Here we propose to discuss the 
utility of our technique in terms of two major 
questions: 


l. What are some of the empirical concepts 
currently used by social scientists that can 
be defined in terms of sociometric opera- 
tions? 

2. What significant independent measures of 
behavior, or its environmental context, can 
be related to sociometric variables? 


Although these questions are designed especially 
to provide a representative picture of the actual 
and potential usefulness of sociometric instru- 
ments, similar questions might be asked 
concerning any psychological measuring instru- 
ment. If one views the main purpose of an 
empirical discipline as the building of stable, 
significant relationships that are explicitly 
interrelated (in the ideal case derivable from 
a general set of assumptions), then answers to 
the above questions can be expected to provide 
a relatively comprehensive picture of the likely 
contribution of any measuring tool. 


EMPIRICAL CONCEPTS DEFINABLE IN 
SOCIOMETRIC OPERATIONS 


Our purpose in this section is to discuss briefly 
some of the major variables of psychological 
interest that sociometric measures have been 
used to assess or appraise. Our consideration of 
the actual investigations that have employed 
sociometric measures to get at such variables 
will be highly schematic, as the number of rele- 
vant studies in many cases is extremely large. 
We shall be content here to give some impres- 
sion of the general ways in which these meas- 
ures have been used in connection with some 
of these variables. 

Leadership. Although measures of leadership 
are legion in the psychological literature, virtu- 
ally all of them have dealt with interaction be- 
tween individuals, usually in a group setting. 
It is consequently understandable that many 
psychologists have found sociometric techniques, 
useful in their studies of leadership. These in- 
struments have assumed a particularly impor- 
tant role as psychologists have come to think of 
leadership as a function of the individual, the 
activity he is engaged in, and the group he is a 
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measures to show a high relationship to inde- 
pendent measures of the variables of interest. 


Un 


member of, rather than as a quality that in- 
heres solely in the person. 

Perhaps the first sociometric study to address 
itself specifically to problems of leadership was 
Jenning’s (1937) longitudinal study of a 
single cottage in a school for delinquent girls. 
Using as her sociometric criterion choice of eat- 
ing compaion, she defined leadership as receiv- 
ing five or more choices — this accounted for 
roughly twenty percent of the group. She dis- 
cusses the constancies and the changes that ap- 
pear in the sociometric pattern over a two and 
a half year period. Further reports by the same 
author (Jennings, 1947, 1950) describe in more 
detail the study of this same institution. As 
part of her extensive investigation she selected a 
group of overchosen and a group of underchosen 
subjects, the overchosen representing persons 
who were one standard deviation above the 
mean in the number of social choices they re- 
ceived, while the underchosen were one stand- 
ard deviation below the mean. Approximately 
three months before the sociometric data were 
collected the girls in the institution had elected 
a House Council. From each cottage four coun- 
cil members were elected. In comparing socio- 
metric choice with the election results Jennings 
used only the highest two, as these persons some- 
times received such a preponderance of the votes 
that the individuals in the lower two positions 
were elected with very few votes. Examination 
of the two sets of data revealed that 90 percent, 
or eighteen of the twenty council members, 
were in the overchosen group. Further, the two 
persons who were not in this group fell only 
slightly below the necessary standard. The au- 
thor concludes: “When allowance is made for 
the difference between being chosen from a 
community-wide base and being elected from 
the limited house population, it is evident that 
there is practically a one-to-one relationship be- 
tween being elected to represent the house body 
in matters concerning the group and being 
chosen by community members on the socio- 
metric criteria of living and/or working with 
them" (p. 38). Bell and French (1950) used 
sociometric measures to assess leadership in in- 
formal discussion groups. 'They found that 
changing the composition of the group and the 
discussion topic did not produce marked 
changes in the leadership standing of most of 
the subjects. Individuals tended to receive sim- 
ilar sociometric choice patterns even in groups 
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with different members discussing different 
topics. 

A number of investigations have been con- 
cerned with leadership in the military service 
as measured through sociometric devices. 
Wherry and Fryer (1949) examined the utility 
of sociometric ratings as criteria of leadership 
and concluded that they serve this purpose sat- 
isfactorily. Williams and Leavitt (1947) found 
that frequency of sociometric choice, made in 
terms of the criteria of “all-round ability as a 
combat officer," was positively associated with 
ratings of combat performance. Stogdill 
(1949), in a study of Naval officers, defined 
leadership in terms of the number of times an 
individual was mentioned by other officers as a 
person with whom they spent time "getting 
things done." When this measure was related to 
other measures it was found to correlate with 
military rank, methods used with subordinates, 
and other measures concerned with how the in- 
dividual used his time. The sociometric data, 
when plotted graphically, also produced inter- 
esting information concerning the informal re- 
lations existing within the formal structure of 
the group. Gibb (1950) proposes a compre- 
hensive definition of leadership and then sets 
out to examine the relation between ratings of 
this dimension and three kinds of sociometric 
response. The three sociometric questions em- 
ployed the criteria of spending leisure time 
with, working with, and the person whose 
removal Irom the group would bring about the 
largest group change. 'Two sets of ten-man 
leaderless groups (college students and Army 
officer candidates) were studied. Observer rat- 
ings correlated in the neighborhood of .40 with 
sociometric choice for the criteria of spending 
leisure time and working together. Sociometric 
choice for the third criterion, however, corre- 
lated about .80 with the leadership ratings. 
Gibb's findings indicate both the imperfect 
association between sociometric choice and some 
aspects of leadership, and also the importance 
of the particular criterion that is used as the 
basis for the sociometric choice. 

Usually when the sociometric investigator is 
interested in leadership he has employed criteria 
that direct the respondents to think of leader 
functions, e.g., "Who would you choose to lead 
your squad on a combat mission?" In some 
cases, however, the investigator has attempted 
to infer information about leadership from 
sociometric questions not directly related to 
leadership, e.g., choice for roommate, eating 
companion. As Gibb’s findings imply, not all 
sociometric criteria are equally appropriate for 
the measurement of leadership, and the person 


who uses remote criteria may secure a relatively 
poor measure of leadership. 

A further difficulty involved in sociometric 
measures of leadership is the possibility that the 
actual leader of the group may not be distin- 
guished by the frequency with which he is 
chosen by other group members. Moreno 
(1934) described the individual who secured a 
few key choices and consequently occupied an 
influential position in spite of the fact that his 
choice status was quite low. Gibb (1950) has 
pointed to the distinction between the individu- 
al who is the social center of the group and the 
person who is the functional leader of the group. 
In some cases the leader and the person who has 
the highest choice status are the same, but this 
is by no means always the case. Hunt and 
Solomon (1943), in a study of summer camp- 
ers, report that individuals who stand out as 
leaders in the judgment of camp counselors are 
high in social choice but not clearly distinguish- 
able from the rest of the group on these 
grounds. Katz’s (1953) recent proposal of a 
status index that takes into consideration both 
the number of choices and the identity of the 
choosers may prove the most efficient means of 
meeting this difficulty. "The reader will find a 
more detailed discussion of leadership research, 
including sociometric studies, in Chapter 24. 

Our discussion implies that there are many 
pitfalls involved in the sociometric measure- 
ment of leadership. Further, it seems clear that 
any comprehensive study of leadership necessi- 
tates more information than can be supplied 
by this technique alone. However, sociometric 
measures can provide important data bearing 
on leadership and in certain circumstances can 
be used directly to measure this variable. 

Minority group prejudice, From the very 
outset of sociometric measurement there has 
been interest in applying this technique to the 
study of group cleavages and prejudice. Al- 
though in some instances sociometric measures 
have been used to assess individual prejudice, 
they have been used much more frequently to 
examine groups in search for cleavages that are 
established along minority group lines. 

In Who Shall Survive?, Moreno (1934) re- 
ports a study of public school children in which 
minority group cleavages were observed only 
beyond the fourth grade. He also introduces 
the notion of a "saturation point,” a point 

«beyond which introduction of further minority 
group members will produce cleavages and 
other group changes. Criswell (1942) has dis- 
cussed this concept and concludes that it is 
imprecise and difficult to measure, and so 
should be used only with great caution. 
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The first study to focus primarily upon mi- 
nority group cleavages was Criswell’s (1937, 
1939) investigation of Negro-white relations 
in a large number of urban classrooms ranging 
from kindergarten to the 8th grade. She meas- 
ured cleavage by comparing the choices directed 
within the "own" ethnic group to the frequency 
of choices that would be expected by chance. 
Initially, she found that sex appeared to be a 
much more important determinant of cleavage 
than ethnic group membership, and conse- 
quently she limited her group comparisons to 
individuals of the same sex. Her findings, in 
general, support Moreno's, as there appeared 
to be little cleavage between the Negro and 
white children prior to grade three, and by 
grade five the cleavage had reached its height. 
She also reported that withdrawal of one group 
is not always associated with withdrawal by 
the other group, and further that the nature 
of the cleavages observed was influenced by 
the relative proportion of white and Negro 
students present in the class. A detailed discus- 
sion of methods of measuring group prefer- 
ences using sociometric data is supplied by 
Criswell (1943). A study by Loomis (1943) 
employed a modified sociometric technique to 
study cleavages in two southwestern high 
schools. He found that Spanish-American and 
American students reported much greater asso- 
ciation with members of their own groups. The 
minority group, whether American or Spanish- 
American, showed a stronger tendency toward 
in-group selection and exclusion of the out- 
group than was shown by the majority group. 

Northway (1946) has indicated her belief 
that the sociometric technique can be used 
effectively to depict prejudice, and her sector 
target diagram (Northway and Quarrington, 
1946) provides one means of examining minor- 
ity group relations through sociometric data. 
The preliminary findings she reports suggest 
that social status varies with the ethnic group 
of the individual and that the size of the 
minority group has an important influence 
upon group relations. A novel approach to 
ethnic group cleavages, using the relational 
analysis, is reported by Goodnow and Tagiuri 
(1952). They found that religious groups 
tended toward in-group choice and, further, 
that social status varied with the group mem- 
bership. Most interesting was the finding that 
when asked to.guess who would choose them, 
the subjects showed a high degree of accuracy 
in their predictions. Thus the rejected minor- 
ity group appeared to be well aware of its status 
in the group. 

The use of sociometric response to provide 
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an individual measure of prejudice is described 
by Zeleny (1947a). He asked members of a 
class to make choices among the class members 
for membership on a committee established to 
combat prejudice. The “prejudice index” was 
derived from the number of times an individual 
was chosen or rejected by the other members 
of the class — the higher the index, presumably, 
the lower the prejudice. In the same study, 
Zeleny related his prejudice index to scores 
provided by a questionnaire designed to meas- 
ure willingness to accept Negroes and Spanish- 
Americans, and he found a moderate positive 
correlation between the two measures. Zeleny's 
method is somewhat cumbersome and indirect, 
since he depends upon the judgments of peers 
as to how prejudiced the person is. Further, 
the choices for committee members are bound 
to be influenced by personal considerations 
that have no relation to prejudice. 

Where the sociometric technique is adminis- 
tered to groups containing appreciable num- 
bers of one or more minority groups, a much 
more direct and satisfactory measure of preju- 
dice can be obtained. Various ratios or indices 
representing the relative choice by each in- 
dividual of minority and majority group mem- 
bers can be obtained. Such a measure possesses 
obvious advantages because it gets away from 
the abstract, class notion of the minority group 
member. Instead of asking the respondent to 
verbalize his attitudes toward the Protestant, 
Jew, Catholic, or Negro, he is asked to select 
those individuals with whom he would like to 
associate in various activities, and perhaps 
those with whom he would not like to associate. 
A measure based on this kind of evidence repre- 
sents not the subject’s generalizations concern- 
ing his relations with minority group members, 
when they have been identified as minority 
group members, but rather an indication of 
whether this person actually uses minority la- 
bels as criteria in the process of determining 
with whom he will interact. 

Thus, sociometric measures may provide a 
convenient means of-estimating the extent to 
which groups are divided or disrupted by with- 
drawal from minority members. They also 
permit, in certain circumstances, a sensitive 
portrayal of the extent to which prejudice is 
determining an individual's personal choices 
and rejections. 

Social adjustment. One of the measurement 
possibilities inherent in sociometric response is 
estimation of the effectiveness with which an 
individual interacts as a member of a group. 
The success of his interaction (social adjust- 
ment) may be equated to various choice and 
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rejection measures obtainable through socio- 
metric techniques. There are obvious difficulties 
involved in such a procedure, one of these being 
the danger of equating this measure to personal 
adjustment or freedom from conflict, anxiety, 
or disrupting psychological problems. Al- 
though an individual's social adjustment may 
be expected to bear considerable relation to 
his personal adjustment, the two variables are 
not the same. In particular cases there may be 
striking discrepancies between them, eg., the 
basis for an individual's social success may be a 
lack of personal security that makes it impos- 
sible for him to countenance rejection or loss 
of love. 

The use of sociometric methods to measure 
social adjustment is implicit in Moreno's origi- 
nal formulations. While he did not refer 
directly to his measures as indicators of social 
adjustment, this area of interest is involved in 
much of his writing. He implies that low 
choice status or high rejection status is evi- 
dence that the adjustment of the subject is not 
good. Concepts such as "isolate," "neglectee," 
and "rejectee" clearly connote adjustment. 

Jennings (1943, 1950) points out that descrip- 
tion of the individual as an “isolate” refers to 
the choice behavior of the group and provides 
no evidence of whether the isolated individual 
is isolated of his own accord or involuntarily. 
Similarly, being overchosen does not necessarily 
imply awareness of this on the part of the over- 
chosen, or that he is "happy" because of this 
status. Jennings suggests, reasonably, that the 
desirable sociometric pattern should be decided 
through consideration of the particular setting 
in which the choices are being made. She also 
implies that the mutuality and stability of 
choice are important considerations in any 
question of this sort. Northway (1944) has 
suggested distinguishing between social "ac 
ceptability' and “acceptance.” She suggests 
that sociometric data provide evidence of the 
"acceptance" of the person within a specific 
social setting. The social “acceptability” of the 
person can be measured only with a much 
wider range of information concerning reac- 
tions of people to this person in a variety of 
social settings. While this is a legitimate cau- 
tion, it is important to realize that a single set 
of sociometric responses is no better or worse 
than a single set of observations of any other 
kind. The problem of generalizing findings is 
always present and is no different with socio- 
metric data than elsewhere. 

A number of studies have shown that aspects 
of sociometric response are related to measures 
used to estimate personal adjustment. Many of 
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these studies will be summarized in a subse- 
quent section of this chapter. 

Our discussion has implied that the investi- 
gator who wishes to do so may use sociometric 
results directly as a measure of how well the 
individual adjusts to or is accepted in a social 
setting. However, it is dangerous to assume 
that such a measure reflects perfectly the per- 
sonal adjustment of the individual, although 
there is considerable evidence to suggest some 
relation between these two variables. 

Social sensitivity. In assessing the quality of 
interpersonal relations an important considera- 
tion is the accuracy with which an individual 
can identify the reaction he arouses in others. 
Presumably, the well-adapted person should be 
able not only to arouse positive reactions from 
others but should also be aware of this state of 
affairs, and ideally he should be able to identify 
those persons from whom the choices stem. 

The use of the sociometric self-rating or "re- 
lational analysis" provides a ready means of 
assessing this variable. 'The investigator may 
simply ask the respondent to indicate the gen- 
eral reaction of the remaining members of the 
group or he may ask for estimates of the in- 
dividual reactions of all other members of the 
group. Tagiuri (1951) has shown that the 
accuracy with which an individual can identify 
the reactions he produces in others is related to 
personal adjustment as well as to a complex of 
factors having to do with minority group mem- 
bership and the particular social setting in 
which the judgment is made. 

Social status. The concept social status has 
been associated with sociometric measures in 
at least two contexts. First, choice status under 
several definitions has been directly equated 
to social status in the group, and second, some 
investigators have attempted to demonstrate 
that high choice status would be associated with 
a favorable position in the general status struc- 
ture of the community. 

Of the many investigators who have used 
sociometric response to measure social status, 
Zeleny (1940a, 1941) was perhaps the first to 
propose a specific sociometric index under this 
label. He proposed that social status be meas- 
ured by the sum of choices times an intensity 
score divided by N — 1. More recently, Lemann 
and Solomon (1952) recommended measuring 
social status through the use of choice and 
rejection. Such indices have the advantage of 
not relying solely upon positive choice but 
rather including rejections and (reciprocally) 
indifference or no response. 

There is excellent evidence that social status 
as measured by sociometric measures is related 
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to other indices of the individual's group status. 
A number of studies to be discussed later in 
the chapter (Bonney, 19443; Loomis and Proc- 
tor, 1950; Lundberg and Steele, 1938; etc.) agree 
in indicating that socioeconomic status is posi- 
tively related to sociometric status. Further 
investigations (Lundberg and Beazley, 1946; 
Bonney, 1946) suggest that socioeconomic 
cleavages are accompanied by sociometric 
cleavages, or at least that there is a tendency 
for individuals choosing one another to be of 
similar socioeconomic status. Other measures 
of social status in the community, such as occu- 
pation (Loomis, 1946) and observer's judg- 
ments (Powell, 1951), also show association 
with sociometric data. 

Loomis, Beegle, and Longmore (1947) have 
presented a detailed critique of the concept of 
social class or social stratification and propose 
a procedure for testing empirically for the 
existence of social classes. "They suggest using 
a modified sociometric technique (family visit- 
ing patterns) to provide a means of finding out 
whether, within the particular group studied, 
individuals tend to cluster or form interper- 
sonal cleavages corresponding to the classes that 
have been used in describing the group. A 
study by Longmore (1948) illustrates the appli- 
cation of this method in a concrete setting. 

The use of sociometric data to provide in- 
formation concerning individual social status 
and social class cleavages appears both feasible 
and defensible. It is, of course, important to 
realize that there are many other sources of 
data that are important for a full representation 
of social status. 

Morale. Morale is considered an important 
dimension in the description of groups, and 
aspects of this dimension may be approached 
through sociometric techniques. Zeleny (1939) 
was perhaps the first investigator to attempt a 
direct measurement of morale through the use 
of sociometric data. He defined morale as the 
"shared feeling of like among members of a 
group" and proposed that it could be measured 
by means of the morale quotient, which was 
the number of choices a person received, times 
an intensity score divided by the total possible 
number of choices. This index could be com- 
puted either for the group as a whole or for 
the individual member. Zeleny was able to 
show that this index possessed reasonable stabil- 
ity on successive daily administrations and that 
it also correlated to a moderate and significant 
degree with the results of a questionnaire 
measure of morale, He demonstrated that, 


given the sociometric data, it was possible to 
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reform classroom groups so as to produce size- 
able increases in the morale quotient. 

Several of the studies to be discussed later 
in connection with military applications (Jen- 
kins, 1947; Goodacre, 1951) have found socio- 
metric measures useful, indicators of morale. 
Most of these studies have defined morale in 
terms of the relative amount of in-group choice, 
with some emphasis upon the existence of posi- 
tive choice of the formal leaders by members of 
the group. 

Again we find that an empirical concept 
widely used in social psychology may be defined 
either partially or wholly in terms of sociomet- 
ric data. It seems likely that the information 
provided by sociometric measures concerning 
group morale is at least as valuable as that pro- 
curable through the use of customary question- 
naire devices. 

Group structure. "The emphasis of most socio- 
metric research has been upon studying group 
structure, and a large number of measures have 
been used in describing this phenomenon. The 
various methods of matrix analysis, index 
analysis, and sociographic analysis are all as- 
sumed to be alternate means of intercepting or 
representing this structure. Concepts such as 
clique, subgroup, isolate, chains of communi- 
cation, index of cohesiveness, etc., are employed 
as means of describing specific aspects of the 
structure, 

Moreno (1934) dealt with group structure 
by comparing the relative distribution of scores 
or indices in different groups. For example, he 
examined the percentage of isolates, mutual 
pairs, triangles, and chains in different grades 
of schools and found that the grades could be 
distinguished in terms of these scores. He also 
showed that these same grades could be dif- 
ferentiated in terms of sex and minority group 
cleavages. The description of groups in terms 
of cleavages has been an object of considerable 
interest. The work of Criswell (1939; 1943) in 
measuring cleavages, group integration, and 
group preference, in addition to various publi- 
cations of Loomis (1943, 1946), is among the 
best known investigations in this area. Bron- 
fenbrenner (1944a) derived an index of group 
coherence from the number of mutual choices 
and found that the degree of coherence in- 
creased from kindergarten to the upper grades. 
He also found that the amount of differentia- 
tion within the group (cliques and cleavages) 
increased with the higher grades. Many other 
studies have used these and other measures 
derived from sociometric data to describe and 
classify groups, and some of these are consid- 
ered in Chapter 22, 
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Although sociometric data can be used to 
infer important information concerning group 
structure, it is obvious that many other sources 
of data are important for the full description of 
these structures. Information derived from 
direct observation of the group, the formal or- 
ganization of the group, the occupational and 
avocational associations within the group are 
all of considerable importance in this context. 


SIGNIFICANT MEASURES RELATED TO 
SOCIOMETRIC VARIABLES 


Information concerning the range of vari- 
ables to which sociometric response has been 
related is of interest both as a means of indi- 
cating areas of investigation in which the 
instrument may be useful and as an index of 
the general fruitfulness of the instrument. If a 
measure can be shown to relate to a number of 
significant variables, no matter how vacuous 
or psychologically uninteresting the content of 
the measure may appear, it will be of interest 
to many investigators. 

We have chosen to divide studies that have 
related sociometric responses to independent 
measures in terms of whether the study empha- 
sizes demiographic, cultural, intellectual, per- 
formance, attitude, or personality variables. 

Demographic variables. ‘There are numerous 
studies that show some aspect of sociometric 
response to vary with age, sex, religion, socio- 
economic status, or educational level. We limit 
our interest here to a representative group of 
such studies. 

Early in the use of sociometric measures it 
was found that socioeconomic slatus in many 
settings showed a positive correlation with 
sociometric choice and, further, that there was 
some tendency for individuals to choose other 
individuals within the same socioeconomic 
class. A study of personal relationships in a 
small New England village by Lundberg 
(1937), Lundberg and Lawsing (1937), and 
Lundberg and Steele (1938) revealed that socio- 
economic status was positively related to the 
frequency of sociometric choice and that in- 
dividuals tended to choose individuals who 
were slightly higher in socioeconomic status 
than the chooser, Another community study, 
conducted by F. A. Stewart (1947a) in a south- 
ern town, indicated that individuals who were 
chosen as most influential tended also to be the 
wealthiest. Investigation of a group of fifth 
grade students by Bonney (1944a) indicated 
that sociometric status was positively related 
to socioeconomic status. Grossmann and Wrigh- 
ter (1948) found that in a sixth grade class- 
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room socioeconomic status as measured by 
the Barr Scale was related to social status. An 
investigation of agricultural county agents and 
librarians who were participating in a univer- 
sity workshop indicated that sociometric status 
was positively related to annual income (Loo- 
mis and Proctor, 1950) . 

Evidence for the tendency of individuals to 
choose other persons of the same socioeconomic 
background was supplied by Bonney (19462), 
who reported that among fifth grade students 
mutual sociometric choice is accompanied by 
similarity in socioeconomic status. In a similar 
study conducted in a woman's college, Lund- 
berg and Beazley (1948) used two measures of 
socioeconomic status (tuition reduction and 
self-support) and found that, using either 
measure, individuals tended to choose other 
persons within the same socioeconomic bracket. 
Using family visits to permit sociometric repre- 
sentation of a German village, Loomis . (1946) 
was able to show that occupational role was an 
important determinant of sociometric cleavages. 

Related to socioeconomic status is the ques- 
tion of association between family size and so- 
ciometric variables. A series of studies carried 
out by Bonney (1942, 1943b, 1944, 1949) have 
indicated that the number of siblings appears to 
influence social choice, particularly at the ex- 
tremes. Thus individuals who are “only” 
children appear to secure more than their share 
of positive choices, while at the college level 
individuals who have six or more siblings tend 
to receive less than their share of positive 
choices. Loomis, Baker, and Proctor (1949) 
found little evidence of any relation between 
family size and social status in a group of col- 
lege freshmen. 

There is relatively clear evidence that religion 
and ethnic group membership are important 
determinants of choice status as well as clique 
formation. Moreno (1934) and Criswell 
(1939) both report convincing evidence that 
sociometric cleavages are established with mi- 
nority group membership as a primary de- 
terminant. A recent study by Goodnow and 
Tagiuri (1952) shows both the relation between 
social status and religious-ethnic grouping and 
also the tendency to form cleavages around 
these distinctions. Lundberg and Steele (1938) 
found membership in the same church to be 
one of the most important determinants of 
clique structure in a small New England village. 
Bonney (1949) found no relation between 
church membership and the number of choices 
received in a large college sample. He did find, 
however, that those individuals who were not 
affiliated with any church received significantly 
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more choices than they would be expected to 
by chance alone, Further, six of the seven 
groups Bonney studied showed significant 
tendencies toward in-group selection — they 
made more choices within their own religious 
group than would be expected by chance. 

Beginning with Moreno's early investiga- 
tions, there has been an increasing amount of 
evidence that sex differences are related to both 
social status and clique formation. One of the 
earliest and most enduring of the group cleav- 
ages that appear in our society is the one based 
on sex (Moreno, 1934; Criswell, 1939). A 
group of teen-age campers are reported by 
Faunce and Beegle (1948) as showing marked 
sex cleavages. Further, there is interesting evi- 
dence that social status, depending upon age 
and the particular group, is related to the sex 
of the individual. F. A. Stewart (1947b) re- 
ports that in a southern town women selected 
slightly more men than women as influential, 
while men selected almost 30 percent more men 
than women. This difference was observed in 
spite of the fact that subjects were asked to 
name both men and women. Several studies by 
Bonney (1942a,b, 1944b) have suggested dif- 
ferences between sexes in elementary school 
children, generally showing girls to have higher 
choice status. 

A number of studies, e.g, Lundberg and 
Beazley (1948), Lundberg, Hertzler and Dick- 
son (1949), Festinger, Schachter, and Back 
(1950), have shown that choice varies with 
physical proximity. Individuals who are near 
to one another are more likely to choose each 
other than persons who are more remote. 
Danielsson (1949) reports that in a nonliterate 
society rejection is positively related to physical 
distance, e.g, the most distant groups within 
the prescribed area had the highest percentage 
of rejection. However, in none of these studies 
was acquaintance or degree of contact con- 
trolled. 

Level of education frequently appears as a 
sociometric correlate. F. A. Stewart (1947a) 
concludes that in Southtown education was 
"virtually a prerequisite" for occupying an in- 
fluential position. In a study of college stu- 
dents, Bonney (1949) reports that those 
individuals who had been in school for less 
than one semester received significantly fewer 
than the expected number of choices, while 
students who had been in school for more than 
six semesters obtained significantly more than 
the expected number of choices. Vreeland 
(1942) had fraternity members rank other mem- 
bers of their group in terms of how friendly 
they felt toward them. Results indicated a 
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strong tendency for individuals to direct their 
preference within their own class and, further, 
individuals in upper classes were given higher 
ranks than those in lower classes. 

Age also is known to show association with 
sociometric variables. F. A. Stewart (1947a) 
found in his community study that individuals 
selected as influential tended to be older than 
individuals not so selected. Moreno (1934) re- 
ported characteristic differences in sociometric 
patterns that could be expected to appear with 
increasing age from shortly after birth on. Both 
Moreno (1934) and Criswell (1939) have 
provided evidence that various kinds of group 
cleavages are especially likely to occur in par- 
ticular age groups. Faunce and Beegle (1948), 
studying adolescent summer campers, found 
that age was a significant determiner of cleav- 
ages. 
Cultural variables. The application of socio- 
metric techniques to cross-cultural study is still 
in an embryonic stage. The possible value of 
such a union has been considered by several 
authors (Mead, 1947; Ashley-Montagu, 1945), 
but only a small number of anthropological 
investigations have been reported that make 
use of sociometric procedures. Perhaps the best 
example of these is Danielsson's (1949) study 
of attraction and repulsion among the Jibaro 
Indians, a South American head-hunting tribe. 
The author examined three subgroups within 
the tribe and also compared some of his find- 
ings with those reported by Lundberg (1937) 
from observation of a Vermont village. Daniels- 
son found that for sociometric variables such 
as cohesiveness and interaction the three sub- 
groups differed markedly among themselves 
and also differed from Lundberg's data from 
our own society — the Jibaro groups showed a 
higher level of interaction. The author dia- 
grams the sociometric position of various “key” 
figures in the subgroups and discusses these re- 
sults in view of the information he has about 
the status and function of these individuals. 
In brief, the investigator finds that sociometric 
procedures serve a useful function in a field- 
work setting. Loomis and Powell (1949) used 
sociometric measures in comparing class struc- 
ture and friendship patterns in two Costa Rican 
communities. One of the communities was 
characterized by peasant-owned farms, while in 
the other community most members of the com- 
munity were employed on a large estate. The 
two groups differed not only in class structure 
but also in terms of such variables as the rela- 
tive number of sociometric choices directed 
outside the group and the number of cliques 
formed within the group. A sociometric com- 
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parison of stable and changing neighborhoods 
in two large cities by Deutschberger (1946) re- 
vealed specifiable subcultural differences. In- 
dividuals in changing neighborhoods made 
significantly fewer choices and directed these 
choices more to their own neighborhood. 

Intelligence and performance. Most studies 
that have related sociometric variables to in- 
telligence have found some relation between 
status and intelligence. Thus, Bonney (1944) 
found that sociometric choice in a group of ele- 
mentary school children was positively related 
to intelligence as measured by group intelli- 
gence tests, and he concludes that the upper 
quartile in social choice’ was distinctly superior 
intellectually to the lower quartile. This same 
finding held for third, fourth, and fifth grades. 
An investigation of a group of sixth grade stu- 
dents carried out by Grossmann and Wrighter 
(1948) related intelligence as measured by the 
Stanford-Binet to choice and rejection. The 
authors report that the group that was high in 
sociometric status was significantly higher in 
intelligence. They suggest, however, that high 
intelligence does not necessarily increase the 
likelihood of choice over that of the person of 
average intelligence. 

There is slight evidence that individuals may 
choose other individuals who are similar to 
them in intelligence. Bonney (1946) found 
that a group of elementary school children who 
mutually chose each other were more similar 
in intelligence, as measured by group tests, than 
a corresponding group of subjects who did not 
choose each other. However, Lundberg and 
Beazley (1948) found little tendency for female 
college students to select other students who 
were similar in ability as measured by an aca- 
demic aptitude test. 

If achievement and performance are divorced 
from economic factors, there is only limited 
evidence indicating association between socio- 
metric variables and these factors. Bonney 
(1946) found very little relation betwen friends 
in their academic achievement as measured by 
a reading test. Lundberg and Beazley (1948) 
found no significant tendency for female col- 
lege students to select individuals in their own 
scholarship group. On the other hand, a study 
of sixth grade students conducted by Gross- 
mann and Wrighter (1948) revealed sociomet- 
ric status to be related to reading ability, at least 


to the extent that individuals in the lowest | 


quarter of their group in social status were 
significantly lower in measured reading ability 
than those in the upper quartile of social choice. 
Interested in a different kind of performance, 
Hunt and Solomon (1942) found that coun- 


selor ratings of athletic ability correlated .60 
with a sociometric measure of status in the first 
week of a summer camp but dropped to only 
.25 by the eighth week of the camp. Smith 
(1944) found a slight tendency for individuals 
engaging in similar athletic activities to select 
each other more often than would be expected 
by chance. Again most of these studies do not 
control for degree of contact or acquaintance. 

Attitude and value. In view of the preoccu- 
pation of many social psychologists with the 
measurement of attitudes it is no surprise to 
find that many studies have been conducted 
exploring relations between sociometric vari- 
ables and attitude measures. In general, this 
research suggests that within certain groups 
maintenance of particular attitudes is corre- 
lated with sociometric status and, further, that 
individuals who choose one another tend to be 
similar in their attitude structure. 

One of the most interesting of the studies 
concerned with attitudes is Newcomb's (1943) 
attempt to assess the role of personal and com- 
munity factors as determinants of attitudinal 
shifts in a small college for women. He found 
that liberal political and economic views were 
positively associated with sociometric choice. 
This was equally true when the choice was 
made in terms of "community prestige" and 
when it was made in terms of an individual to 
serve as an enduring friend. When individuals 
were assigned "community reputation" scores 
by means of a "guess who" technique, those 
individuals who received the highest scores were 
the least conservative. An unpublished study 
by Smelser (1952) related attitude measures to 
sociometric dimensions in a sample of six fe- 
male college dormitories. His attitude ques- 
tionnaire measured extremeness or conservatism 
of attitudes in eight areas: dominance-submis- 
siveness in the female courtship role, career 
versus marriage, sexual freedom versus sexual 
restraint, motherhood, glamour, political-eco- 
nomic conservatism, religious conservatism, and 
ethnocentrism. When the results for the six 
dormitories were combined there was no evi- 
dence of any association between choice status 
and the eight attitude dimensions measured. 
When the dormitories were considered indi- 
vidually, however, there was evidence for a 
significant relation between all of these atti- 
tudes and social status, although the direction 
and magnitude of the correlations varied con- 
siderably between the different dormitories. 
Smelser concluded that there were separate 
norms or social climates that operated within 
the dormitories and that these were much more 
important in determining social status than any 
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college-wide norm. He also divided his groups 
into cliques, using the method of Forsyth and 
Katz (1946), and found his cliques to be gen- 
erally homogeneous in their attitude structure. 

One of the earliest studies concerned with 
the similarity of attitudes among friends was 
Winslow's (1937). He administered a long 
questionnaire to a group of 86 students and 
asked each of them to have a friend fill out the 
questionnaire. When the scores of friends were 
correlated for measures of attitudes toward the 
Negro, American foreign policy, current eco- 
nomic policy, religion, and domestic political 
policy they were found to be significantly re- 
lated in a positive direction for all but attitudes 
toward foreign policy. Even here the relation 
was positive and approached significance. A 
study by Richardson (1940) used observational 
data to determine patterns of friendship and 
found that friends, both female college students 
and adult women, tended to resemble one an- 
other in values as measured by the Allport- 
Vernon Study of Values. Bonney (1946) 
reports a significant positive relation between 
the interests, as measured by the Kuder Prefer- 
ence Record, of individuals choosing each 
other. Using an interest inventory for elemen- 
tary school children, he found little evidence 
for any similarity in the attitudes of individuals 
choosing each other. If we accept the subject 
that an individual selects for his college major 
as implying certain interests or attitudes, there 
are several studies that demonstrate definite 
in-group choice tendencies on the part of in- 
dividuals who have made the same educational 
choice, e.g., Lundberg and Beazley (1948), 
Lundberg, Hertzler, and Dickson (1949). It is 
difficult to interpret such findings, as it is 
clear that other factors, such as physical pro- 
pinquity, operate in addition to the possible 
attitudinal similarity. In a postwar study of 
a German village, Loomis (1946) has provided 
evidence that sociometric cleavages may be at 
least partly determined by political attitudes, 
as inferred from membership in the Nazi party. 

Personality. It is a matter of absorbing in- 
terest to both professional and lay audiences 
that particular patterns of interpersonal rela- 
tions may be accompanied regularly by speci- 
fiable motives or aspects of personality. The 
study of motivational variables in relation to 
sociometric measures has been characterized 
by many of the shortcomings that have been 
manifest in other areas where motivational 
analysis has been a topic of recent interest. 
A major weakness derives from a reliance upon 
measures that are easy and inexpensive to ad- 
minister but whose sensitivity to motivational 
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variables is highly suspect. A further deficiency 
has resulted from the tendency to explore rela- 
tions between sociometric findings and personal- 
ity attributes “at random," with little or no 
attempt to engage in conceptual analysis in 
advance of investigation to permit prediction of 
the kinds of variables that may be expected to 
associate with sociometric data. 

Among psychologists, measurement of per- 
sonality variables through the use of rating 
scales is an old but not especially honored 
method. The most important questions to be 
asked of any study using this approach are the 
care with which the variable has been defined, 
the adequacy with which the raters have been 
trained, and the certainty that the raters have 
been kept from knowing of the variables to 
which their ratings are to be related. Very few 
of the studies to be discussed have answered 
these questions satisfactorily. 

In a recent study of three small women’s dor- 
mitories in an urban college, Lemann and 
Solomon (1952) studied six personality vari- 
ables. All of the variables were bipolar and 
the investigators divided them into two cate- 
gories: alpha, those ratings that ran from good 
to bad (generous-stingy, affectionate-cold, en- 
thusiastic-apathetic) ; and beta, those that ran 
from bad to good to bad (dominating-submis- 
sive, shy-bold, stubborn-yielding). Their find- 
ings indicated that the three status groups were 
clearly differentiated on the alpha scales but 
not on the beta scales. The high in social status, 
as measured by a combined choice and rejection 
score, were characterized as generous, enthu- 
siastic, and affectionate, When the subjects 
were grouped in terms of their noticeability 
or their indifference score (the number of 
combined choices and rejections subtracted 
from the total number of responses possible) , a 
relationship was found between this division 
and ratings received on the beta scales. In- 
dividuals low in noticeability are rated as 
dominating, bold, and stubborn. The investi- 
gators also secured a measure of insight based 
on the discrepancy between self-ratings and 
other-ratings for the same variables, This index 
did not appear to be related to social status. 

It is apparent that having the same individu- 
als perform the personality ratings and the 
sociometric ratings poses certain problems in 
the interpretation of any association between 
these variables. Lemann and Solomon suggest: 
"It may be that a general ‘halo effect’ surround- 
ing girls with high status causes them to be 
rated favorably on most traits" (p. 35). 

In a similar investigation, French and Mensh 
(1948) studied 34 college students who were 
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members of the same sorority. The subjects 
were asked to make three choices and three 
rejections for roommates and also rated each 
other on punctuality, sociability, fair-minded- 
ness, intelligence, self-confidence, and sense of 
humor. Social status was measured by dividing 
the group by inspection into the majority in- 
group and nine isolates. The majority group 
was then further subdivided into those who 
received rejections and those who did not. This 
resulted in three groups of varying social status. 
A comparison of these three groups in the 
personality ratings they received indicated that, 
in general, the individuals high in choice status 
were rated highest on those personality vari- 
ables that are normatively most valued. 

In considering the association they found 
between social status and rated personality 
variables the authors suggest a number of pos- 
sible explanations: "Do individuals attain high 
status because they manifest these traits? Or 
do they develop these traits in consequence of 
having attained high status? Or are they merely 
thought by raters to stand high in these re- 
spects because they have high status or are 
admired as individuals? Or does high status 
permit the conspicuous display of these traits 
without there necessarily being any fundamen- 
tal personality differences?" (p. 344) 

The findings of these two studies are both 
interesting and consistent with what we would 
expect if personality factors were closely related 
to social choice. However, the interaction be- 
tween sociometric ratings and personality rat- 
ings makes such findings of little significance 
as evidence of the relation between interper- 
sonal relations and personality variables. ‘To 
secure data that will be pertinent it is necessary 
that the personality ratings be provided by 
someone other than the individuals making the 
social choice. 

Beginning with Seagoe's (1933) investigation, 
there have been a number of studies which 
have shown that sociometric choice is related 
to behavior ratings secured through "guess 
who" techniques. Although these studies meas- 
ure personality somewhat more indirectly than 
ordinary rating methods, they nevertheless seem 
open to the same criticisms that have been 
leveled at the studies discussed above. 

Probably the majority of studies concerned 
with personality and sociometric choice have 


relied upon paper-and-pencil questionnaires | 


for measurement of personality dimensions. 
The following investigations are quite typical 
of a much larger number that have been carried 
out during the past twenty years. In a study 
conducted prior to the public description of 


the sociometric technique, Flemming (1935) re- 
quired a group of students to nominate their 
best friends and in addition fill out question- 
naires designed to measure such variables as 
pleasingness of personality, steadiness of emo- 
tional response, social adjustment, introversion- 
extraversion, and social intelligence. The 
reported findings indicated that friends were 
similar to each other in most of these charac- 
teristics, especially pleasingness of personality 
and social adjustment. An early investigation 
of a large number of elementary school children 
by Pintner, Forlano, and Freedman (1937) used 
questionnaires to measure ascendance-submis- 
sion, extraversion-introversion, superiority-in- 
feriority, and emotional stability. In addition, 
the children were asked to name their three 
best friends. The authors found no relation 
between friends in the measured personality 
characteristics. Van Dyne (1940) investigated 
the relation between personality factors and 
friendship in a group of 42 female summer 
camp members. "The campers were asked to in- 
dicate the five persons they would most like 
to have as friends, and they also filled out the 
Bernreuter Personality Inventory. Correlational 
findings indicated that girls tended to choose 
as friends other girls who were similar to them 
in dominance and stability. Although the cor- 
relations for these two variables were significant 
they were very low in absolute magnitude. 
There was no evidence of any similarity be- 
tween friends for the variables of self-sufficiency, 
emotional stability, introversion, and self-con- 
fidence. A group of 80 children in an elemen- 
tary school were studied by Bonney (1944), 
using the California Test of Personality and a 
multiple criteria sociometric questionnaire. He 
found the total adjustment score of the inven- 
tory correlated. .49 with sociometric status. 
Consistently, when the subjects were divided 
into quartiles in terms of social choice, the 
quartiles were arranged in a regular order with 
the high in social choice also the highest in 
mean adjustment scores. A more recent study 
by Bonney (1946) showed little association be- 
tween friends in their scores on the California 
Test of Personality in either elementary school 
or college groups. There was, however, a mod- 
erate positive correlation between friends in 
their scores on the Social and Emotional dimen- 
sions of the Bell Adjustment Inventory. In a 
similar study, Grossmann and Wrighter (1948) 
found that sixth grade students who were very 
high in sociometric status secured much higher 
total adjustment scores on the California Test 
of Personality than did a similar group of stu- 
dents who were very low in sociometric status. 
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The upper and lower quartiles in social status 
of a large number (692) of ninth grade stu- 
dents were compared by Kuhlen and Bretsch 
(1947) in their responses to the Mooney Prob- 
lem Check List. The findings indicated that 
the low in social status tended to check a sig- 
nificantly larger number of problems as “often” 
present, but the two groups were not differen- 
tiated in the frequency with which they checked 
problems as "sometimes" present. The pressing 
problems of the low in social status centered 
around social skills, lack of status, family prob- 
lems, and dislike of school. A large number of 
fifth and sixth grade students were divided into 
three social status groups by Baron (1951) and 
compared in their responses to a mental health 
inventory. In general, the inventory appeared 
to differentiate between the three status groups, 
with the low in social status making more “un- 
favorable" responses than either of the other 
groups. Lindzey and Urdan (1954) related 
measures of security, dominance, happiness, and 
achievement derived from self-ratings, question- 
naires, and a sentence completion test to various 
sociometric measures, including sociometric 
status and clique structure. They concluded 
that the questionnaire measures showed much 
stronger relations to the sociometric indices 
than either of the other two types of measures, 
although there was some reason for questioning 
the adequacy of the projective measure of these 
variables, They also found evidence for a 
curvilinear relation between social status and 
the variables of dominance and security, i.e., 
high social status was associated with median 
security and dominance. 

Although the findings of these studies are not 
altogether consistent, there does appear to be 
some evidence that personality factors as meas- 
sured through self-inventories relate to aspects 
of sociometric response. 

A small number of studies have attempted to 
measure personality factors through interviews, 
collection of observational data, or some com- 
bination of the two methods. Perhaps the first 
of these was reported by Richmond (1936). In 
her paper she indicated simply that when stu- 
dent nurses were interviewed after sociometric 
data had been collected, the impressions they 
created in the interviewer were consistent with 
the interpersonal data. Another study in this 
area, carried out by Northway (1944), was fo- 
cused upon children who were very low in 
choice status. She collected personality data 
through interviews and observation of children 
in the fifth and sixth grades who were in the 
lowest quartile in social choice. This data sug- 
gested to Northway that there were really three 
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distinct types of “outsiders,” or individuals low 
in social status. "The first group was made up 
of the "recessive children" who were low in 
energy, possessed little or no drive, and in gen- 
eral created a picture of listlessness and general 
lack of motivation. Second, there were the 
"socially disinterested children" who were less 
defective in general energy and motivation but 
who did not seem interested in gaining social 
approval. Finally, there were the "socially in- 
effective children" who were characterized by 
aggression, boastfulness, and other asocial re- 
sponses but who appeared strongly motivated 
to secure social acceptance. In a study of over 
400 adolescent delinquent girls Jennings (1947) 
compared the overchosen (l standard devia- 
tion above the mean in frequency of choices 
received) and the underchosen (1 standard de- 
viation below the mean). She concluded from 
observational and interview evidence that the 
overchosen showed a large number of distinc- 
tive qualities, among which were ability to 
establish rapport quickly and effectively, fair- 
ness, exhibiting anger primarily towards those 
they consider "should know better," securing 
more and more responsibilities for members of 
their groups. The underchosen more frequently 
interfered with the group activities, showed 
little initiative in starting projects or promot- 
ing understanding among group members, and 
in general seemed to impede rather than facili- 
tate interpersonal relations. 

If we consider accident proneness a personal- 
ity variable, there are several studies that have 
used observational evidence to relate this vari- 
able to sociometric indices. A study by Fuller | 
and Baune (1951) of third grade students 
found that when the subjects were divided into 
upper, middle, and lower thirds in terms of 
social choice there appeared to be a descending 
order of accident proneness; ie, the most 
popular children tended to have the smallest 
number of accidents. Not only did the children 
high in sociometric choice tend to have fewer 
accidents in general, but those accidents they ` 
did have were primarily nonsocial, that is, they 
did not occur while interacting or playing with 
other children. On the other hand, the children 
low in social choice tended to have a relatively 
high incidence of social accidents, A related 
study was carried out by Speroff and Kerr (1952) 
with a group of 90 steel mill workers. They 
found that a social status score based on socio- 
metric choice and rejection showed a highly 
significant relation with the frequency of in- 
dustrial accidents, the low in social status hav- 
ing accidents more frequently. 

Several relatively well-controlled studies have 
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attempted to relate sociometric response to per- 
sonality factors as measured through the use 
of differentiated groups. A recent study by 
French (1951) attempted to measure adjust- 
ment through psychiatric referrals, sick-bay 
attendance, and. disciplinary offenses, and re- 
lated these indices to a sociometric measure of 
social status. In the large group of Navy en- 
listed men studied, sociometric choice was 
negatively related to sick-bay attendance and 
disciplinary offenses. The number of psychiat- 
ric cases was not large enough to provide an 
adequate test of the relation to sociometric 
data, although the evidence observed was in 
the expected direction, suggesting that those 
individuals who had undergone psychiatric ex- 
amination were relatively lower in social status. 
Tagiuri (1951), employing the method of re- 
lational analysis, compared three groups of 
prep school students — those who were success- 
ful (good athletes, high grades, officeholders) , 
average students, and maladjusted students 
(seeing a psychiatrist). He found that the 
maladjusted students were characterized by low 
social choice and a small number of guesses that 
others would choose them. Further, the success- 
ful students tended to be higher than the aver- 
age students on both of these variables. 

The use of projective techniques for measur- 
ing personality variables to be related to socio- 
metric measures is just beginning to attract 
attention. The relative paucity of studies in- 
volving this method of measurement, widely 
considered most sensitive to covert aspects of 
motivation, is in large part attributable to the 
relative effort involved in the collection and 
analysis of such data and the small number of 
individuals skilled in the use of these tech- 
niques who have shown interest in group data 
or patterns of interpersonal relations. 

Northway and Wigdor (1947) have com- 
pared Rorschach responses of three groups of 
eighth grade boys and girls who were high, low, 
and middle in their sociometric status. Each 
group contained fifteen students. The reported 
findings indicate that the three groups were 
differentiated in response to the test, although 
it is rather difficult to evaluate these results in 
view of the dangers involved in quantitative 
treatment of untransformed Rorschach scores. 
Northway and Wigdor indicate that the high 
in status are characterized by “greater sensitivity 
in sensing the feelings of others, and a conscious , 
striving for the approval of others” (p. 196). 
The middle group “. . . seems a more 
shallow, less introspective group with few 
anxiety or emotional disturbances” (p. 194). 
The low group “seems to be the most 


seriously disturbed. They show less ability to 
control their emotions, and seem to be a more 
egocentric, moody, and impulsive group” (p. 
194). All of these interpretations are based 
upon differences between the groups on specific 
Rorschach scoring dimensions. In a further 
analysis the different status groups were com- 
pared in a general clinical interpretation of 
signs of disturbance. The low social choice 
group was divided into two subgroups, one 
made up of the recessives and the other the 
aggressive children. The findings indicated 
that there were more serious signs of disturb- 
ance in the recessive group than in any other. 
Pepinsky, Siegel, and Vanatta (1952), as part of 
an ingenious attempt to develop a criterion 
that could be used in evaluating therapeutic 
change, related sociometric choice to perform- 
ance on the group Rorschach, Correlation of a 
large number of Rorschach variables with the 
number of sociometric choices given and re- 
ceived provided no evidence of any association 
between these measures. Multiple correlation 
analysis suggested the possibility of a relation 
between the number of choices given and two 
Rorschach scores (XM, Dd%). 

A study by Lindzey and Goldwyn (1953) ex- 
amined the relation between sociometric status, 
measured by choice and rejection, and certain 
P-F Study scores and TAT variables. A corre- 
lational comparison of a selected number of 
P-F Study variables and social status revealed 
very low correlations in the expected direction, 
some of which attained statistical significance. 
Further, when the ten individuals highest in 
social status were compared with the ten in- 
dividuals lowest in social status, significant 
differences were observed on several of the P-F 
Study variables. The findings indicated that 
the high in social status were less extrapunitive, 
more intrapunitive, and showed lower group 
conformity ratings than the low in social status. 
TAT story ratings designed to measure anxiety 
and aggression did not relate to social status. 
In a study of student nurses Holzberg and Pos- 
ner (1951) used a sociometric technique with 
criteria designed to secure information about 
the assertiveness of the group members. They 
found a significant positive relationship be- 
tween sociometric status in this situation and 
scores on the A-S Reaction Study, as well as 
with ratings of assertiveness carried out by su- 
pervisors. There appeared to be no relation 
between sociometric scores and performance 
on the P-F Study or ratings of assertiveness de- 
rived from a modified TAT. Miller and Stine 
(1951) used a story-completion test in an effort 
to relate pregenital fixations to social choice as 
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measured by a sociometric questionnaire. The 
expectation that individuals who were most 
characterized by pregenitality, as evidenced by 
the story-completion test, would be most re- 
jected was, in general, verified. The investi- 
gators divided their subjects, primary school 
children, into five status groups in terms of the 
incidence of choice and rejection, and found a 
general increase in rated pregenitality as the 
degree of rejection increased. When the mean 
pregenital scores of the two high-status groups 
were combined and compared with the means 
of the low-status groups for each of eleven pre- 
genital variables (orality, anality, passivity, 
direct aggression, etc.) , the low-status group was 
higher on ten of eleven variables, five of these 
differences reaching the five percent level of 
significance. Thus, in adequately controlled 
circumstances, there appeared to be evidence 
that social choice was related to psychoanalytic 
variables as measured through the use of indi- 
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rect or projective methods. 

In general, we may suggest that the evidence 
is moderately convincing for the existence of 
regular relationships between certain sociomet- 
ric dimensions and attributes measured by 
conventional personality instruments. Personal- 
ity, at both overt and covert levels, appears to 
relate to sociometric variables in significant 
respects. 

The purpose of this section has been to 
sample briefly some of the studies that have 
shown sociometric data to be associated with 
variables of psychological interest. In spite of 
serious limitations in the conceptions and pro- 
cedures typical of many sociometric inves- 
tigations, there appears to be evidence for 
substantial relations between aspects of be- 
havior measured by sociometric techniques on 
the one hand and demographic, cultural, intel- 
lectual, attitude, and personality variables on 
the other. 


SocioMETRIC MEASURES IN “ACTION” 


Moreno's initial insistence that sociometric 
measures could be used properly only when 
the groups studied were restructured in the 
light of sociometric findings has led to a heavy 
tradition of “action” use of these measures. 
The earliest sociometric studies were carried 
out in settings where the findings were used to 
implement or modify practical programs of 
education or therapy. The ease with which 
the results of sociometric measures can be 
transformed into practical recommendations 
has become one of their most attractive features. 
Moreover, the use of sociometric findings to 
introduce change or restructure into the phe- 
nomena being studied does not preclude the 
likelihood of observing important empirical 
findings. In some cases this likelihood is actu- 
ally enhanced. 

Our discussion of sociometric applications is 
organized in terms of whether the application 
is primarily in the field of education, industry, 
military service, mental hospitals, or commu- 
nity service. 


EDUCATIONAL APPLICATIONS 


Sociometric measures first appeared in con- 
nection with the study of public school students 
and the inmates of a school for female delin- 
quents (Moreno, 1934). The passage of time 
has seen continued application of these meas- 
ures in educational systems and correctional 
institutions. Not only do schools present groups 
that are well designed to facilitate the use of 


sociometric measures, but also many of the 


, 


problems faced by teachers and school adminis- 
trators can be approached constructively 
through the use of sociometric measures, ‘Thus, 
formation of cohesive, smooth-working groups 
is facilitated by information that the sociomet- 
ric measure can provide. The task of sensitizing 
the teacher to the existence of interpersonal 
difficulties within her classes is facilitated by 
sociometric data, and efforts to combat these 
difficulties can be appraised through the use of 
sociometric measures. Further, and perhaps 
most important, these same techniques provide 
an excellent means of identifying the child 
whose relations with his classmates are suffi- 
ciently defective to suggest the likelihood of 
more serious difficulties in the future. 

In general, sociometric measures may serve 
equally well the functions of improving normal 
interpersonal relations within the classroom 
and of diagnosing failures of interpersonal ad- 
justment which may become the psychiatric 
casualties of a later period. 

Probably the first reference to sociometric 
procedures in a classroom setting is an article 
by Moreno (1943b, originally published in 
1932). In this paper he discussed the results 
of sociometric testing in a public elementary 
school, indicating the possibility of reassign- 
ment and also reporting surprising inaccuracy 
on the part of the teachers in judgments of 
the choice status of the students. The fallibility 
of the teacher as a judge of student relations 
indicates the importance of sociometric tech- 
niques as a supplementary measure. A number 
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of additional studies (Bonney, 1947; Moreno, 
1934) indicate that teachers’ judgments, al- 
though variable in terms of the particular judge 
and the age of the students being evaluated, 
are not highly sensitive to individual differences 
in social acceptance. In particular, the teacher 
seems unable to discriminate effectively the 
object of main interest — the individual who 
receives a very low proportion of social choices. 
Perhaps the most extensive study of the accu- 
racy of teachers’ judgments of sociometric status 
was conducted by Gronlund (1950a, b) in forty 
sixth-grade classes. He found considerable 
variability among teachers in the accuracy of 
their judgments (Pearson correlations between 
teachers’ ratings and sociometric ratings varied 
from .27 to .84) but most teachers were quite 
low in absolute accuracy. Further, teachers 
tended to overestimate the choice status of 
children they liked and to underestimate that 
of children they disliked. 

The sensitivity of many teachers has been 
increased in a number of studies where the use 
of sociometric measures has supplemented 
judgments provided by teachers. Northway 
(1944) reports efforts made in cooperation with 
the school officials to bring about changes in 
the children’s interpersonal relations in the di- 
rection of greater acceptance. An early use of 
sociometric measures to permit improving in- 
dividual as well as group adjustment is re- 
ported by Kerstetter and Sargent (1940). 
In a ten month study of a fifth grade class 
they observed the rise of a subgroup that 
developed deviant tendencies and produced 
marked conflict with both school authorities 
and the police. Following administration of 
an additional sociometric questionnaire, the 
members of the delinquent group were assigned 
new seats that separated them from each other, 
while at the same time taking advantage of their 
known preferences outside the group. The re- 
sult of this "assignment therapy" appeared to 
be a trend toward dissolution of the delinquent 
group, as reflected in their responses to a third 
sociometric questionnaire administered three 
weeks later. 

There have been numerous studies in which 
sociometric measures have been used to ap- 
praise the outcome of efforts to improve group 
or individual adjustment in classroom settings. 
In some cases the sociometric techniques were 
used simply to get at the outcome of the treat« 
ment, while in other cases they are involved in 
the treatment process itself. An attempt to 
improve social adjustment in a ninth grade class 
is reported by McClelland and Ratliff (1947), 
in which a sociometric questionnaire and a 


personality inventory were administered ini- 
tially to appraise the adjustment of the class 
members. Following this, individuals who gave 
signs of particular adjustment difficulties were 
given positions that would place them in posi- 
tive interaction with other members of the 
class. The class also embarked upon a self- 
improvement program and was divided into 
small groups, each of which had its own leader. 
A second sociometric questionnaire was ad- 
ministered which indicated a much more 
equitable distribution of choices, in spite of 
the fact that the same individuals were over- 
chosen. The use of continuous sociometric 
testing in an educational setting, combined with 
“assignment therapy” and individual counsel- 
ing for the poorly adjusted, is described by 
Shoobs (1947). He concludes that employment 
of these procedures over a six-month period 
resulted in both scholastic and behavioral im- 
provement in the classroom. 

Zeleny (1940b) reports a modified sociomet- 
ric procedure that he feels may be used effec- 
tively to control the satisfactions and morale 
of individuals in a classroom setting. In a sub- 
sequent paper (Zeleny, 1941) he outlines a 
procedure for measuring social status in the 
classroom and discusses means whereby the 
social status of students can be improved. The 
results of a preliminary study of 25 students 
indicate that four weeks after these individuals 
had been exposed to individual guidance and 
sociometric regrouping there was a general in- 
crease in social status. Sociometric measures 
were used by Johnson (1939) to measure the 
effectiveness of attempts to bring about the 
integration of isolates in groups within a boys’ 
club. The measures showed that in circum- 
stances where the natural leader of the group 
was used as the agent to bring about the change, 
the isolate tended to be absorbed into the group. 

The practice of regrouping individuals in a 
correctional institution is discussed by Moreno 
and Jennings (1944), and some indication is 
given of changes brought about by this re- 
grouping. In particular, they point to the in- 
creased mutuality among the group members 
which occurred in a particular instance. They 
emphasize that one should not stop with re- 
grouping the individual members, but rather 
that this should be combined with efforts at 
retraining key figures in the group. The use of 
"assignment therapy" in a classroom setting is 
described by Thrasher and Kerstetter (1947) . 
Two college classes were given sociometric tests 
at the start of the term and a committee struc- 
ture was established on this basis. At the end 
of the semester a second sociometric test was 
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administered and comparison with the initial 
results showed a highly consistent increase for 
the group members in both choices received and 
choices given. Such findings are extremely diffi- 
cult to interpret because of the absence of a 
control group. A study by Faunce and Beegle 
(1948) reports the use of sociometric measures 
to produce compatible work groups in a sum- 
mer camp and also to measure the degree of 
integration in the camp. 

It is characteristic of the studies we have 
discussed here that they involve sufficient weak- 
ness in pertinent empirical controls so that 
their results can be considered little more than 
suggestive. This does not mean that most of 
their findings may not be replicable, but simply 
that the evidence for firm generalizations in 
this area is not yet present. 

We have mentioned here only a very small 
proportion of the studies that have dealt with 
sociometric measures in educational settings. 
However, these studies, in spite of their pro- 
cedural limitations, are consistent in suggesting 
that there is a legitimate and important area 
of functioning for sociometric measures in the 
educative process. 


INDUSTRIAL APPLICATIONS 


The potential contribution of sociometric 
measurement in this area has increased greatly 
as psychologists in industrial settings have be- 
come less concerned with measurement of spe- 
cific abilities and traits and have focused more 
and more upon describing the actual interper- 
sonal setting in which jobs are carried out. 
The increasing interest in informal organiza- 
tion of working groups, as well as the frequent 
necessity of appraising morale, cohesiveness, 
relations to peer and authority figures, etc., 
makes it almost imperative that the industrial 
social scientist be familiar with sociometric 
measures. 

One of the first applications of sociometric 
methods in industry was made by Jacobs (1945), 
who discusses the general value of such an ap- 
proach and presents illustrations drawn from 
the use of these techniques in a business setting. 
He concludes that sociometric measures have 
value for measuring morale; selecting supervi- 
sors, foremen, and managers; determining in- 
formal groups; locating social factors that 
influence production; determining how well 
individuals are adjusting socially to their jobs; 
and measuring factors of cooperation, getting 
along with people, etc., for purposes of merit 
ratings and bonuses. Rogers (1946), in a dis- 


cussion of human problems in industry, in- 
dicates her conviction that the work of 
Roethlisberger, Dickson, and others of the Har- 
vard Business School group, and the contribu- 
tions of Moreno, primarily in the form of 
sociometric measures, represent the two most 
important contributions to an understanding 
of the psychological problems of the worker in 
industry. 

A study of 24 executives is reported by 
Browne (1951), who examined the relations 
existing among a small group of executives and 
also compared sociometric choice scores with 
various independent measures. Instead of ask- 
ing the subjects to indicate the persons they 
would like to associate with, they were asked 
to indicate the persons they actually did asso- 
ciate with in the process of carrying out their 
jobs. The results are presented in the form of 
a sociogram that makes clear the formal organi- 
zation of the company as well as the interaction 
patterns. The diagram shows that there are 
marked differences between divisions within the 
same company in the relations existing between 
executives. The frequency with which an in- 
dividual was reported as involved in interactions 
with others showed some signs of positive cor- 
relation with questionnaire measures of respon- 
sibility, authority, and delegation of authority. 
Browne generalizes his experience by suggesting 
that the sociometric measure may be used in in- 
dustry in order to study interpersonal relations, 
communication channels, and methods of per- 
forming leadership functions, as well as to 
suggest needed modifications in personnel rela- 
tions, Weschler, Tannenbaum, and Talbot 
(1952) have discussed the value of a multidi- 
mensional sociometric approach to industrial 
problems. 

The possibility that sociometric data may 
provide important information concerning the 
determinants and possible control of industrial 
accidents is suggested by two studies. Fuller 
and Baune (1951), studying third grade chil- 
dren, and Speroff and Kerr (1952), observing 
steel mill workers, both report that accident 
proneness appears to be negatively related to 
sociometric choice status. The latter investi- 
gators believe that the frequency of industrial 
accidents could be reduced through sociometric 
"reassignment" combined with active counsel- 
ing of workers who are demonstrably low in 
social choice. "The reader will find further dis- 
cussion of the use of sociometric techniques in 
industrial settings in Chapter 22 and Chap- 
ter 29. 
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MILITARY APPLICATIONS 


The increased activity of social scientists in 
military settings during the past decade has led 
to a number of studies in which sociometric 
measures have been employed in order to cope 
with problems of military importance. In gen- 
eral, the reasons that sociometric measures have 
been employed here are the same as those that 
have led to industrial applications — interest 
in morale, group cleavages, social adjustment, 
etc. An organization which asks individuals to 
function in group settings with a great deal of 
interdependency in their activities naturally is 
concerned with the quality of the interpersonal 
relations that characterize the groups, sub- 
groups, and individuals. 

Studies in this area may be conveniently di- 
vided into those that are primarily concerned 
with appraising the character of the individual's 
relations to others and those that are concerned 
with the quality of interpersonal relations for 
the group as a unit. One of the first reported 
studies using these techniques in an effort to 
provide information about the individual is 
reported by Sutherland and Fitzpatrick (1945) . 
Their discussion of group techniques in the se- 
lection, training, therapy, and demobilization 
of the British army make clear that sociometric 
methods were used freely and felt to function 
effectively. An intensive sociometric investiga- 
tion of 35 French army officers is described by 
Maucorps (1949), in which a number of cor- 
relates of sociometric indices were observed. 
He summarizes, “The sociometric procedure 
has proved to be an excellent selection tool. . . . 
Certain findings have revealed the practical 
importance of such an analysis as regards both 
the reducing of prejudice and the rational re- 
construction of groups” (p. 79). Jennings 
(1949) discussed the use of sociometric and 
situation tests in military settings in Europe 
and the United States during the past war. She 
believes that the German use of these instru- 
ments provided less opportunity for freedom 
in response on the part of the subject, but that 
such techniques were used very widely on both 
continents. 

An excellent study by Williams and Leavitt 
(1947) relates sociometric choice to perform- 
ance in officers’ candidate schools and, more 
importantly, to leadership performance in com- 
bat. The value of sociometric indices was ex- 
amined through comparing the relative eff: 
ciency of sociometric choice by platoon mem- 
bers (with the criterion of “all-round ability 
as a combat officer”) and a number of other 
indices, including self-ratings, leader ratings, 


personality inventory scores, and performance 
on objective ability tests. When these measures 
were related to success and failure in officers’ 
candidate school and to ratings of performance 
as a leader in actual combat the sociometric 
measure showed a significant correlation with 
both criteria and performed better than any 
of the other measures. In discussing the su- 
periority of the sociometric index the authors 
suggest: “Group members have more time to 
observe each other than do superior officers; 
they know each other in a realistic social con- 
text, and they react directly to each other's so- 
cialdominance behavior. All these are condi- 
tions favorable to informed judgment” (p. 291). 
A similar study is described by Baier (1947), 
in which a comparison of sociometric choice 
scores for a group of West Point cadets showed 
a correlation of .51 with ratings of efficiency 
provided by superior officers after the cadets 
had completed their training and spent eighteen 
months on active duty. 

A study reported by Wherry and Fryer (1949), 
conducted in an Army officers’ candidate school, 
investigated whether “buddy ratings” (modified 
sociometric techniques) measured leadership 
qualities rather than personal popularity. Two 
groups of officer candidates were studied. A 
large number of ratings by peers and superiors 
in addition to measures based on their perform- 
ance during the training period and intelli- 
gence test scores were secured. All of the 
leadership ratings, in addition to academic 
grades, were submitted to a factor analysis 
which led to the identification of four factors: 
academic success, leadership, tactical standing, 
and group difference correction. Wherry and 
Fryer summarize: 


Buddy ratings appear to be the purest measure 
of "leadership." Tactical officers are also able to 
rate this trait but their ratings are quite heavily 
weighted by tactical standing. Academic instruc- 
tors’ ratings are practically useless for the evalua- 
tion of this trait." (p. 189) 


Stogdill and Shartle (1948) and Stogdill 
(1949), in a study of naval officers, report the 
use of a sociometric technique that requires the 
subject to list his fellow officers in rank order 
in terms of the amount of time on the job 
spent interacting with them. The frequency 
with which the individual is mentioned by his 
fellow officers is positively correlated with his 
level in the staff organization and also shows 
some variation with the kind of administrative 
activity in which the individual spent most of 
his time. 
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Several studies have used sociometric tech- 
niques to describe or classify groups. A paper 
presented by Jenkins (1947) indicated the sen- 
sitivity of sociometric choice and rejection 
measures to morale differences between two 
combat naval air squadrons. When the two 
squadrons, known through observation to differ 
markedly in morale and efficiency, were com- 
pared in patterns of choice and rejection, pro- 
nounced and consistent differences were ob- 
servable. The high morale squadron was 
characterized by positive choice of the com- 
manding officer and executive officer, a high 
incidence of in-group positive choice, and an 
absence of cliques within the group. On the 
other hand, in the low morale squadron the 
commanding and executive officers tended to be 
either ignored or rejected; there was consider- 
able within-group rejection, and there was 
evidence for the existence of distinct cliques. 
In an attempt to relate group cohesiveness to 
military performance, Goodacre (1951) used 
sociometric questions to provide an index of 
cohesiveness and also secured ratings of the 
effectiveness of performance for a dozen twelve- 
man squads while coping with field problems. 
His measure of cohesiveness was simply the 
number of choices made by the members of 
each group. The results indicated a significant 
positive association between group cohesiveness 
and effectiveness of squad performance in the 
field. Zeleny (1947b) reports the use of socio- 
metric techniques to increase the compatibility 
of air force cadets who were assigned to the 
same team during their training. As the flying 
tasks required close coordination in the activi- 
ties of pair-members, it seemed desirable to se- 
cure as much mutuality within the pairs as 
possible. When the interactions in the groups 
that had been established prior to the use of 
sociometric measures were examined, there was 
a relatively low incidence of mutuality. In 
general, Zeleny's study makes clear that it is 
possible through the use of sociometric meas- 
ures and restructuring of groups to increase the 
incidence of positive choices and decrease the 
incidence of rejection. Although no direct evi- 
dence is presented to indicate that such a shift 
results in more effective performance, this 
expectation is justifiable in view of the tasks 
being performed by the groups and the findings 
of other investigators (Jenkins, 1947; Goodacre, 
1951). 

The studies we have reviewed provide evi- 
dence for the usefulness of sociometric 
techniques in military settings, especially if 
measurement of leadership, morale, social ad- 
justment, and similar variables is of importance. 
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MENTAL HOSPITALS AND PSYCHOTHERAPY 


While still in an exploratory stage, the appli- 
cation of sociometric procedures in mental hos- 
pital settings and in conjunction with the 
therapeutic process has already led to some 
suggestive findings. The importance of inter- 
personal relations in personal adjustment, in 
addition to their crucial status in the treatment 
process, makes it understandable that sociomet- 
ric measures should have some function in these 
problem areas. 

Jenkins and Curran (1940) report the use of 
a sociometric questionnaire to secure informa- 
tion concerning the social relations existing in 
a group of about 40 adolescents in a psychiatric 
observation ward. 'The investigators report a 
number of illustrative sociograms and discuss 
them in relation to the known attributes of 
some of the patients included in the sociograms. 
A more recent study by Hyde and York (1948) 
describes a time-sampling technique that can 
be used in the observation of ward behavior 
and which leads to data that can be presented 
in the form of a sociogram. The investigators 
suggest that the device leads to a number of 
uses, including aid in individual diagnosis, 
selection of patients for group psychotherapy, 
providing evidence of the efficiency of patients’ 
interpersonal relations (including signs of im- 
provement or recovery) , and providing relevant 
information for hospital personnel that may 
motivate them to perform more effectively. 
Sharing the belief of these earlier investigators 
that sociometric measures provide empirical 
control over an important aspect of individual 
adjustment, Pepinsky et al. (1952) attempted 
to develop a modified sociometric questionnaire 
that could be used as a criterion against which 
therapeutic efforts could be tested. A less elabo- 
rate attempt to develop a measure of social 
adjustment for a psychiatric setting was made 
by Schauer (1946), who used a simple socio- 
metric technique in a small mental hospital 
population. His results show a very low amount 
of reciprocity in choice among patients, but he 
made no detailed attempt to determine how 
successful the sociometric measure was as an in- 
dex of adjustment. 


COMMUNITY SERVICE 


The economy and speed with which sociomet- 
ric measures can provide information concern- 
ing the personal ties and repulsions within 
relatively large groups have made them an effi- 
cient tool in the hands of individuals concerned 
with studying and facilitating various commu- 
nity functions. Thus, the social worker, com- 
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munity leader, and resettlement agent have all 
found these measures valuable. 

The use of sociometric findings in the study 
of open communities was perhaps first reported 
by Wolman (1937) , who studied a resettlement 
community of 35 families, composed of indi- 
viduals drawn from low-income brackets in 
urban areas. He administered two sociometric 
tests six months apart. The first administration 
was at the start of the project and the results 
were used in determining the location of the 
families. The second administration was de- 
signed to discover what changes the six months 
of community life had brought about. Al- 
though the author discusses the changes in so- 
cial position of the families individually, the 
only general change reported was that two of 
the six families originally unchosen had become 
integrated into the group. Closely related to 
this study are a series of investigations by Loo- 
mis carried out under the auspices of the U. S. 
Department of Agriculture. Loomis and David- 
son (19392), in a study of a rural resettlement 
community, utilized a modified sociometric 
technique, asking the identity of persons who 
actually visited the family being interviewed. 
They conclude: “. . . it is most important that 
the chain of informal social relationships on 
the projects should be studied. The present 
study conclusively demonstrates that such rela- 
tionships are important. ‘The families which 
remained and the families which moved away 
from the project . . . tended to constitute in- 
groups" (p. 78). In a second study, Loomis 
and Davidson (1939b) used the same technique 
to examine the extent to which two years of 
community existence had served to increase 
group integration. When the sociometric find- 
ings were compared with participant observer 
reports of cleavages present earlier, they indi- 
cated signs of increased community integration. 
Loomis (1941) used the same method in a 
Spanish-American village with the intent of ar- 
riving at insights that would be of value to 
Federal agencies concerned with resettling simi- 
lar groups. He concluded that, to function 
smoothly, any attempt to resettle such groups 
must take into consideration the extended fam- 
ily. A more recent study by Infield (1947) re- 
ports the use of sociometric measures in a study 
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of a Canadian cooperative community. 

The utility of sociometric measurement in 
the hands of the social worker has been the 
subject of considerable discussion (Deutsch- 
berger, 1950; Green, 1950; Moreno, 1950). A 
brief paper by Kerstetter (1943) has outlined 
the uses of sociometric tools for social agencies. 

A study is reported by Cologne (1943) in 
which members of a self-help community cen- 
ter were reassigned to work groups on the basis 
of sociometric data after the observation of 
hostility and tension among the members of 
the center. She demonstrated that the reassign- 
ment could be made in such a way as to satisfy, 
to some extent, the choices of a very large pro- 
portion of the group members. 

The use of sociometric devices to detect pat- 
terns of influence within given communities 
and to aid in the selection of community lead- 
ers has been a topic of considerable interest. 
The application of sociometric measures in se- 
lecting "block leaders" as well as in assisting 
with the problem of how a "block" should be 
defined is described by Sanders (1949). Many 
community functions necessitate the use of 
some small organized unit such as the block 
in order to preserve channels of communica- 
tion. In spite of a general tendency to define a 
block as an area bounded on four sides by 
streets, Sanders' sociometric data, collected in 
two different socioeconomic areas, indicates that 
most of the neighborhood interpersonal rela- 
tions are between individuals on opposite sides 
of streets. He concludes that it is generally more 
fruitful to define a block in this way. An ad- 
vantage of this approach is that it makes it 
possible to estimate who in the group is not 
likely to be contacted effectively by any par- 
ticular leader. Several papers by Stewart 
(1947a, b) indicate the feasibility of modified 
sociometric techniques as a means of determin- 
ing the influential individuals in a sizeable 
community. 

The investigations we have discussed make 
clear that sociometric measures can serve a num- 
ber of valuable functions for the student or 
practitioner working in community settings. 
The further development of large-scale socio- 
metric measurement may be expected to en- 
hance this value. 


NEGLECTED CONSIDERATIONS AND LIMITATIONS IN THE Usr OF SOCIOMETRIC MEASURES 


In our discussion thus far we have intention- 
ally presented sociometric measures in a favor- 
able light. Our purpose has been to indicate 
those areas in which such measures are useful 
and the kinds of problems and concepts with 


which they interact fruitfully. At this point, 
we should like to provide a partial antidote for 
this consistent optimism in the form of a small 
number of cautions or limitations with which 
the user of sociometric techniques should be 
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familiar. In some cases these cautions refer to 
the manner in which investigators have charac- 
teristically employed these techniques, while in 
other cases the fault is more general and inheres 
in the very nature of the sociometric technique. 


EXCESSIVE DEPENDENCE UPON SOCIOMETRIC DATA 


Sociometric measures provide only one of 
many possible means of viewing interpersonal 
relations. Knowledge of the formal or official 
organization of a group may be fully as im- 
portant as sociometric data. Further, the evi- 
dence that can be provided through systematic 
observation of the group is in many cases essen- 
tial. An adequate specification of the physical, 
cultural, and sociological setting, as well as a 
detailed psychological appraisal of the indi- 
viduals involved, is often a prerequisite to full 
understanding of the phenomena to which so- 
ciometric techniques are customarily applied. 

The inadvisability of securing sociometric 
data alone has frequently been noted by writers 
in this area. Moreno, in Who Shall Survive?, 
advocates the use of spontaneity tests (role play- 
ing) and interviews in addition to the socio- 
metric test, in order to give information about 
the determinants of the sociometric responses. 
The ineffectiveness of this and subsequent ap- 
peals is in part attributable to the tendency of 
these writers to represent sociometric data as 
more “real,” "spontaneous," or “basic” than 
other data that can be collected in connection 
with the same events. The desire to give socio- 
metric data a special empirical status, subordi- 
nating and ignoring other kinds of data, may 
diminish the motivation to introduce the va- 
riety of other measures necessary for the most 
fruitful application of sociometric measures. 

Whatever the determinants of this constric- 
tive tendency may have been, it is clear that 
sociometric research has had more than its 
share of investigations where internal relations 
are sought. The examining of one aspect of 
sociometric response in relation to another, or 
the relating of sociometric response to the most 
obvious independent measures (age, sex, reli- 
gion), can make only a limited contribution 
once the early phases of investigation are past. 


SCARCITY OF SYSTEMATIC INVESTIGATION 


It is quite characteristic of social scientists to 
let caprice and convenience determine both the 

roblem to be explored and the manner in 
which it is approached. This tendency, how- 
ever, seems especially marked among persons 
doing sociometric research. There has been 
very little in the way of systematic, theoretically 
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derived research aimed at exploring in a regular 
fashion the kinds of conditions and variables 
that are related to sociometric response. Most 
of the evidence accumulated in this area has 
resulted from the use of convenient or in- 
teresting measures, with no attempt to explore 
those relations that might have been predicted 
in advance, have greatest theoretical relevance, 
or that would be most likely to stimulate fur- 
ther investigation. 

The relative scarcity of systematic, cumula- 
tive investigations is intimately related to the 
tendency to rely heavily upon sociometric meas- 
ures alone. The person who permits his empiri- 
cal problem to be determined by casual interest 
and the availability of data is very likely to be 
content with sociometric data alone, while the 
individual who pursues a problem derived from 
theory or which is part of an orderly plan of 
research is almost certain to introduce other 
measures, Related to the lack of systematic 
direction of research has been the unimagina- 
tiveness with which studies have been carried 
out repeatedly with educational and institu- 
tional groups, while very little attention has 
been paid to the many other pertinent groups 
within our society. This lack of concern for 
representativeness in the subjects studied is 
perhaps defensible when the investigator is 
concerned with a general empirical relationship, 
but when he is simply concerned with taxonomy 
or description of group structure, he must be 
concerned with the degree to which his subjects 
represent a definable population. 

It seems quite likely that the value of studies 
that make casual and unsystematic use of so- 
ciometric measures is steadily decreasing. Ex- 
plicit and multivariate conceptions of the 
nature of interpersonal relations can be ex- 
pected to lead to research that will be both 
cumulative and of more basic importance in the 
understanding of group proces. Further, a 
more adequate understanding of sociometric 
instruments is likely to accrue from such sys- 
tematic programs of research. 

The facility with which Moreno has con- 
trived and presented speculative theoretical 
constructions in connection with his empirical 
findings, and his lack of concern for a rigorous 
linking of these formulations to empirical data 
may have served as a deterrent to a more cau- 
tious and persistent approach to the relation 
between theory and sociometric data. 

An interesting step has been made by Lee- 
man (1952) in a direction that may eventually 
lead to a more systematic approach to sociomet- 
ric data, He presents a base-line mathematical 
model to represent sociometric choice. In 
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mathematical models the initial assumptions 
are, of course, crucial to the subsequent deriva- 
tions, and in the “‘base-line model” emphasis is 
placed upon choosing assumptions that can be 
handled rather than those that are necessarily 
most accurate. The correctness of the model's 
predictions concerning the empirical world is 
presumably an outcome of subsequent revision. 
Leeman presents derivations from his model 
for three- and four-man groups and confronts 
his derivations with empirical data. The results 
of this comparison are promising. There are 
many difficulties involved in such an approach 
— Leeman indicates the practical limit of his 
model as the four-man group and further points 
to the complexities involved in making deriva- 
tions from the model. Nevertheless, this direc- 
tion represents one way in which more efficient 
data collection and interpretation might evolve. 


INSUFFICIENT ATTENTION TO THE SELECTION OF 
THE SOCIOMETRIC CRITERION 


All too often sociometric investigators have 
used much of their time and energy in the col- 
lection of data, and have given little thought to 
the criterion of the sociometric question. The 
importance of selecting an appropriate criterion 
has been enunciated frequently but seems to 
have had only a limited audience. Moreno 
(19372) distinguished between the essential and 
the auxiliary criteria — the essential criteria 
are those that refer to the activity which is at 
the center of the investigator's interest. Quite 
frequently investigators have used auxiliary cri- 
teria to define or measure variables such as 
leadership, social status, etc., with the conse- 
quence that they secured contaminated meas- 
ures and results that were difficult to interpret. 

Relevant to this issue is the distinction pro- 
posed by Criswell (1949) betwen one-way and 
two-way sociometric questions. As she points 
out, certain sociometric criteria (sit next to, 
room with) refer to relationships that are re- 
ciprocal, where the extent to which choices are 
mutual or shared is an all-important question. 
Other criteria (lead a squad, represent a school 
at a conference) refer to relationships where 
mutual choice is relatively unimportant. The 
crucial question is how frequently the person 
is chosen, not whether he is chosen by those 
whom he chooses. A further relevant distinc- 
tion is that between psyche-group and socio- 
group (Jennings, 1950). These distinctions are. 
important in the present discussion because 
they point to the fact that if the criterion used 
is not selected carefully, the investigator may 
well secure data that do not lead naturally to 
the kinds of analyses in which he is interested. 
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Not only is it important that investigators 
consider carefully the question of what cri- 
terion is most appropriate for their special use, 
it is also important that future investigations 
aim directly at the question of what choices 
under various criteria mean (relate to), or the 
kinds of variables that can best be measured 
through the use of particular criteria. 

One of the most frequent objections raised 
to sociometric measures is the claim that the 
investigator does not know what a choice 
“means” for each individual. Thus, for one 
subject choice is evidence of personal affection, 
for another it is an intellectual evaluation, 
while for a third it is a mechanical answer con- 
forming to the experimenter’s demands. This 
objection may refer to the general question of 
the relevance of various criteria for providing 
data pertinent to particular dimensions. Viewed 
in this light it is legitimate, or at least points 
to an important area of inquiry. However, it 
is not proper to ask that the investigator know 
what a given response “actually means" to 
every respondent. Such a demand, if taken liter- 
ally, would rule out virtually every data col- 
lection procedure commonly used in the social 
sciences, It is no more necessary that we know 
what a given sociometric response means 
uniquely to each subject than that we know 
what each check on a questionnaire or each 
statement in an interview means. The im- 
portant question is the extent to which an 
order imposed upon subjects from this kind 
of response data can be related to pertinent 
independent measures. 


INADEQUATE QUANTITATIVE TREATMENT OF DATA 


In spite of the tremendous forward steps that 
have been made in the analysis of sociometric 
data, characteristic flaws remain in many stud- 
ies. Most serious of these is the tendency 
to capitalize illegitimately upon chance varia- 
tion and thus to treat random variations 
as representing significant findings. Fre- 
quently this is accomplished by conducting 
a large number of tests of significance and then 
ignoring those relations that fail to materialize, 
while concentrating upon the results that 
achieve respectable significance. This leads to 
a false sense of security and the acceptance of 
relations as demonstrated that are unlikely to 
prove replicable. Even this is more defensible 
than the practice of reporting numerical results 
and then discussing observable differences with 
no attempt to assess the role of chance. Fortu- 
nately, the incidence of such studies has shown 
a marked decrease in recent years. There are 
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many more sophisticated problems encountered 
in the quantitative analysis of sociometric data 


and some of these have been dealt with in pre- 
vious sections of this chapter. 


SUMMARY 


We have attempted to outline the ways in 
which sociometric data can be collected, ap- 
propriate methods for analyzing these data, and 
some of the areas of investigation and appli- 
cation where sociometric measures may be 
useful. Although we have noted certain limi- 
tations in sociometric measures and their cus- 
tomary application, the general impression we 
have tried to communicate is that the measures 


are of substantial value to the investigator con- 
cerned with group processes or the individual 
in a group setting. Further, the value of these 
measures is certain to be enhanced through the 
development of more sophisticated techniques 
for the quantitative analysis of sociometric data 
and through using these measures more exten- 
sively in conjunction with other methods of 
data collection. 
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The Interview: A Tool of Social Science 
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AND 
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INTRODUCTION 


The present discussion of the interview will 
be confined to the uses of interviewing for re- 
search. The interview has, of course, many 
other purposes: it is used, for example, for diag- 
nosis and therapy, and for selection of person- 
nel. But it is in the uses of the interview for 
research that our primary interest as social sci- 
entists lies. Using an interview for research 
purposes implies that the interview is intended 
to yield some sort of measurement, however ru- 
dimentary, along some dimension or variable. 
Usually, also, the objective will be to relate 
several variables, thus measured, to one another. 
In the following discussion then, the interview 
will be considered from the standpoint of the 
usefulness of the research data it provides. Some 
interviewing is, as we shall see later, preliminary 
to the formal process of measurement, in that 
it is used to suggest what the relevant dimen- 
sions for research shall be. 

For our purposes, an interview will refer to 
a face-to-face verbal interchange, in which one 
person, the interviewer, attempts to elicit in- 
formation or expressions of opinion or belief 
from another person or persons. We are exclud- 
ing both paper-and-pencil questionnaires and 
behavioral observation from consideration as 
interviews. It should be noted that behavioral 
observation may cover observation of either 
verbal or nonverbal behavior. Not all observa- 
tions of a subject’s verbal behavior may prop- 
erly be classed as interview data. For example, 
observations of the verbal behavior of small 
groups holding discussions, or observations of 
a parent answering the questions of a child, 
are not considered interviews within the lim- 
its of our definition. In the interview, the 
verbal expressions of the respondent must be 
directed toward the interviewer, in response 
to the interviewer's questions or comments. 
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In the following discussion of the interview, 
we shall trace briefly some of the historical ori- 
gins of present-day interviewing practices, and 
then consider the place of the interview in re- 
search design. Next, we shall discuss some ma- 
jor varieties of the interview (principally 
standardized versus unstandardized interviews), 
and then take up the problems and strategy of 
designing specific interviews, touching upon 
question wording, and the use of indirect or 
“projective” questions. Then we turn to the 
interviewing process itself: the role relation- 
ships between interviewer and respondent, the 
problem of obtaining the respondent's interest 
and cooperation, techniques of asking questions 
(including nondirective probing), and finally 
the process of recording the respondent's re- 
plies. Administrative problems in carrying out 
a field interview study are dealt with briefly. 
The problems peculiar to interviewing children 
and interviewing in other societies than one's 
own are then discussed. The final section of 
the article is devoted to evaluation of the inter- 
view as a research tool: the extent, nature, and 
control of interviewer error or bias are dis- 
cussed, along with the problem of validity of 
interview findings and the merits and demerits 
of the interview as compared with other meth- 
ods of measurement available to the social sci- 
entist. 


HISTORICAL ROOTS OF PRESENT-DAY 
INTERVIEWING 


Historically, the development of the inter- 
view as a research tool has two major roots. 
The use of interviewing in clinical diagnosis 
and counseling indicated that considerable psy- 
chological information could be obtained from 
interviews. The interviewing techniques which 
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w up in the clinical setting, however, were 
not developed with the problems of measure- 
ment primarily in mind. Here the emphasis 
was upon richness of response, and little atten- 
tion was given to the ordering and classification 
of the resultant data. An antidote for this ab- 
sence of concern with measurement was pro- 
vided by the growth of psychological testing. 
During World War I, the development of the 
Army alpha intelligence tests lent strong im- 
petus to the growth of psychological tests and 
measurements. This period saw the beginning 
of the use of attitude scales in which the mul- 
tiple choice test item was adapted to attitude 
measurement. World War II saw a tremendous 
increase in the use of this kind of measure. 
The use of such checklist questionnaires per- 
mitted the development of various kinds of 
scaling techniques that enabled social scientists 
to obtain objective and reliable measures of 
attitudes. 

In the hands of such research workers, the 
personal interview was employed largely as a 
device for administering checklist kinds of 
questions when it was not possible to assemble 
respondents for group administration of paper- 
and-pencil questionnaires, or when it was feared 
the respondent might not be able to handle 
written materials. Much public opinion poll- 
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ing and market research interviewing has been 
essentially an oral adaptation of the objective 
questionnaire. 

In recent years, some psychologists (often 
those with a Gestalt or Lewinian bent) were 
interested in exploring cognitive processes 
through the interview. They felt that the 
"depth" interview which has been developed in 
clinical work was more productive for studies 
of the psychological factors underlying atti- 
tudes than the objective type of attitude scale. 
However, in order to quantify their interview 
information, they categorized the interview 
material into codes quite similar to the multiple 
choices used so widely in attitude measurement. 
As a result of such attempts, it is now possible 
to achieve a high level of reliability in the 
coding of most open-ended or depth interview 
material (H. Guetzkow, 1950, and Survey Re- 
search Center, University of Michigan, 1947). 

Interviews as currently employed for research 
purposes show clearly the influence of both ma- 
jor lines of historical origin. The merging of 
the test tradition with the clinical tradition has 
resulted in a wide variety of interview tech- 
niques that are now available to the research 
worker, combining the two approaches in vary- 
ing degrees. 


Tue INTERVIEW AS A RESEARCH TOOL 


The interview in social psychological research 
is a general tool which may be employed in a 
variety of research designs. It may or may not 
have a place in any particular piece of re- 
search. It does not in itself constitute a com- 
plete research method. For example, if the 
object of the research is to test a hypothesis 
concerning attitude change, a paper-and-pencil 
test or a set of behavior observations may well 
be preferable to the use of an interview as the 
tool for measuring the attitudes of subjects in 
experimental and control groups. Even if the 
interview proved to be a desirable method in 
such an experiment, its application does not in 
itself constitute the method by which the hy- 
pothesis under consideration is tested. It is 
necessary to formulate a specific design, such 
as, for example, the “after only” design (Hov- 
land, Lumsdaine, and Sheffield, 1949). In such 
a design, the interview or the questionnaire or 
the behavior observation is used to measure the 
attitudes of the experimental and control 
groups, and constitutes an important tool in the 
experiment. Unfortunately, the interview is too 
often used as a substitute for study design 
(Stouffer, 1950), in which hypotheses are not 


formulated nor designs set up which would 
provide adequate tests of hypotheses. 

The interview may have a variety of roles in 
the design of a study. It may be employed dur- 
ing the early stages of a study to help identify 
the relevant dimensions, to suggest hypotheses, 
and to reveal the natural frames of reference 
existing in the minds of respondents. When 
paper-and-pencil questionnaires are used as the 
basic instrument of measurement, the interview 
may be employed to pretest the questionnaire 
form. i 

The second possible role of the interview in 
the research process is as the main instrument 
of data collection. Interviews may be used to 
obtain base-line measures on the experimental 
and control groups before the application of 
an experimental treatment, or to measure these 
groups after the application of the experimen- 
tal procedure, or both. In nonexperimental 
studies, the interview is commonly used to 
obtain data for scale measures of a set of vari- 
ables whose interrelations are to be studied. 
When the interview occupies the central posi- 
tion as a study's primary instrument of data 
collection, the problems of standardizing the 
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interviewing technique and measuring reliabil- 
ity and validity become of greater importance 
than they would be otherwise. 

A third role of the interview is to clarify 
findings which emerge from the use of other 
techniques. Merton and Kendall (1946) point 
out that when the experimental variable is a 
complex one (as it is in much communications 
research, where the experimental group is ex- 
posed to a film or radio program) , interviewing 
may indicate what aspects of the experimental 
procedure were responsible for observed ef- 
fects. Furthermore, the interview may offer 
clues as to why the experimental procedure 
does not produce the anticipated effects, when 
unexpected results do occur. 

Interviewing often plays an inconspicuous 
but crucial role in the study of traditional psy- 
chological problems. Answers to questions 
constitute an important part of the data in 
psycho-physical experiments such as those on 
brightness cues in depth perception, Inter- 
viewing is also an integral part of individual 
intelligence testing and studies of verbal learn- 
ing. Sometimes interviewing is employed as a 
means of inducing stress, which may in turn 
be measured in terms of various emotional 
response indicators such as pneumographs, 


sphygmographs, etc. And sometimes, the in- 
terest of the research worker in the interview 
is not so much what the respondent says, as 
how he says it. During World War II, inter- 
viewers for the Office of Strategic Services (OSS 
Assessment Staff, 1948) were interested in the 
ease with which a candidate could talk about 
his early experiences; the data provided by the 
interview in such a case would not be obtained 
by the usual content analysis of an interview 
record, but rather by the interviewer's rating of 
the respondent's reactions to the interview it- 
self. Similarly, Gerald Caplan, an Israeli psy- 
chiatrist, has taken the significant data emerg- 
ing from an interview to be the role in which 
the respondent places the interviewer. That is, 
he observes whether the respondent treats the 
interviewer as a parent surrogate, as an in- 
strument to be manipulated for his own ends, 
etc., and is more interested in what such be- 
havior on the part of the respondent reveals 
about his personality than in the manifest con- 
tent of the respondent’s answers to questions. 

It is evident, then, that the interview can 
occupy many different roles in the research 
process, and may be used to produce several 
different kinds of data. 


THE STANDARDIZED VERSUS THE UNSTANDARDIZED INTERVIEW 


By a standardized interview we mean one in 
which the questions have been decided upon 
in advance of the interview, and are asked with 
the same wording and in the same order for all 
respondents. The questions used in a standard- 
ized interview may be either “open” or “closed.” 
The essential feature of the standardized inter- 
view is that the interviewer does not have free- 
dom to word or reword questions, to introduce 
questions which seem especially applicable to 
the individual case, or to change the order of 
topics to conform to the respondent’s sponta- 
neous sequence of ideas. In the unstandardized 
interview, the interviewer's technique is com- 
pletely flexible, and is varied from one re- 
spondent to another. 

Both standardized and unstandardized inter- 
views are widely used by social scientists. The 
distinction is sometimes made between "struc- 
tured" and "unstructured" interviews (or "ac- 
tive" and "passive" interviews — Kluckhohn, 
1945) . This dimension is not identical with the 
standardized-unstandardized dimension, but it 
is similar, and much of the discussion of struc- 
ture in interviews applies to our present topic. 
Unstructured interviewing is, in general, done 
without a standard list of questions. It has 


the further characteristic that the questions are 
as “open” as possible (see the discussion of 
open questions below) , and offer a minimum of 
guidance to the respondent. Unstandardized 
interviewing (and/or unstructured interview- 
ing) has been advocated by Rogers (1945), 
Roethlisberger and Dickson (1939), Piaget 
(1929) , and Kinsey et al. (1948) , among many 
others. Merton and Kendall (1946) have ad- 
vocated a semistandardized interview which 
they call the “focused interview,” and which 
employs an interview guide with a list of objec- 
tives and suggested questions but gives the 
interviewer considerable latitude within the 
framework of the interview guide. 

The arguments advanced in support of the 
different techniques may be briefly summarized: 


In favor of standardized interviews: 

(a) they incorporate a basic principle of 
measurement: that of making infor- 
mation comparable from case to case, 

(b) they are more reliable, 

(c) they minimize errors of question word- 
ing. 

In favor of unstandardized interviews: 
(a) they permit standardization of mean- 
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ings rather than of the more superficial 
aspects of the stimulus situation, 

(b) they are more valid, in that they en- 
courage more true-to-life replies, 


(c) they are more flexible. 


Let us consider these points in detail. The 
most fundamental argument supporting the 
structured interview is a simple one: that one 
should not use a rubber yardstick. Ultimately, 
the research worker's objective is to differen- 
tiate among his respondents with respect to 
a set of variables. If he varies his methods 
of measurement from one case to another, he 
never knows whether differences obtained 
among his respondents are due to differences 
in the measurement technique or differences in 
the amount respondents possess of the trait 
being measured. To take an oversimplified ex- 
ample, a research worker who was attempting 
to measure the amount of color shock indi- 
viduals display in response to the Rorschach 
test would not show some of his subjects cards 
containing a large amount of red and other 
subjects cards with a small amount of red, for 
if he did so, he would not be able to attribute 
differences in response to differences in the 
personality trait which is presumed to underlie 
color shock responses — instead, the differences 
might be due at least in part to variations in 
the cards to which the individuals were ex- 
posed. 

Unstandardized interviewing is similar to 
using a new set of Rorschach cards for every 
subject. When interviewers are not provided 
with a standard list of questions, they vary the 
wording and order of their questions quite con- 
siderably, even when they have been well 
trained concerning the objectives of the study 
(U. S. Dept. of Agriculture, 1945). And it has 
been found repeatedly that even what would 
appear to be minor changes in the wording of 
questions produce significant changes in the 
frequency of given responses (Cantril, 1944) . 
The attitudes which a respondent expresses in 
an interview, or even the "facts" which he re- 
ports, will be, to a considerable degree, func- 
tions of the particular questions he is asked, and 
respondent 4 may not reasonably be classified as 
more internationalistic, or more authoritarian, 
or wealthier than respondent B (on the basis 
of answers to interview questions) unless both 
respondents have been asked the same questions. 

It should be borne in mind that the research 
worker is not always interested in comparing 
one respondent with another along some scale. 
In exploratory interviewing, he may be inter- 
ested in finding out what the relevant dimen- 
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sions are for later systematic measurement, and 
it is not particularly important that cases should 
be comparable to one another. At this stage of 
research, then, the above objection to unstand- 
ardized interviewing does not apply. 

Some proponents of the unstandardized in- 
terview believe in the value of making methods 
similar from one case to another so that com- 
parison will be possible, but they believe that 
using standardized wording and question order 
is not the way to make the measurements com- 
parable. They point out that the same words 
mean different things to different people, that 
when one asks a standardized question, one has 
not standardized the meaning of the question 
to the respondent (Crutchfield and Gordon, 
1947). It is better, say the exponents of the 
"meaning equivalence" position (Kinsey et al., 
1948), to use words which have equivalent 
meanings to different respondents, even if the 
words are not objectively identical. It is obvi- 
ous that even the most dogged exponent of 
objective standardization, when he encounters 
a respondent who speaks a different language, 
will not attempt to interview in a language 
which the respondent does not understand 
merely for the sake of maintaining formal stim- 
ulus equivalence. In practice, the research 
worker would translate, or get an interpreter. 
The same problem arises to a lesser degree for 
different educational levels; the advocate of the 
unstandardized interview would ask the well- 
educated respondent about "inflation" and the 
poorly educated one about "prices going up" 
on the assumption that these two phrases are 
more truly comparable than the word “infla- 
tion" used for both groups would be. A com- 
mon solution to the problem of equivalence of 
meaning in public opinion work (where the 
emphasis is usually on objectively standardized 
wording) has been to aim for the lowest com- 
mon denominator: to assume that the stimulus 
value of simple words is comparable all along 
the range of education levels even if this is not 
true for complex words. But Kinsey found, 
studying sexual behavior, that there was no set 
of terms which could be understood by all his 
respondents. Each group had its own jargon, 
and it was necessary for the interviewer to de- 
tect and use these words in order to be under- 
stood. Kinsey also felt that it was necessary to 
vary question order: he wished to follow the 
widely approved practice of beginning the in- 
terview with innocuous items, burying the more 
embarrassing ones late in the interview, when 
rapport would be well established. But he 
found that some items were embarrassing to 
some groups, other items to other groups, $O 
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that he could only standardize the rapport- 
building qualities of the interview by varying 
the order of questions according to the respond- 
ent's social group. 

The extreme form of the stimulus-equivalence 
position would be that no word means the 
same thing to two different people, since they 
will have had different experiences associated 
with the word (and meanings are the product 
of all the individual's past experiences with the 
word), so that in every instance one should 
attempt to explore the respondent's idiosyn- 
cratic meanings, and to find words which are, 
for him, equivalent to the meanings intended 
by the definitions of the variables in the 
research. 

The practical difficulties of trying to carry on 
research in this way are obvious, and no one, 
so far as we know, has attempted to do it in 
detail. Furthermore, it is likely that in trying 
to make equivalent one aspect of the stimulus 
situation, the research worker introduces an 
imbalance into another aspect. For example, 
Kinsey varied question order to obtain equiva- 
lence of rapport qualities. However, in so doing, 
he sacrificed equivalence of the context in which 
questions appeared, and it may be that some 
of the group differences which he reports are 
in some degree due to these contextual differ- 
ences rather than to differences in actual 
behavior from class to class. 

How can one evaluate the positions discussed 
above? It is the feeling of the authors that the 
logic of research requires (at least in the final 
stages where measurement is attempted) that 
stimuli be standardized in one way or the other: 
objectively, or via equivalent meanings. Stand- 
ardizing the objective stimulus should serve 
very well in most situations; there is great com- 
munality of meanings for most words among 
people speaking the same language. Kinsey's 
difficulties in finding words with a generally 
understood meaning stem, we believe, from the 
special nature of his subject matter. When 
widespread taboos exist against the written or 
spoken expression of certain facts and ideas, 
the conditions are maximal for the development 
of idiosyncratic meanings and special jargon 
among the small group who do communicate 
with each other about the topic. For most 
research, the problem would be much less acute, 
and there would be little reason for attempting 
to go beyond the standardization of the objec- 
tive stimulus (assuming, of course, that careful 
pretesting has weeded out most of the words 
which would have special meanings for any sig- 
nificant proportion of the population being 
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When the research worker does feel the need 
to work for special wordings for certain sub- 
groups in the interests of meaning equivalence, 
it will be necessary for him to keep in mind the 
following questions: 


L Are the characteristics which place the 
respondent in a particular meaning cate- 
gory identifiable by the interviewer? If 
not, the system of equivalents cannot be 
put into effect. For example, it would be 
possible to specify that a question should 
be asked in one form of Negroes and an- 
other of whites. But it would be more 
difficult to provide a special form of a 
question for people with a father fixation, 
even if this might be highly relevant to a 
person's probable interpretation of the 
meaning of the question, because the mem- 
bers of the group for whom the special 
wording was to be employed could not 
readily be specified in advance. 

2. Are the equivalent meanings of identifi- 
able subgroups known (or can they be 
found out as part of the research process) ? 
In the case of translation into another 
language, or substituting a subgroup's 
synonym, the problem should not be too 
difficult, although the meanings will never 
be exactly equivalent. In other cases, the 
difficulties are likely to be enormous. 

3. Arethe gains made by trying to get equiva- 
lence of one aspect of the interview large 
enough to offset the nonequivalences of 
other aspects which almost inevitably 
accompany such an attempt? 


We have been discussing the implications of 
standardized versus unstandardized interview- 
ing for producing information which will be 
comparable from case to case. There are a 
number of other considerations in the choice 
between these two methods, for example, their 
respective reliability and validity. There are 
two kinds of questions to be asked relative to 
reliability: (1) when the same individuals are 
interviewed a second time, which method will 
yield the closest correspondence between the 
first and second interview? and (2) assuming 
that ratings are to be made on the basis of 
interview write-ups, will the write-up of a 
standardized interview permit more reliable 
ratings than the write-up of an unstandardized 
interview? There are as yet no studies available 
directly testing these questions. The presump- 
tion would be, however, that the standardized 
interview would be superior on both counts. 
In the comparison of the first with the second 
interview, the unstandardized interview would 


D 
© 


454 


produce differences due to interviewing vari- 
ability as well as those due to respondent vari- 
ability, while in the standardized interview 
error due to variations in interviewing tech- 
nique should be much less. On the question of 
reliability of ratings, the interview write-ups 
from an unstandardized set of interviews usually 
produce the following kinds of difficulties in 
the rating process: (1) The interviews tend to 
be very full in some areas, very thin in others. 
In fact, without a standardized set of questions 
before him, the interviewer is very likely to 
overlook certain topics which he should have 
covered, and it is usually impossible to rate all 
the interviews on all the dimensions included in 
the study. Of course, the use of a detailed 
"interview guide" such as Merton recommends 
will help to ensure complete information, even 
when questions are not standardized. Further- 
more, in some situations it is possible to revisit 
the respondent to obtain items of information 
which were overlooked in the first interview. 
When this is possible, the standardized list of 
questions is less important. (2) Some of the 
material in some of the interviews will be am- 
biguous as to whether or not it is relevant to 
the dimensions of the study. Suppose, for ex- 
ample, that in a study of the public’s informa- 
tion about mental health problems one of the 
topics for discussion in an unstandardized 
interview is alcoholism. If the respondent talks 
at length about drunken driving on New Year's 
Eve by otherwise relatively sober people are his 
remarks, revealing as they may be of his attitude 
towards drinking, to be considered as revealing 
attitudes toward alcoholism? In unstandardized 
interviewing, the chances are great that the 
respondents will vary greatly in the frame of 
reference from which they discuss a question, 
and raters will vary in whether they consider 
the frames of reference of several respondents 
sufficiently comparable to justify rating them 
along the same dimension. Such variation, of 
course, produces unreliability of rating. 

The relative validity of the two methods may 
be an entirely different matter, A standardized 
interview is sometimes stilted and unnatural— 
it may put the respondent on the defensive by 
making him feel that he is being tested. The 
unstructured interview, on the other hand, is 
assumed to be more like a real-life conversation; 
presumably, it permits the respondent to follow 
his natural train of thought, and therefore tends 
to bring out material which is more likely to en- 
able the research worker to predict what the re- 
spondent would do or say in real-life situations. 

So far, there are no studies on the compara- 


tive validity of the two types of interviews. It 
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may be true that some varieties of standardized 
interviews (particularly the fixed-alternative 
type) jeopardize rapport. Those who have 
worked with carefully pretested open-end 
standardized interviews know that it is possible 
to maintain the conversational quality of an 
interview within the framework of a standard- 
ized set of questions, at least to a considerable 
degree, but the verbal interchange between 
interviewer and respondent is never as free as 
it would be without a standard list of questions. 
It remains to be seen whether unstandardized 
interviews have sufficiently greater validity so 
that ratings based upon them will predict cri- 
terion variables better than will ratings based 
on standardized interviews. 

A point often made in support of unstand- 
ardized interviews is that they are flexible. By 
flexibility is meant that the interviewer may 
adjust his technique to the individual respond- 
ent's particular situation. He may explore 
unusual ideas which the respondent mentions, 
omit topics which do not seem applicable in a 
particular case, take advantage of extraneous 
events which occur during the interview to 
introduce a related topic, etc. It is true that 
it is very difficult to design a standardized inter- 
view in which all questions are applicable to all 
respondents, but a certain amount of flexibility 
may be achieved within the framework of a 
standardized interview by providing subsets of 
questions which are to be asked only of par- 
ticular groups of respondents—for example, 
only of those who have had direct contact with 
members of a minority group, or only those 
who have certain minimum information about 
the U. N., etc. 

So far, in the discussion of standardized versus 
unstandardized interviews, the two techniques 
have been presented as though they were mutu- 
ally exclusive alternatives, with no middle 
ground. Actually, for some purposes, a semi- 
standardized interview may be best. For ex- 
ample, in the interests of flexibility, the research 
worker sometimes specifies exactly a series of 
main questions, which are to be asked of every- 
one, but lists a series of optional subquestions 
or probes which the interviewer is free to use 
or omit, depending upon the respondent's 
answers to the main questions, 

The research worker's choice of a fully stand- 
ardized, semistandardized, or unstandardized 
interview will depend very much on the stage 
of development of the subject-matter area in 
which he is working. It is fruitless to attempt 
standardized interviewing before the dimensions 
of an area are well delineated, and a good many 
years of exploratory work are often needed 
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before the precision of standardized interview- 
ing is possible or justified. (The insights of 
Freud, arising out of his completely unstand- 
ardized interviewing, are a lesson to the 
overzealous researcher who would begin late- 
stage research too soon.) Nevertheless, the 


obligations of measurement cannot be evaded 
indefinitely by labeling research “exploratory.” 
Every developing field in social science must 
arrive at the measurement stage eventually, at 
which point the standardized interview comes 
into its own. 


THE PHRASING OF INTERVIEW QUESTIONS 


The problem of selecting questions and word- 
ing them properly is essentially the problem of 
translating the objectives of the study into the 
concrete set of stimuli which will determine the 
nature of the final data. It is precisely at the 
point of wording questions that objectives may 
be garbled or lost sight of completely. 

A temptation which besets research workers 
is to translate objectives into questions too 
directly. If an objective of the research is to 
find out something about the determinants of 
a given attitude or piece of behavior, the 
research worker may simply ask the respondent 
"Why did you do that?" or “Why do you feel 
that way?" The weaknesses of the simple “why” 
question have been ably described by Lazarsfeld 
(1935) , who pointed out what a wide variety 
of frames of references it might elicit, only some 
of which are relevant to a study’s objectives. In 
many cases, its worst weakness is that it asks the 
respondent to be the scientist, asks him to pro- 
vide insights about the determinants of his own 
behavior which he should not be expected to 
provide. The research worker may want to 
know “why” certain behavior occurs, but he can 
seldom find the answers simply by asking 
respondents to explain themselves. The very 
question “why?” implies an explanatory system 
—a set of categories which will be related to the 
phenomenon under study. For example, sup- 
pose the research worker wanted to find out why 
the members of one group display more overt 
aggression than the members of another group. 
What are some of the possible levels of explan- 
ation? The group may be aggressive: 


(1) because they hate and fear others, 

(2) because they are frustrated, 

(3) because they believe others hate and fear 
them, and therefore feel justified in 
attacking first, 

(4) because they have no insight into their 
Own motives, 

(5) because they were severely punished in 
childhood and have “identified with the 
aggressor,” 

(6) because they are larger and stronger and 
have learned that aggression will get 
them what they want, 


(7) because they have an internally unbal- 
anced social system and can only keep it 
intact by creating an “out-group” to 
attack, 


Clearly, there are different levels of explana- 
tion here. Reason 2 may lie behind reason 1. 
If we want to explain aggression in terms of 
projection, we might expect that the following 
sequence is involved: the individual is frus- 
trated, he is prevented from showing aggression 
toward the instigators of his frustration, and 
then, provided he has no insight into his own 
motives, he projects (attributes his own aggres- 
sive feelings to others) , and then attacks them 
in "self-defense." From the point of view of 
this presumed sequence of events, (2), (5), 
(4), and (3) are all parts of the explanatory 
sequence. None of them is the explanation. 
Reason 7 comes from a completely different 
explanatory system—a system of concepts arising 
out of the theory of group process. 

The research worker must choose the system 
of explanatory concepts which he wishes to 
employ, and then must ask questions in his 
interview which are relevant to these concepts. 
If he chooses one set of concepts, he will ask 
about the severity of discipline meted out by 
the respondent's parents. If he chooses another 
set of concepts, he will ask about the relations 
of group members to one another and to their 
leader. He will seldom succeed in getting at 
desired levels of explanation merely by asking 
the respondent why he does what he does or 
believes what he believes. 

The research worker, then, decides what 
questions to ask by using all his ingenuity in 
selecting kinds of explanations which offer 
promise of predicting the behavior he is study- 
ing, and then finding specific questions which 
will help to measure the dimensions he has 
chosen. Even after he has decided that he must 
measure “severity of current frustration," 

e "severity of punishment in childhood," “cohe- 
siveness of the group," or "degree of insight," 
he has the very great problem of translating 
these dimensions into questions which will 
provide reliable and valid measures. An inter- 
esting example of translating an underlying 
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dimension into specific questions has been pro- 
vided by the students of authoritarian person- 
ality (Adorno, et al., 1950), who ask about 
attitudes toward women in politics, and the 
respondents’ views about the proper reaction 
when one's honor is insulted, etc.—questions 
which a researcher would ordinarily not think 
of asking in a single study. They appear 
together because the research worker has de- 
signed questions which are all related to an 
underlying personality dimension which unites 
the seemingly unrelated bits of behavior. 

Whether the research worker devises a set of 
standardized questions in advance or composes 
his questions as he goes along in the interview, 
he must be cognizant of the pitfalls in phrasing 
questions. Stanley Payne (1951), in his book, 
The Art of Asking Questions, has discussed 
many of the problems of wording, and has pro- 
vided a checklist of one hundred considerations 
to keep in mind when preparing questions for 
public opinion interviewing. Other discussions 
of question wording which present both sum- 
maries of experimental findings on the effects 
of variation in question wording and some of 
the “wisdom” of practitioners in the field may 
be found in the literature (Cantril, 1944; 
Parten, 1950; Jahoda, Deutsch, and Cook, 
1951). It is not possible here to present a com- 
plete and specific list of the do's and don't's of 
question wording. Instead, we wish to discuss 
certain major problems which the research 
worker faces in phrasing questions, and to sum- 
marize some of the solutions which have been 
found workable. 

A primary problem in wording questions is 
to make sure that the questions create for the 
respondent as clear a picture as possible of the 
subject matter under discussion. Of course, in 
some instances, a truly clear picture is impos- 
sible, because the respondent is completely 
uninformed about the topic and completely 
uninterested in it, so the question can hardly 
provide a frame of reference which will be 
meaningful for him. Frequently, the research 
worker will employ an information filter ques- 
tion, which will permit him to identify such 
people and omit further questions for them. 

The advantages of having questions worded 
as clearly as possible are perhaps too obvious 
to need recounting. In the first place, when 
questions are ambiguous, standardization of the 
stimulus has not been achieved, because some 
respondents are talking about one thing and 
others about another. (Crutchfield and Gordon, 
1947) ; in the second. place, interpretation and 
classification of the answers is hampered, be- 
cause there will be too many answers which do 
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not fit the intended scales, and too many "don't 
know" answers. 

Following are some of the rules of wording 
which have been worked out in the interests of 
question clarity: 

(a) Avoid words with a double meaning. 
The word fair, for example, may mean either 
"equitable" or "not very good." The phrase 
put up may mean to nominate a candidate, stay 
for the night, endure an insult, or preserve 
fruits. 

(b) Avoid long questions. The respondent 
will not be able to keep the entire question in 
mind, and will answer only part of it, ignoring 
some of the qualifications the research worker 
intended. 

(c) Specify exactly the time, place, and con- 
text which you want the respondent to assume. 
A question about "When you were in your 
teens" may elicit very different answers from 
"When you were young" or "when you were 
still in school." 

(d) Either make explicit all the alternatives 
which the respondent should have in mind 
when he answers, or make none of them ex- 
plicit. For example, if you ask: "Are promo- 
tions in your office usually based on merit?,” 
the respondent many answer "yes" hastily with- 
out considering that raises might be based on 
seniority, favoritism, etc. If the interviewer 
reminds the respondent of the other possibili- 
ties, a more thoughtful reply will be forthcom- 
ing. For some purposes, it would be better to 
mention none of the alternatives. The question 
could then be phrased: “What are promotions 
in your office usually based on?" 

(e) When the interview concerns a subject 
with which the respondent may not be very 
familiar, or one in which he may not have the 
necessary technical vocabulary, it is sometimes 
desirable to preface questions with an explana- 
tory paragraph or an illustration which will set 
the stage for the question the interviewer wants 
to ask. (See NORC, 1950.) The question itself 
may then still be kept brief. 

(f) It is often helpful to ask questions in 
terms of the respondent's own immediate (and 
recent) experience, rather than in terms of 
generalities. For example, in studying super- 
visory practices, the interviewer may ask, 
“Think back to the last time one of your work- 
ers came in late. What did you say?,” rather 
than, "What do you usually say when an em- 
ployee comes in late?" Similarly, studies on 
exposure to mass media have found that when 
people are asked how much time they spend 
listening to the radio or looking at television 
on the average, they are likely to underestimate 
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the time. More accurate estimates can be ob- 
tained by asking people to say how many hours 
they spent in this way on the preceding day, 
or what specific programs they saw or heard 
during a given period of time. 

When an interview deals with sensitive top- 
ics, attention must be given to wording ques- 
tions in such a way as to minimize ego defenses. 
As will be seen later, research workers some- 
times have recourse to projective or other 
kinds of indirect questions. There are ways, 
however, in which direct questions may be 
worded so that the respondent will feel rela- 
tively free to report candidly. Essentially, the 
problem is to keep the respondent from feeling 
that certain answers will mean loss of prestige. 
This can be accomplished by making the an- 
swers which the respondent might want to 
give seem socially acceptable. The following 
are some ways in which this has been done: 

(a) Indicate that other people hold what 
might otherwise be considered a frowned-upon 
opinion. “Some parents feel that it’s terribly 
important to teach a child not to fight with 


other children. Others feel that in some cir- « 


cumstances, a child must learn to fight. How 
do you feel about this?” 

(b) Balance against each other alternatives 
each having a positive value loading: “Do you 
and your family believe in showing your affec- 
tion for each other a lot, or are you fairly re- 
served sort of people?” An earlier version of 
the question, couched solely in terms of whether 
the respondent's family was affectionate, yielded 
a bunching of cases at the “very affectionate” 
end of the scale. / 

(c) Assume that the respondent possesses the 
low-valued attitude or behavior in some degree, 
and place the burden of denial on him. Kinsey, 
et al. (1948) use this technique when they 
phrase their questions, "When did you first 

" instead of "did you ever . . .” Similarly, 
if people are asked whether they paid cash for 
an article or bought it on the installment plan, 
too many say they bought for cash. But if the 
question is worded, "How much are your 
monthly payments?" the proportion who say 
they paid cash corresponds much more closely 
to the proportion known to be true from sales 
figures. 

(d) Use euphemisms. For example, in an 
interview on child-rearing, "training methods" 


or “methods of discipline" would be better than , 


"methods of punishment." 

(e) If the sensitive area involves the expres- 
sion of criticism of a person or institution, 
provide an opportunity for the respondent to 
voice praise first, so that he will not feel he is 


being unfair. "What do you like best about 
your supervisor," then “What do you like 
least?" 

(E) Introduce face-saving phrases and words. 
For example, "Do you happen to know who 
the Congressman from this district is?" was 
used instead of the more abrupt "Do you know 
. . A and Russians escaping to the West, who 
might hesitate to admit former Communist 
Party membership, were asked: “Were you able 
to keep out of the Communist Party?" 

One of the major areas of discussion among 
social scientists who have used the interview 
as a research tool centers around the problem 
of whether questions should be "open" or 
"closed." An open question gives the respond- 
ent little or no guidance as to the form or 
content of his answer. A closed question, on 
the other hand, limits his answer severely: it 
specifies a series of alternatives (sometimes only 
two) from which he is to choose. The large 
polling agencies commonly use closed questions. 
For example, Roper frequently employs the 
multiple-choice (or “cafeteria”) question, in 
which the respondent is given a card containing 
a series of alternatives, and is asked: "Which 
of these comes closest to your own opinion?" 
The Gallup questions on voting intentions, 
and most of his other questions, are closed, and 
are frequently of the two-choice type; the re- 
spondent is to say "yes" or “no,” agree or dis- 
agree, approve or disapprove, or choose one 
of two candidates. Closed questions are com- 
monly used in studying consumer purchases. 
The consumer is asked which of a list of house- 
hold durables he bought within the past year; 
he looks over the list and designates the items 
which he bought. 

Open questions state a problem without 
specifying a list of alternative answers. Follow- 
ing are some examples of open questions: "How 
would you compare the way you are raising your 
children with the way your parents raised 
you?," "It has been suggested that the United 
Nations should have the power to make laws 
which the people in the United States and the 
other member nations would have to obey. 
How do you feel about this?," "Did you buy 
any large items for your home during the past 
six months?" (If yes) “What things were 
those?" 

It should be borne in mind that the openness 
of a question is a matter of degree, Questions 
vary from those which restrict the respondent's 
response completely to those which do so mini- 
mally. Every question limits the range of 
possible answers to some extent, by virtue of 
specifying the subject matter under discussion. 
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For example, in interviewing about prices of 
consumer goods, the following question might 
be used: "Would you give me your comments 
about the prices of the things you buy?" This 
question is very open, and permits a range of 
answers which would include, "My grocer will 
overcharge if I don't watch closely," "Eggs have 
just come down, but butter is still high," 
“Things were much cheaper when I was a 
child," and "I think theyll be coming down 
soon." If the research worker is interested in 
price expectations, he may use a question which 
is less open but still not closed: "What do you 
think will happen to the prices of the things 
you buy during the next year or so?" The com- 
pletely closed version would be: "What do you 
think will happen to the prices of the things 
you buy: Will they go up, down, or stay about 
the same?" Here, the respondent is almost fully 
restricted as to the nature of his answer. 

It will be seen that open questions, in a 
sense, ask the respondent to recall something 
— to produce it spontaneously. Closed ques- 
tions, on the other hand, ask the respondent 
to recognize something. The literature on re- 
call versus recognition memory tells us that 
more will be recognized than will be recalled, 
and such proves to be the case with open versus 
closed questions. In consumer purchase studies, 
for example, the open question yields fewer 
buyers of any given item than does the closed 
question. The chances are that the closed ques- 
tion is more valid in purchase studies; that is, 
that the housewife who looks down the list and 
says, "Oh yes, I did buy an electric iron. I had 
forgotten about that," really did buy the iron. 
On opinion questions, however, the difference 
in validity is by no means so clear. The open 
question may produce errors of omission, but 
the closed question may produce equally great 
errors of commission. If the respondent is pre- 
sented with a series of alternatives, he may 
choose one even though he has never thought 
about the issue before, and has no opinion or 
information on it. Choosing something saves 
him from having to admit his ignorance. In the 
open question, where he must produce an an- 
swer in his own words, the chances are probably 
better that ignorance will be revealed and that 
the spontaneous answers which do emerge will 
reflect his more considered opinions (Camp- 
bell, 1946) . 

An advantage of the open question, then, is 
that it permits the research worker to detect 
ignorance, misunderstandings of the wording 
of the question, unexpected frames of refer- 
ence, etc. Furthermore, it is not so likely to 
suggest an answer to the respondent. The open 
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question is presumed also to have the advan- 
tage (although this, again, has not been docu- 
mented) that it promotes rapport because the 
interchange between interviewer and respond- 
ent seems more like an ordinary conversation. 
With closed questions, the respondent may be- 
come irritated by having the words of the fixed 
alternatives put into his mouth. He may feel 
that none of the answers provided properly 
represent his opinions, with the nuances he 
would like to include, and he may prefer to 
state his answers in his own words. At any 
rate, open questions are clearly a more natural 
form of verbal communication and, as such, 
should produce a closer replica of the kinds of 
statements the respondent might make in real 
life. 

The primary advantage of the closed ques- 
tion is that it produces greater uniformity 
among respondents in the dimension along 
which their answers lie. In the series of ques- 
tions about consumer prices, given earlier as 
an example, it will be evident that the first 
form of the question is much too open if the 
objective of the study is to learn something 
about price expectations. While some of the 
respondents happen to answer the question 
from this frame of reference, many do not, and 
their answers would not be useful in analysis 
of the data. The closed question pins down 
each respondent and obtains an answer which 
fits into the categories of analysis. It may be 
objected that to force individual answers into 
predetermined categories will not serve the 
purposes of analysis if this has been done at 
the expense of validity — that is, if it has been 
done by forcing an opinion on the respondent 
which is not really his. This is true, of course, 
and careful pretesting of fixed-alternative ques- 
tions must be done to make sure that the ques- 
tion itself is meaningful to most respondents 
and that the answers provided offer a reason- 
able range of answers so that the respondent 
can find his own position represented. The 
reason that many answers to open questions are 
not useful in analysis is usually not that the 
respondent could not have answered along the 
relevant dimension; usually, he can and will 
give an answer which fits the categories in- 
tended for analysis if it is made clear what the 
desired dimension is. For example, in the case 
of price expectations, the woman who answered 
the original question in terms of her memory 
of prices in her younger days could no doubt 
talk about the probable trend of prices in the 
future — she only happened to take the origi- 
nal question in another light. Those who have 
used open questions have found that in many 
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instances they must follow them by more spe- 
cific, closed questions, in order to make sure 
that all respondents give material relevant to 
the variables which are being measured in the 
study. 

Another advantage of the closed question is 
that it facilitates field coding. When the ques- 
tion limits the possible answers to a specified 
set of alternatives, the interviewer may simply 
check the alternative which the respondent 
chooses and, in many instances, the answer may 
be transferred directly from the interview form 
to a punch card. When open questions are 
used, on the other hand, and the respondent 
answers at length in his own words, there are 
considerable problems of classifying and using 
the answer in analysis. The interviewer may 
be instructed to classify the answer into a set 
of categories as soon as the respondent has 
finished giving it. For example, in reply to a 
question on sovereignty for the United Na- 
tions the respondent may say: "Well, I don't 
think anybody should have the right to tell 
the United States what to do. After all, we're 
footing the bill for all those other countries. 
Of course, I believe in the United Nations — 
do they have to be able to make laws to keep 
going? If they do, I suppose I'm in favor of it." 
If the interviewer is provided with a set of 
categories: "Approve, approve with reserva- 
tions, pro-con, disapprove with reservations, 
disapprove" and is required to check one, 
it is evident that he will have difficulty, 
and that the reliability of his judgment will 
be questionable. On the other hand, if he is 
not provided with categories, and is instructed 
to write down the respondent's answer verba- 
tim, it will be necessary to classify those answers 
in a later step of analysis. Coding free-answer 
material is costly and time consuming. 

Practitioners in the field of public opinion 
interviewing nearly all use some open and 
some closed questions. It is widely recognized 
that each technique has its own virtues. The 
choice between forms depends on the objec- 
tives of the study, the time and money avail- 
able, and perhaps most importantly, on the 
level of the research worker's knowledge about 
the dimensions of opinion in the area he is 
studying. If he is only beginning to explore 
an area, he must use open questions, for he 
will not know how to phrase alternatives for 
closed questions. If he knows the specific manis 
festations of the dimensions he is studying, and 
is satisfied to obtain data on these dimensions 
and nothing else, then the fixed-alternative 


question may serve his purposes very well. 


Many studies include a range of questions, 
some open, some closed. It has been common 
to arrange groups of questions in a “funnel” — 
that is, to begin questioning on a particular 
topic with an open question, and then follow 
with closed questions on specific points of in- 
terest to the research worker. The primary 
advantage of this technique is that it permits 
study of the spontaneous frame of reference 
which a respondent will adopt before any line 
of thought has been suggested to him, while 
at the same time providing comparable 
data for all respondents on specific points. 
In a recent paper, Gross and Mason (1953) have 
suggested that there are instances in which the 
research worker should use the opposite tech- 
nique: the “reverse funnel.” Gross and Mason 
were studying role conflict in school superin- 
tendents, and embarked on their work using 
open questions. Using all their ingenuity in 
phrasing questions, they could not get re- 
sponses related to role conflict with open 
questions during their pretest. Instead, the 
questions brought out comments about situa- 
tions in which the superintendent was in dis- 
agreement with groups or individuals in his 
community. After considerable experimenta- 
tion, Gross and Mason found that they had to 
reverse their procedure. They first presented 
the superintendent with four structured situa- 
tions which all superintendents face (salary 
increases for teachers, etc.). Then a series of 
fixed alternatives were given, concerning how 
various people or groups might expect him (the 
superintendent) to act. Then the superin- 
tendent was asked whether the individuals who 
expected different things of him had a right to 
do so. If different expectations were perceived 
as justified, then the case was considered one 
of role conflict. After a considerable period 
of highly structured interviewing in the manner 
described above, Gross and Mason found that 
they could ask open questions about other role 
conflict situations which the superintendent 
had encountered, and get meaningful answers. 

Gross and Mason's experience seems to sug- 
gest that there are instances in which the re- 
search worker wishes to inquire about a general 
category of experience, but the category is one 
which is not usually seen as a category by the 
respondent. However, after giving a series of 
instances of the category he has in mind, the 
interviewer can impart to the respondent a 
sense of the meaning of the category (assuming 
that the instances actually do have something in 
common!) , and permit him to talk about other 
instances and finally come to a generalized 
opinion. 
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INDIRECT VERSUS Direct METHODS OF QUESTIONING 


We have discussed elsewhere the fact that 
respondents are sometimes unwilling or unable 
to report directly the data a research worker 
wishes to secure. He may wish to measure 
certain underlying personality traits (such as 
“authoritarian personality") which the re- 
spondent could not describe in himself even 
with the best possible rapport between inter- 
viewer and respondent. Or he may be interested 
in measures of "sensitive" areas, such as sexual 
behavior, financial situation, intergroup preju- 
dice, or neurotic symptoms. Presumably, the 
better the rapport the interviewer is able to 
achieve with the respondent, the more valid will 
be the answers he will get in sensitive areas, but 
usually there will be some distortion of the 
individual's report in the direction of what he 
believes to be socially acceptable. 

A variety of indirect techniques have been 
used in interviewing when the subject matter 
of the study is likely to arouse resistance. The 
rationale behind the indirect techniques is that 
they will produce relevant information without 
the respondent's being fully aware of the impli- 
cations of his replies. 

In recent years, there has been great interest 
in “projective” questions as devices for indirect 
measurement. While the term is variously used, 
projective questions usually refer to questions 
asked about someone other than the respond- 
ent. The assumption is that the respondent 
will put himself in the place of the other person 
referred to in the question, so that his answer 
will reflect his own attitude. Thus, if a re- 
spondent says that "other people" probably 
don't believe in God as strongly as they say 
they do, the answer would be taken to mean that 
the respondent's own private belief in God is 
weaker than his public belief. The principle 
of the projective measure has been made famil- 
iar through the widespread use of such clinical 
instruments as the TAT and Rorschach test 

Projective questions have probably been used 
less widely in research-oriented interviews than 
they have in paper-and-pencil personality in- 
ventories and in diagnostic and therapeutic 
interviews. In public opinion work, projective 
questions have given rise to considerable diffi- 
culties in interpretation of the answers. In 
some instances, answers do seem to reflect the 
respondent's own attitudes, but in others they 
seem to represent a very different thing: his 

realistic appraisal of someone else's attitudes. 
For example, a young woman being interviewed 
about her attitudes toward her job situation 
was asked: "How do most of the girls in the 


office feel about the supervisor?" She answered: 
“They think he's wonderful. They'll do any- 
thing for him.” At which point the interviewer 
followed with: *And how about you — how do 
you feel about him?" with the reply: "I really 
detest him, I'm trying to transfer out of the 
unit." 

The problem of validating projective ques- 
tions is essentially difficult, One can, of course, 
correlate the answers to projective questions 
with the answers to direct questions on the 
same subject. But one should not expect a very 
high correlation — research workers have had 
recourse to projective questions in the first 
place because they hoped these questions would 
measure something which direct questions can- 
not measure. The hope has been, in other 
words, that indirect questions would yield in- 
formation that was more valid than the infor- 
mation produced by direct questions. The 
literature is almost devoid of evidence that this 
hope is at all justified (Campbell, 1950, p. 30) . 

One study (Getzels, 1950) provides some 
support for the greater validity of indirect 
questions, although it was done with a paper- 
and-pencil questionnaire and used sentence- 
completion items, and may therefore not be 
entirely applicable to the face-to-face interview. 
Getzels compared direct items, such as “When 
I am asked to be in charge, I generally tend 
to .. ," with indirect (or projective) items 
such as “When they asked him to be in charge, 
he..." He found that indirect items on 
neurotic symptoms would discriminate between 
a group previously classified as “maladjusted” 
and one classified as "adjusted" better than 
would direct items. He also found that the 
difference between the answers to direct and 
indirect items depended upon how much the 
topic was subject to clearly perceived public 
norms and social pressures. His indirect items 
yielded more expressions of hostility toward 
Negroes, more expressions of conflict with 
parents, more "neurotic" answers to personality- 
relevant questions, than did direct items. On 
the other hand, there was little or no difference 
between the responses to direct and indirect 
items in political-economic questions where a 
variety of answers would be equally acceptable. 
A further finding was that while the answers 
to direct items in areas subject to social norms 
were grouped around the conforming end of 
the scale (in a J-curve), the answers to the 
projective questions in the same areas were 
more normally distributed. 

Getzels asked his subjects about their per- 
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ceptions of the person referred to in the in- 
direct questions. Usually, they reported that 
they thought of the person as “someone like 
myself.” The attitudes which they attributed 
to this mythical other person were very dif- 
ferent from the attitudes which they attributed 
to “a cross section of the people.” Getzels has 
some evidence, then, that his respondents were 
identifying with the subject of the indirect 
item, and not merely describing what they 
believed other people’s opinions to be. His 
findings on the validity of indirect items apply 
to questions which refer to indefinite subjects 
Che" "John") and not to questions referring 
to “most people” — the form more commonly 
used as projectives in public opinion work. 

In personal interviewing, it is difficult to 
ask projective questions such as “When some- 
one is rude to John, how do you think he usu- 
ally feels?" The respondent may ask who John 
is. On the other hand, if the interviewer asks 
projective questions in terms of what “most 
other people” think, the respondent may give 
his appraisal of the beliefs of others (or say he 
doesn't know what other people think about 
the issue) , instead of identifying with the sub- 
ject of the question and projecting his own 
beliefs in his answer. Getzels’ work shows, in 
fact, that people very often do not project when 
the “other people” form of a question is asked. 

One possible solution in personal interview- 
ing is to phrase questions in terms of “a per- 
son”: “How do you think a young person would 
feel about it if his father wouldn't let him stay 
out on Saturday evenings past twelve o'clock?" 
Or it is possible to invent a story in which the 
situation of a mythical person is described, 
after which the respondent may be asked how 
he thinks the subject of the story would feel 
and behave. 

An interesting example of the successful use 
of projective questions in interviewing a sizeable 
sample is provided by Sanford and Rosen- 
stock (1952). Their objective was to obtain 
data relevant to authoritarian personality 
structure. They used a series of cartoon-like 
drawings, depicting situations where issues of 
leadership or authority were involved. For ex- 
ample, in one picture a man is standing in front 
of a group and saying: “Since I'm head of this 
group, you'd better do as I say.” One member 
of the group is shown answering, but his answer 
is left blank. The interviewer shows the picture 
to the respondent, and says; “Here is a picturet 
One person is talking, the other is going to 
answer. What do you think the answer would 
be?" or "Something has been said. What's the 


first answer that comes to your mind?" The 
pictures were mixed with a series of more con- 
ventional interview questions, including some 
from the F-scale (on authoritarian personality) . 
The entire interview took over an hour. 

Sanford and Rosenstock report that respond- 
ents were very interested and cooperative in 
the use of the pictures; they believe that the 
change from conventional questions served to 
break the monotony of the interview. It was 
possible to score the picture responses (using 
a system of coding responses into categories) 
with reliability equal to or greater than that 
for conventional questions, and the reliability 
from repeated interviews also compared well. 
Although it was not possible to obtain be- 
havioral data against which to validate the 
responses to the pictures, the picture responses - 
did discriminate between groups rated au- 
thoritarian or equalitarian on the Escale, The 
technique is therefore promising from the 
standpoint of validity, and offers the great ad- 
vantage that it can be employed in interviewing 
all levels of the general public. Burleigh Gar- 
diner (1952) has used a similar technique with 
comparable success in commercial research. 

Projective questions are not the only kinds 
of indirect questions which have been employed 
in interviewing. Campbell (1950) and Wesch- 
ler and Bernberg (1950) have summarized 
much of the literature. We can only mention a 
few of the indirect methods here, to illustrate 
the variety that is possible: 

(1) The errorchoice method (Hammond, 
1948). People are asked to choose among vari- 
ous possible answers to a factual question (such 
as what proportion of the consumer's dollar 
goes to profit). All the answers provided are 
incorrect. The direction of the error a respond- 
ent makes will reflect an attitude. 

(2) Information tests. It has been widely 
found that the amount and kind of information 
an individual has about a subject will be a 
function of his attitudes in the subject-matter 
area (Newcomb, 1946). Information questions 
in an interview may be used, therefore, as an 
indirect measure of attitudes. 

(3) Argument completion (Murray and 
Morgan, 1945). The respondent is given a card 
containing a description of two people having 
an argument. He is asked to complete the argu- 
ment. A series of cards are presented as a test 
of the respondent's powers of argumentation. 

(4) Word association. Vicary (1948) has 
used the word-association device with some suc- 
cess in market research. For example, the re- 
spondent may be presented with a stimulus 
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word, such as a kind of commodity, and asked 
to respond as quickly as possible with the brand 
name that comes to mind. 

We have pointed out some of the kinds of 
indirect questions which have been used in in- 
terviewing, and have presented fragmentary 
evidence that such questions (1) yield more 
differences from direct questions the greater the 
social pressure against direct expression of the 
attitudes, and (2) are more valid measures 
than direct questions when social pressures are 
present. Balanced against these tentative find- 


ings is the fact reported by Campbell (1950) 
that direct tests tend to have higher re- 
liability coefficients than do indirect ones. A 
reasonable conclusion appears to be that in- 
direct questions should be used only when the 
subject matter is one involving strong social 
pressures against the expression of certain atti- 
tudes, and when measures can be taken to 
strengthen the reliability of the indirect ques- 
tions sufficiently so that their potentially greater 
validity will have an opportunity to be effective. 


RoLe RELATIONSHIPS IN THE INTERVIEW 


One could imagine a measurement situation 
in which a respondent sits alone in a room 
where questions are emitted by a phonograph 
and the respondent's answers are recorded by a 
mechanical recorder. An interview in which 
the interviewer is present to ask questions is 
very different from the hypothetical situation 
just described, and the difference lies mainly 
in the nature of the interpersonal relations 
between the two people involved. The be- 
havior of each member of the pair is deter- 
mined to a considerable extent by his perception 
of the other: his perception of the other's 
status and power position, the other's probable 
views, and the other's similarity to people with 
whom he interacts or would like to interact. 

What are some of the roles in which respond- 
ents may perceive an interviewer? Much de- 
pends, of course, on the auspices of the study 
and the setting of the interview. If the study 
has been sponsored by a prestigeful institution 
and covers topics on which the interviewer 
might be assumed to have expert knowledge, 
the interviewer may find himself placed in a 
role similar to that of the family doctor: he is 
consulted for his advice on the respondent's 
problems. Such a role for the interviewer may 
interfere seriously with his purpose of gathering 
certain kinds of information from the respond- 
ent. When the study is sponsored by an in- 
dustrial concern and the respondent is an 
employee, there is danger that the respondent 
will perceive the interviewer in the role of 
"company man" or informer, and in these cir- 
cumstances the respondent will not, of course, 
be candid. When the study has no particular 
sponsorship, and is introduced at the respond- 
ent's door by an interviewer who says he is 
"doing a survey," the respondent may believe 
he is a salesman, a solicitor, or an investigator 
with sinister purposes. Almost always, the role 
in which the respondent places the interviewer 

will have some status implications: that is, it 


, 
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will be a role which has more prestige, the same 
prestige, or less prestige than the respondent's 
own role, and the nature of the communication 
which can occur between the respondent and 
the interviewer will be greatly influenced by the 
mutual perceptions of status. In general, we 
know that people are more anxious to commu- 
nicate to those above them in the hierarchy 
than to those below them. Back, et al. (1950) 
and Riesman and Glaser (1948) have pointed 
out that this applies to the interviewing 
situation. While upper-class respondents feel 
they have little to gain by expressing their 
opinions to an interviewer, lower-class respond- 
ents are pleased to be consulted. The content 
of the communication, of course, will be af- 
fected by the status relationships: the person 
of lower status will be motivated to present 
himself in a favorable light to someone who 
might be in a position to influence his future. 
At the same time, there may be elements of hos- 
tility based on the resentment of authority. 

Studies on interviewer bias (to be reviewed 
later in this chapter) have dealt primarily with 
the effects upon interview data of the charac- 
teristics of the interviewer independent of the 
characteristics of the respondent; the above 
discussion of role suggests that it would be 
fruitful to analyze interview data in the light 
of interviewer and respondent characteristics 
taken jointly. That is, a young interviewer 
should exercise a different kind of influence 
upon the interview, depending on whether his 
respondent is young or old. Similarly, the ef- 
fects of the social status of the interviewer, or 
sex of the interviewer, can be understood only 
if the interviewer's status or sex is seen in rela- 
tion to that of his respondent: certain kinds of 
communication should occur most easily if the 
respondent perceives himself to be higher than 
the interviewer on the social scale, certain kinds 
if lower, and other kinds if the two are at ap- 
proximately the same level. 
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The essential point regarding role relation- 
ship in an interview is that the interviewer must 
occupy some role, whether he wishes to or not, 
and that therefore the research worker must 
be conscious of the various roles possible for 
his interviewers and attempt to establish the 
role which will best further the purposes of his 
study. Having decided upon the most advan- 
tageous role, he can govern his choice of inter- 
viewers, of sponsorship, and of introductory 
remarks accordingly. Of course, it is not always 
possible to anticipate the respondent's role per- 
ception of the interviewer, but much can be 
done to encourage desired perceptions and 
avoid undesired ones. 

What is the ideal role for interviewers? There 
is no general answer to this question. There 
are some studies where the role of "expert" may 
be best, and others where the role of naive 
learner may produce more valuable informa- 
tion. There do seem to be some general points 
regarding the optimum interviewer role which 
emerge from a variety of studies, however. First, 
the interviewer should not appear to be com- 
pletely naive on the subject matter of the 
study, or the respondent will feel that his an- 
swers cannot be understood. The more special- 
ized the subject matter, the more must 
interviewers be selected or trained to possess 
a minimum of sophistication about the topic. 
For example, Gross and Mason (1953) found 
that in interviewing school superintendents, 
interviewers were unsuccessful unless they had 
an institutional sponsorship which would imply 
their having some knowledge of educational ad- 
ministration; Kinsey, et al. (1948) found that 
interviewers working with respondents from 
the underworld must know some underworld 
jargon to gain the confidence of the respond- 
ents, and the Department of Agriculture found 
that interviewers (especially if they were men) 
could achieve better rapport with farmers if they 
had some agricultural experience. On the other 
hand, the interviewer must not appear to know 
everything that the respondent knows: he must 
appear to need, and want, information which 
the respondent is in a better position to have. 
Ideally, then the interviewer should appear to 
be (to some degree) an “expert” seeking addi- 
tional information from an informed person 
who has detailed knowledge or opinions to con- 
tribute (Bauer, 1953). 

An essential element of the interviewer's role 
is that he should not be in a position to controF 
sanctions affecting the respondent. He should 
be outside the power hierarchy in which the 
respondent normally finds himself. If he is not, 
the respondent will attempt to put himself in 


such a light as to gain the interviewer's support 
and avoid his censure, and will not talk freely 
about any topic where he fears he might dis- 
credit himself. Furthermore, any in-group 
applies sanctions against the expression of ag- 
gression within the in-group, and if the research 
worker wants to detect elements of internal 
hostility and tension, he will be impeded if he 
is identified as a member of the in-group. It is 
common knowledge among research workers 
using the interview that under proper condi- 
tions people will talk to an interviewer who is a 
complete stranger more freely than they would 
to a personal friend or fellow-worker (England, 
1949). The interviewer's detachment from the 
respondent's real-life situation is an important 
asset, and in most studies his role must be 
played in such a way as to maintain this detach- 
ment. Miller (1952) has warned about the 
dangers of "over-rapport." He found that when 
he established close personal friendships with 
informants, it became very difficult to question 
their basic attitudes. A further difficulty is that 
if the research worker becomes intimate with 
one person or group among his potential re- 
spondents, other respondents will identify him 
with that person or group and govern their 
communications accordingly. All this means 
that, in general, social contacts between inter- 
viewers and respondents are not to be encour- 
aged, unless the interviewer is working at the 
exploratory level and must rely upon partici- 
pant observation to check some of his interview 
findings; even then, his research may be ham- 
pered by his loss of the stranger role. 

The “deeper” the material touched upon in 
an interview, the more familiar the interviewer 
should be with the emotional involvements that 
may appear with communication of such ma- 
terial. Therapists have long been familiar with 
the strong positive or negative feelings toward 
the interviewer which may occur. In most social 
science research work, interviewing is seldom 
so deep and so prolonged that the interviewer 
has serious problems of resolving transferences, 
but he must be prepared for less profound va- 
rieties of interpersonal stress. The respondent 
may exhibit dependent responses toward the 
interviewer: ask for help, advice, praise. The 
interviewer must attempt to deflect these de- 
mands upon himself without appearing to 
reject the respondent. If, under the pressure 
of anxiety aroused by the interview, the re- 
spondent becomes hostile, the interviewer must 
refrain from becoming angry or hurt in his 
turn, while not appearing indifferent. 

It would seem evident that role theory and 
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research can contribute considerably to the 
understanding of role relationships in the in- 
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terview, and the interview situation provides a 
good laboratory for the study of role problems. 


GETTING THE RESPONDENT'S COOPERATION 


One of the problems in conducting most in- 
terview studies is that of getting the respond- 
ent's cooperation in being interviewed. While 
the best methods for getting rapport vary with 
the time, place, subject, and identity of the 
respondent, there are a number of general 
points that can be made that apply to most 
interview situations. If, for example, inter- 
viewing is to be carried out in a community, a 
carefully planned program of obtaining co- 
operation within the community is desirable. 
Depending on the subject matter, various local 
groups and relevant officials may be interested 
in the study and can help obtain community 
acceptance. It should be pointed out, however, 
that it is often dangerous for the researchers 
to be too strongly identified with one group in 
the community. If, for example, an industrial 
study is being carried out and workers are to 
be interviewed, support for the study should 
be obtained from union as well as company 
officials. Often the fact that such a survey is 
being conducted is sufficiently newsworthy to 
attract space in newspapers or other local publi- 
cations. Even if these media do not directly 
reach all respondents, an interviewer can be 
armed with a copy of the newspaper story, and 
this will sometimes simplify his entrée consider- 
ably. Some research organizations find it de- 
sirable to make it a rule to notify local police 
in order to avoid potential difficulty. 

One method of attempting to reduce the re- 
fusal rate and ease the interviewer's problems 
in obtaining cooperation from respondents is 
the use of preceding letters or telephone calls. 
These devices have sometimes been found to 
be helpful in the case of respondents in the 
upper socioeconomic levels. Similarly, when 
the study is being conducted among the mem- 
bers of an organization (business, government, 
student body, etc), the making of appoint- 
ments with respondents in advance of inter- 
viewing can facilitate matters greatly. 

Appointments have the advantage of permit- 
ting the respondent to select a time for the 
interview when he will be free and uninter- 
rupted. They also enable him to consider in 
advance the subject matter of the interview, so 
that his answers to the interviewer will not be 
so much “off the top of his head.” There may 
be some disadvantages to making appointments 
in advance, however: in order to telephone or 
write, the survey organization normally has to 
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identify the potential respondents, and it is 
difficult in these circumstances to maintain an 
atmosphere of anonymity. Furthermore, if tele- 
phone contacts are made, it is easier for the 
respondent to refuse to be interviewed. (In 
an unpublished study, Peter Rossi found a 
higher refusal rate with telephone than face- 
to-fface contact.) A compromise solution 
adopted by some organizations (e.g., Survey 
Research Center) is to write a letter to the 
“Head of the household” at a sample address, 
introducing the study and the interviewer, and 
giving the time when the interviewer will make 
his first call, but not giving the respondent an 
opportunity to refuse until the face-to-face con- 
tact has been made. In some studies, of course, 
it is better not to forewarn the respondent 
about the interview. In expectation of being 
interviewed, he might inform himself about the 
topic, and thus measurements of information 
level would be misleading, 

The above are suggestions for improving 
public relations in undertaking a survey in a 
community. In addition, there are a number of 
things an interviewer can do to gain a respond- 
ent's cooperation in the face-to-face situation. 
First of all, most people like to be interviewed 
and will usually agree without much persua- 
sion, If the study is serving an important public 
or scientific need, the appeal to the respondent's 
altruism is often effective. In any case, the 
respondent, if he wishes, is certainly entitled 
to know, whenever possible, the nature and 
purpose of the study on which he is being inter- 
viewed. It should be pointed out, however, 
that one of the most common errors interview- 
ers make is to give too long and too complex an 
introduction. Most respondents are not in- 
terested in getting complete statements on the 
nature, purpose and design of the study, and 
brief introductions work best with such re- 
spondents, 

Interviewers who are confident and expect to 
get respondent cooperation usually have the 
fewest refusals. When interviewers experience 
several successive refusals, it is wise to have 
them obtain help on their approach before 
proceeding to further interviews, Although all 
interviewers, if they do enough interviewing, 
will occasionally run into an uncooperative re- 
spondent, survey organizations usually find that 
a large proportion of refusals and semirefusals 
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come from a disproportionately small number 
of interviewers. 

There is considerable evidence to suggest 
that careful selection and training of interview- 
ers can keep the refusal rate low. When refusals 
reach fairly high rates of 10-15% (Sheatsley, 
1948), sampling bias can become a serious 
problem (Hilgard and Payne, 1944). Heneman 
and Paterson (1949) reported reducing refusal 
rates to below 1%. In an unpublished survey 
by one of the writers, the refusal rate was cut 
from 15% to 4% by the use of a selected team 
of follow-up interviewers on initial refusals. 

If it can be done, an appeal to the personal 
needs of a respondent, showing a way in which 
the survey can be of help to him, can be a 
highly effective inducer of rapport. However, 
if the motivation for cooperating is too per- 
sonal, the respondent may distort his answers in 
the light of what he wants to get out of the 
interview. 

Finally, the interviewer must attempt to 
make sure that the situation under which the 
interview is to occur will be optimal for rap- 
port. If the interview is to be with a housewife, 
for example, and is to take a half to three- 


quarters of an hour, it would be wise to select 
a time of day when she can best spare that time, 
and a real attempt should be made to conduct 
the interview inside the house in a room where 
the interview is unlikely to be disturbed by 
others in the household. Not all questions in 
a study are affected by the presence of other 
people during an interview, but many are, and 
the pattern of role relationships (discussed 
above with reference to the respondent and in- 
terviewer) is made still more complex when 
others are present. In one study of the re- 
adjustment of returned veterans (Stouffer, et 
al., 1949, pp. 596-644) , it was found that when 
the veteran's wife was present during the inter- 
view, he was more likely to complain about 
unfair treatment by the public than if she were 
not present, and that he was less likely to ex- 
press resentment toward civilians who asked 
him about his war experiences if others were 
present. Which set of statements more truly 
expresses the respondent's "real" attitudes is 
not so much the point here — the fact remains 
that the statements will be different if others 
are present, and comparability of cases is 
jeopardized. 


"TECHNIQUES OF INTERVIEWING PROPER 


Interviewing has often been called an art 
rather than a science, on the grounds that there 
are few scientifically established generalizations 
available concerning the usefulness of different 
interviewing techniques. Nevertheless, there 
are many points of technique which are widely 
agreed upon by practitioners, and it may be 
helpful to outline these here. Our summary 
is necessarily brief, and for more detailed dis- 
cussions of interviewing technique the reader 
is referred to Bingham and Moore (1941), 
Rogers (1945), and Sheatsley (1951c). 

The first question of technique concerns the 
interviewer's proper manner and bearing in 
asking the questions. Should he be responsive 
or detached? Respectful or man-to-man? Light 
or serious? Friendly or formal? In our earlier 
discussion of role, we pointed out the advan- 
tages of the interviewer's role as a stranger who 
is not involved in the respondent's normal sys- 
tem of personal interactions. This means that 
the interviewer should retain a certain amount 
of reserve; he should be dignified, and make it 
clear that he takes the interview seriously. At 
the same time, he should not be wooden: it is 
of the greatest importance for the interviewer 
to show genuine interest in what the respondent 
is saying. The interviewer must tread a fine 
line with respect to his responsiveness — it Is à 


cardinal principle of interviewing that the in- 
terviewer must attempt not to reveal his own 
attitudes on the subject matter of the study: he 
must not show shock or disapproval over any- 
thing the respondent says, or nod enthusiasti- 
cally when the respondent supports the inter- 
viewer's own point of view. If the interviewer 
is completely impassive and noncommittal, 
however, even when the respondent says some- 
thing evidently intended to shock him, the 
natural conversational atmosphere of the inter- 
view will be destroyed, and rapport will suffer. 
The following has proved to be an effective 
compromise: the interviewer adopts a manner 
of friendly permissiveness, He laughs at the 
respondent's jokes, exclaims when the respond- 
ent says something evidently intended to be 
astonishing ("really?," “you don't say!") , makes 
supportive statements such as "I see your 
point," "that's understandable," "thats very 
interesting," and in other ways allows himself 
some of the emotional expressiveness which 
would be normal in the situation. He scrupu- 
lously avoids, however, direct approval or dis- 
approval of the respondent's position — he 
never argues with the respondent, or says “I 
feel the same way myself." 

Sometimes the respondent asks the inter- 
viewer where he stands on the issues under dis- 
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cussion. The interviewer may first simply try to 
deflect the question, by saying: “I'm interested 
in what you think about it," but if the question 
is pressed, he can either say that he hasn't made 
up his mind, or that he is not supposed to 
express his own opinion but would be glad to 
talk about it when the interview is over. (He 
will almost never be asked to make good on 
this offer!) 

When conducting a structured interview, the 
interviewer must, of course, ask the questions 
word for word as they are provided for him. 
He can practice delivering them, however, until 
they sound and feel to him like his own words. 
Often after the interviewer has conducted a 
number of interviews using the same interview 
schedule, he begins to ask the questions too 
rapidly and in a "canned" fashion which makes 
it difficult for the respondent to understand 
him. He can usually overcome this tendency 
by pausing and looking at the respondent di- 
rectly as he asks the question. 

There is some difference of opinion in the 
field about how much time the respondent 
should be given to think over his answer. Kin- 
sey believes that it is well to ask questions in a 
rapid-fire manner, so that the respondent will 
not have time to organize his defenses and think 
up an evasive answer. This technique is based 
upon the assumption that the truth is the first 
thing which enters the respondent's mind when 
he is asked a question. On the other hand, 
many researchers feel that permitting the re- 
spondent more time will reduce the number of 
"don't knows" and superficial or unconsidered 
answers. It would seem to the writers that for 
questions that are likely to meet with emo- 
tional resistance, the rapid-fire technique may 
be superior in some instances, but that, in most 
interviewing, allowing the respondent ample 
time to give a considered answer is preferable. 

It has just been pointed out that in struc- 
tured interviewing, the interviewer must ask 
the questions in the exact words of the schedule. 
But beyond this, in open-end interviewing, it 
is extremely important that the objectives of 
the study are met in each question. The inter- 

viewer must judge whether or not the question 
is understood by the respondent, and whether 
or not it is adequately answered. If it is not, 
probing must be done. 

Users of open interviews agree that such prob- 
ing should be nondirective. By this is meant 
that the question introduced by the interviewer 
should in no way influence the direction of the 
response. It should seek to elicit the desired 
information from the respondent without in 
any way making it likely that the respond- 
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ent will lean more in one direction than an- 
other in responding to the probe or additional 
question. There has been considerable discus- 
sion by Rogers (1945) concerning the nature 
of nondirective probing in clinical interviewing 
and suggestions by him of how such probing 
should be used in research interviews. The 
Survey Research Center at the University of 
Michigan makes extensive use of such probing 
in its open-end interviews. 

It should be noted that all nondirective 
probes do, in a sense, “direct” the respondent. 
Even if a respondent’s statement is merely re- 
flected back to him, or the question is repeated, 
or a neutral question is added, like “Tell me 
more about that" or "Why do you feel that 
way?" the mere fact that a particular point in 
the interview has been selected by the inter- 
viewer for probing will guide the respondent 
into a different channel than he would have 
chosen spontaneously. Indeed, the intent of 
introducing the probe is precisely to accomplish 
this. If the objective has not been met with 
the single use of the standard question, the 
interviewer probes in order to obtain informa- 
tion that will meet the objective. The question 
must be asked in such a way as to elicit a re- 
sponse which will provide information along 
a given dimension, and frequently the respond- 
ent must be "directed" to that dimension if the 
necessary data are to be obtained. 

When an interviewer is employing a non- 
directive probe, he must learn to know when 
to stop probing. Often there is a reason why 
the respondent fails to understand the question 
or gives an answer which does not actually 
answer the question, namely, the question 
arouses anxiety in him. The interviewer must 
learn to detect signs of anxiety, and must judge 
when it is wisest to stop the line of questioning 
even if he has not obtained the information the 
objectives called for. To continue when the 
respondent is upset will jeopardize rapport for 
the entire interview, and it may be possible to 
return to the topic later when the respondent 
is more at ease. A danger is that the interviewer 
himself may feel anxious about a problem and 
may therefore be oversensitive to signs of dis- 
turbance in the respondent. 

A common problem in interviewing occurs 
when the respondent gives information on a 
later question in the course of answering an 
early one. The safest thing to do is to ask the 
Pater question in its proper place, prefacing it 
by some remark such as: “You have already said 
something about this point, but . . ." and go- 
ing on with the question. The advantage of 
this procedure is that the respondent may say 
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quite a bit more that is highly relevant to the 
objective and still isn’t likely to feel that the 
interviewer is being stupidly mechanical and 
not paying attention to his answers. 

Related to this point is the problem of what 
to do if the respondent contradicts a point he 
has made earlier in the interview. This may 
arise in connection with factual information 
such as earnings or income, or it may arise in 
connection with an attitude. Sometimes the 
contradiction should be allowed to stand with- 
out comment in the interview: the existence of 
contradictory views in a single respondent may 
be an interesting finding of the study. But 
sometimes the study's objectives require that 
such contradictions be followed up in the in- 
terview. In such cases, the interviewer, in 
probing, can quote the respondent, but he must 
be careful not to antagonize him. One good 
device is for the interviewer to precede the 
reinstatement of the early remark with some- 
thing like this: "I want to be sure to get this 
right, and I wonder if I haven't made a mis- 
take. In my notes, I have . . ." This device 
saves the respondent's face and enables the 
interviewer to get the necessary information. 

A problem sometimes encountered in the 
open interview is that of digression by the 
respondent. While in some interviewing situa- 
tions the problem is one of getting beyond 
monosyllabic answers and encouraging the re- 
spondent to talk fully, in others the respondent 
is voluble and discursive, wanting to tell anec- 
dotes or deliver lectures and failing to give 
material directly related to the subject of the 
interview. Serious problems are created when 
this occurs, for usually the interviewer has a 


list of topics or questions which he must cover 
in a limited time, and it will be impossible to 
complete the interview if digressions are not ` 
controlled. In her report of the problems of 
interviewing Russian Displaced Persons (among 
whom the problem of discursiveness is espe- 
cially acute), Bauer (1953) has presented an 
excellent list of techniques for handling digres- 
sions by the respondent. She recommends that 
direct interruptions be avoided, since they can 
do irreparable damage to rapport. Instead, the 
following devices may be tried: (a) The 
“shunt” question — the respondent may be led 
back to relevant material by questions which 
relate what he has been saying to the topic in 
which the interviewer is interested. (b) Long 
personal anecdotes may be shortened by antici- 
pating their ending: the interviewer, showing 
interest in the story, may guess how it came out, 
which will short-circuit some of the intermedi- 
ate steps of thestory. (c) Later digressions may 
be minimized if the interview is begun with a 
general question which gives the respondent 
an opportunity to bring out some of the things 
he especially wants to say. This is important 
if the respondent knows about the interview 
and its subject matter in advance, for he may 
have prepared a list of points he wants to cover 
with the interviewer. Often the things he wants 
to say are not the things which the interviewer 
particularly wants to hear, but it is necessary 
to give the respondent some opportunity to say 
what he has prepared, or he will show mounting 
irritation throughout the interview. Finally, 
one of the best protections against digression by 
the respondent is the precise formulation of 
questions. 


RECORDING INTERVIEW DATA 


A central problem of interview methodology 
is that of how the respondent's answer shall be 
recorded. Interviews have been recorded in the 
following way: 


(1) Write-up from memory. The interviewer 
attempts to remember as much as possible of 
what the respondent has said, and makes notes 
on the content after the interview. 

(2) Field ratings. The interviewer is pro- 
vided with a set of field rating forms. He does 
not attempt to write down the respondent's 
remarks, but on the basis of the interview rates 


the respondent on the dimensions which the, 


study is seeking to measure (war involvement, 
level of economic satisfaction, degree of democ- 
racy practiced in supervision, etc.) . Each rating 
is usually based on a series of questions or on 
the entire protocol. 


(3) Field coding. The interviewer codes 
specific answers to specific questions into pre- 
determined categories, during the interview. 

(4) Verbatim notetaking. While the re- 
spondent is talking, the interviewer attempts to 
take down his answer verbatim, or as nearly 
so as possible. 

(5) Mechanical recording. The respondent 
talks into a microphone (which is either open 
or concealed). His remarks are recorded by 
means of a tape, wire, or disk recorder. 


Recording from memory has the obvious dis- 
advantage that it permits more distortion via 
the interviewer's biases than does immediate 
recording. Points in the interview which 
seemed dramatic or significant to the inter- 
viewer are likely to be sharpened, other points 
dropped out, Answers are likely to be recorded 
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as being more consistent than they actually 
were. In short, all the forms of memory dis- 
tortion which Bartlett (1932) has led us to 
expect should appear in this form of recording. 

Apart from distortion, memory recording in- 
volves a considerable loss in the sheer amount 
of interview content that can be retained. 
Symonds and Dietrich (1941) made mechanical 
recordings of several fairly open interviews, and 
also had the same interviews written up from 
memory after varying periods of time by stu- 
dents who observed the interviews from behind 
a one-way screen. Using a system of counting 
points or “ideas” contained in the interviews, 
and weighting them according to their impor- 
tance, the experimenters found that memory 
write-ups contained, on the average, 39% of the 
interview content if the write-up was done im- 
mediately after the interview, 30% after a lapse 
of 2 days, and 23% after a lapse of 7 days. Im- 
portant points were recalled more fully than 
unimportant points, but even so, almost half 
the important points were omitted from mem- 
ory write-ups. 

Payne (1949) reports, in a comparison of 
mechanically recorded interviews with inter- 
viewer reports written from memory immedi- 
ately after the interview, that 25% of the 
statements attributed by the interviewer to the 
respondent are clearly wrong. Covner (1944) 
finds, however, that while most of the items re- 
corded by the interviewer after a guidance 
interview are accurate, the write-up is very 
greatly condensed (omitting important points) 
and introduces many changes into the temporal 
sequence of the interview. Points are brought 
into conjunction and made to seem related 
which were separated and unrelated in the 
actual interview. 

The argument which has been advanced in 
favor of memory recording is that taking notes 
in the interview may upset the respondent, de- 
stroy his confidence in the anonymity of the 
interview, and produce an unnatural atmos- 
phere. It is difficult to assess the damage, if 
any, that notetaking during an interview does 
to rapport, Experienced interviewers who do 
take notes report very few instances of the 
respondent's raising questions about the notes 
or asking him not to take notes, Occasionally, 
the respondent will want to reveal a bit of 
gossip to emphasize a point, and will say “Don’t 
write this down, but . . . or “off the record”; 
such remarks point to the fact that the inter- 
viewer's notebook sometimes makes the inter- 
view seem more formal, more official. This is 
not always detrimental to the interview, for it 


may mean that the respondent will give more 
consideration to his answers and not answer 
flippantly. Some respondents find it flattering 
that someone considers their views important 
enough to write them down, and in such 
cases, notetaking helps rapport. If the inter- 
viewer notices that his notebook seems to be 
making the respondent nervous, he can talk 
about notetaking a little before he begins the 
interview: "I hope you won't mind my taking 
some notes on what you say. I want to be sure 
I get it right” or “I talk to a good many people 
during the day, you know, and I don't want 
to get mixed up on who says what"; "I take 
notes on what people say, so I can refresh my 
memory later." 

Actually, the majority of respondents seem 
to regard it as perfectly natural that the inter- 
viewer should take notes. If the interviewer 
is skillful, the notebook can be all but forgot- 
ten by the respondent as the interview pro- 
gresses. Nearly all social scientists employing 
the interview feel that the advantages in accu- 
racy of taking notes during an interview far 
outweigh any disadvantages, so that some form 
of recording during the interview itself is almost 
universal practice. 

Field ratings are not very commonly used as 
a way of recording data, They may be prac- 
ticable when the interviewers are very thor- 
oughly familiar with the objectives of the study, 
and when the number of dimensions to be 
rated is relatively small. While little methodo- 
logical work has been done to check the useful- 
ness of field ratings, it is probable that they 
suffer from halo more than other forms of 
recording. Furthermore, it is difficult to main- 
tain sufficiently similar frames of reference from 
one interviewer to another to ensure reliability. 
The Division of Program Surveys, U. S. De- 
partment of Agriculture, conducted a study 
during World War II of the effects of meat 
shortages on consumer food habits. Two sample 
cities were chosen, one of which had objectively 
experienced severe meat shortages, while the 
other had not. Interviewers were asked to rate 
each of their respondents on the severity of the 
meat shortages the respondent had experienced. 
The mean interviewer ratings were identical 
for the two cities. When coders in the home 
office rated the interviews on severity of short- 
age, however (without knowing which city the 
interviews were from) , the mean ratings for the 
two cities were significantly different in the 
expected direction (Cannell, 1953). The inter- 
viewers, evidently, were using their own sample 
of cases as a frame of reference for "severity." 
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interviewer's write-up might be nearly identical 
with those based on a complete report. Much 
depends on what elements of the interview are 
to be used as cues for rating. If a rating is to 
be made of the respondent's anxiety level, for 
example, based upon such cues as blocking, 
over-attention to detail, evasiveness, and incon- 
sistency, a complete interview record should be 
important, for interviewers would tend to drop 
out relevant elements in their notetaking. For 
less subtle ratings, the loss through notetaking 
may not be serious, although, of course, there is 
still considerable opportunity for interviewer 
bias to affect the study via the notetaking 
process. When interviewers are to attempt 
verbatim notes, it is important to train them to 
write up their interviews immediately. As they 
write, partial notes will recall fuller remarks 
the respondent made, but there is great danger 
that the interviewer will be reminded of the 
remarks of a different respondent if he has 
taken several other interviews between the tak- 
ing and writing-up of a particular interview. 

Experience is gradually accumulating with 
mechanical recording of interviews. Many 
technical difficulties remain to be ironed out 
— to the writers’ knowledge, studies using 
mechanical recordings always suffer some loss 
of data due to inaudible recordings, but these 
losses need not be heavy. In a study by the 
Laboratory of Human Development at Har- 
vard, which yielded 382 two-hour interviews 
conducted in respondents’ homes, only 3 had 
to be discarded completely because of inaud- 
ible recording, and only 3 were partially lost. 
Technical improvements in mechanical record- 
ing devices are making their use more and 
more promising. 

Mechanical recordings have been used both 
with and without the respondent's knowledge. 
There are difficult ethical questions involved 
here. But these questions aside, one may ask 
what are the effects of the respondent's knowing 
or not knowing a record is being made. The 
recording process has been concealed by some 
researchers on the grounds that it makes re- 
spondents uncomfortable, and that one can get 
more candid expressions of opinion when the 
respondent does not know he is speaking “for 
the record." On the other hand, if the record- 
ing is concealed, and if the respondent is told 
about it afterwards in order to obtain his per- 
mission to use the record, he may be irritated 
and feel that the interviewer has taken advan- 
tage of his good will. Of course, whatever 
damage to rapport this engenders occurs after 
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the record has been obtained, but there may be 
serious public relations problems for further 
interviewing. There is reason, therefore, to tell 
the respondent and obtain his consent in ad- 
vance whenever the subject matter of the study 
permits this, even when it is possible to conceal 
the recording device. In any case, in home 
interviewing with recorders, it is not usually 
possible to conceal the recorder. It may be 
fruitful, then, to consider how the mechanical 
recording process may be presented to the re- 
spondent so as to produce the least disturbance 
to the interview. 

While little needs to be said to respondents 
about notetaking, since they usually take it 
for granted, mechanical recording equipment 
in a respondent's home is a different matter. 
It cannot be ignored, for it is usually bulky and 
often requires the respondent’s help in getting 
it properly set up. A successful method of in- 
troducing recording equipment to respondents 
is to begin by asking where it could best be 
placed — the respondent's attention is then 
focused on mechanical problems, and the inter- 
viewer behaves as though recording is to be 
assumed, instead of asking permission to make 
a recording. The interviewer tests the machine 
by playing back a sentence or two of the re- 
spondent's words, and if there are children 
present, may allow them to hear their voices 
played back. Soon the respondent's curiosity is 
satisfied, and attention may be turned to the 
interview itself. With this procedure, no one 
in the previously mentioned study of 382 fami- 
lies objected to having a recording made. 

A major problem with mechanical recording 
is the cost of having the interviews typed from 
the records. Skilled typists take 4 to 7 hours 
to type each hour of recorded interview, so 
that typing becomes at least as expensive as the 
interviewing itself. It has been suggested that 
ratings might be made directly from the re- 
cordings, so that typing would not be necessary. 
To the writers’ knowledge, no comparison has 
yet been made of this procedure with that of 
rating from a typed transcript. One might 
expect that it would be very cumbersome to 
make over-all ratings directly from recordings, 
since the rater would have to refer back to 
materials scattered throughout the interview, 
and it is difficult to find the relevant sections of 
a recording. On the other hand, rating from 
recordings would permit the rater to take ad- 
vantage of emphasis, intonation, pitch, and 
pauses as cues, which might be valuable for 
some purposes. 
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FIELD ADMINISTRATION 


Any study involving large-scale interviewing 
necessitates the recruitment, training, and su- 
pervision of an interviewing staff. The number 
of interviewers may vary from just a few in a 
small local study to several hundred in large- 
scale national sample studies. Some research 
results have been reported on the validation 
of selection, training, and supervision methods. 
Stember and Hyman (1949c) report that al- 
though inexperienced interviewers show evi- 
dence of considerable error in the coding of 
interview material, experienced interviewers 
show little error. However, no alternative meth- 
ods of selection and training were compared in 
this study. Other studies have investigated the 
effect of social class on interviewer performance 
(Katz, 1942). 

A comprehensive study of interviewer charac- 
teristics as related to performance was reported 
by Sheatsley in a series of three articles (1950, 
1951a, b). Three aspects of performance were 
rated on 116] NORC interviewers: their 
handling of open-end questions, their paper 
work, and their carrying out of sampling in- 
structions. An index based on these three sets 
of ratings was related to 19 interviewer charac- 
teristics such as experience, education, sex, 
political affiliation, etc. Although a few dif- 
ferences occurred which were significant at the 
.05 level, no differences of any important magni- 
tude occurred. Sheatsley concludes that inter- 
viewer performance cannot be reliably predicted 
on the basis of the kind of information on inter- 
viewers’ backgrounds and attitudes currently 
available. It may be of some interest to note 
that it is possible from Table 21 of the third 
Sheatsley article to discover the criteria that 
one of the outstanding survey organizations has 
used in the selection of its interviewing staff. 

Most of the knowledge about this problem 


area derives from the experience of researchers 
rather than from research. Sheatsley (1951c) 
has presented an excellent summary of such 
accumulated wisdom. He lists desirable charac- 
teristics in the selection of interviewers in terms 
of adaptability, physical health, interest, intel- 
ligence, temperament, availability, appearance, 
and manner, and the absence of such special 
disqualifying characteristics as intense partisan- 
ship, the desire to regard the interview as a 
therapeutic device, and a long background of 
undesirable experience, such as selling. With 
respect to training and supervision, Sheatsley 
points out the need for inducing strong motiva- 
tion in the staff concerning the nature and the 
importance of the study, and he stresses the need 
for careful training, including role playing and 
supervised practice interviews, and the con- 
tinuation of careful encouragement and train- 
ing while interviewing is in progress. 

One problem of supervision is that of cheat- 
ing, in which interviewers sometimes turn in 
reports without having conducted the interview 
or asked particular questions. Some systematic 
research has been reported on the incidence of 
such cheating behavior (Guest, 1947; Stewart 
and Flowerman, 1949) , and spirited discussions 
of the problem were initiated in the literature 
by Crespi (1945-46) . 

Various methods are discussed for the detec- 
tion of cheating: the use of postcards, re- 
interviews, phone calls to a subsample of 
respondents, and the use of internal interview 
checks. However, all discussants seem agreed 
that prevention is the most desirable single 
approach, and that the best steps are to ensure 
high interviewer morale through personal con- 
tact, interest in the study, high pay, high pro- 
fessional selection standards, well-designed 
questionnaires, and the like. 


INTERVIEWING CHILDREN 


A paramount problem in interviewing young 
children is that of learning the child's language 
and system of concepts, so as to permit phrasing 
questions which are meaningful to the child. 
It is extremely difficult for the adult interviewer 
to avoid assuming that the child can answer 
from an adult's frame of reference. For ex- 
ample, in interviewing adults, we frequently 
phrase questions in terms of what causes some- 
thing else to occur. But, as Piaget (1929) has 
pointed out, the young child may not yet have 
developed a concept of causality, so that if we 
ask him such a question as "What makes the 


sun move?" the question (and consequently the 
answer) will not be meaningful. A study in- 
volving the interviewing of children normally 
requires a longer period of exploratory, un- 
structured interviewing and observation before 
it is possible to phrase good standardized ques- 
tions. Piaget suggests that the research worker 
can find valuable clues to the nature of chil- 
dren's concepts in the questions which children 
ask spontaneously. 

The question is sometimes asked: What is 
the youngest age at which children may be in- 
terviewed? The answer depends, of course, on 
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the subject matter of the interview — for some 
purposes, the child can be interviewed almost 
as soon as he can talk. Radke (1946) used a list 
of 53 standardized open-end questions with 
children as young as three years old, covering 
such areas as affective relationship to parents, 
the child’s views of the parents’ techniques of 
discipline, and the child's ideas of what consti- 
tutes "good" and “bad” behavior. She found 
that although the thought processes of the 
younger children were slower, they could give 
perfectly meaningful answers. 

As in any interviewing, the interviewer who 
works with children must devote considerable 
attention to getting good rapport. The child 
may be more reluctant than adults to express 
his own thoughts and feelings because (1) he 
assumes that everything he knows is already 
known by adults, and (2) he is afraid of saying 
something wrong or foolish (Piaget, 1929). 
Probably one of the reasons it is so helpful to 
play with a child before starting to ask him 
questions, and to put the questioning in the 
context of a game, is that these procedures help 
to free the child from the fear that his answers 
are to be judged right or wrong. 

A further important element of good rapport 
with children is the proper timing of the intro- 
duction of the more formal parts of the research 
proces. When he is first brought into the re- 
search situation, the child will want to explore 
and satisfy his curiosity about the room and 
the objects in it. He must be given time to 
begin to feel at home, but must not have time 
enough to become bored. A good way to de- 
termine when to begin a structured interview is 
to allow the child to give the signal himself: 
"We want to play a game together in a minute. 
Will you tell me when you're ready to begin?" 

The fact that the child is unsophisticated 
about the meaning of his answers to the adult 
interviewer means that he will not try to cover 
up or disguise certain kinds of answers as much 
as an adult will. For example, a child will 
usually tell freely and openly about his dreams, 
whereas many adults have learned that adverse 
personality judgments may be made on the 
basis of their dreams and will be careful about 
telling them, Furthermore, a child will be less 
troubled by the desire to be consistent, and 
will usually express inconsistent beliefs without 
embarrassment. This is not to say that young 
children are free of ego defenses in an interview. 
If they are being asked about areas which 
are anxiety-producing (for example, their own 
"naughty" behavior), defensive reactions may 
emerge, such as leaving the field ("I don't want 
to play any more") , failing to understand the 
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question, changing the subject, nonsense an- 
swers, etc. To prevent these reactions, the 
interviewer must be as permissive as possible, 
making it clear that he will not disapprove of 
the child or punish him for anything he may 
say, and must phrase the questions carefully. 
One interviewer found, for example, that he 
could not ask children “When you have done 
something you shouldn't, do you tell your 
mommy or do you hide it from her?" Instead 
he had to use the more euphemistic expression: 
“Do you tell your mommy or do you keep it a 
secret?” Keeping something a secret was evi- 
dently more acceptable to five-year-olds than 
hiding something from their mothers. If a 
child shows any signs of disturbance over a 
question, the interviewer often finds it useful 
to interrupt the interview for a period of play 
or refreshments, until the atmosphere is light- 
ened. 

Piaget feels that a major problem in inter- 
viewing children is their suggestibility. They 
are accustomed to taking cues from adults when 
they are unsure about what they should say or 
do, and they are therefore quick to agree with 
the interviewer if he says “I imagine you feel 
(think) thus and so, don’t you?” The usual 
rules about nondirective phrasing of questions 
are, then, even more important for child inter- 
views than adult interviews. As a way of check- 
ing on whether a child’s statement is merely a 
response to a suggestion from the interviewer, 
Piaget suggests that the research worker should 
use counter-suggestion, and ask about the same 
subject in different ways at different times. The 
suggested belief should disappear rapidly. 

A further problem Piaget mentions in con- 
nection with interviewing young children is 
what he calls “perseveration.” If the inter- 
viewer asks in order whether the sun, wind, 
clouds, rain, etc., are alive, the child is likely to 
say “yes” to the entire series if he says “yes” to 
the first one. This same tendency to complete 
series is seen in doll play, where the child seems 
to feel that every member of the doll family 
must be put through the same act sequence 
before a new act can be introduced. The inter- 
viewer can guard against this excessive stereo- 
typy by breaking the sequence of questions and 
coming back to a topic after a new set has 
intervened. 

The young child is, of course, not very skilled 
in the use of words, and he cannot continue 
long with a situation which takes an exclu- 
sively verbal form any more than he can carry 
on a sustained conversation, It has been found 
very helpful to combine interviewing with play 
materials, so that the child may act out some 
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of his answers as well as verbalize them. The 
doll-play interview (Levy, 1933; Conn, 1938; 
Ammons and Ammons, 1952) has proved very 
useful. In this technique, the dolls are asked 
questions, and the child answers for the dolls. 
The child may have the doll talk, or act, or 
both. Extremely high observation and scoring 
reliabilities are reported for this technique 
(Ammons), as well as good session-to-session 


INTERVIEWING IN 


The experiences of field workers in other 
societies are of especial interest because they 
highlight some of the methodological problems 
the social scientist encounters (but sometimes 
overlooks) when he attempts to work with 
people from different ethnic backgrounds or 
different socioeconomic classes within his own 
culture. Furthermore, as social psychologists 
become increasingly interested in the fields of 
“culture and personality” and “national charac- 
ter,” and in cross-cultural tests of psychological 
hypotheses, they must concern themselves with 
the problems of gathering data in societies other 
than their own. 

Social anthropology is rich in experience with 
the interview as a tool for work in other socie- 
ties. Benjamin Paul (1953) has provided an 
excellent summary of the literature on anthro- 
pological interviewing, and offers valuable 
insight into the special problem of role taking. 
In comparing social-psychological with anthro- 
pological technique, it is important to note 
that anthropologists (especially those working 
in nonliterate societies) have worked primarily 
with a small number of informants in the so- 
cieties they have studied; they have seldom 
attempted to do large-scale interviewing with a 
cross section of respondents, and have tended 
not to be concerned with standardizing measur- 
ing instruments. The interviews with inform- 
ants are generally very long, taking many 
sessions, and interview technique varies from 
very free, nondirective interviewing to inter- 
viewing in which the ethnographer takes an 
active part in guiding the informant along 
chosen topical paths. 

One of the first major problems encountered 
by the field worker in another society is how 
to present himself to that society. The role of 
scientist — someone who is trying to find out 
things simply to accumulate knowledge — is 
familiar to Western societies and is generally 
regarded with sympathy, but this role is com- 
pletely unintelligible to some peoples. Espe- 
cially where most people are not literate, the 
spectacle of someone with a notebook and a 
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reliabilities, Ammons and Ammons found that 
children were willing and able to answer the 
experimenter's questions more often when the 
questions were phrased in terms of what the 
doll would do in a given situation, rather than 
what the doll would say — a further indication 
of the limitations upon the child's ability to 
deal with exclusively verbal material. 


OTHER SOCIETIES 


typewriter asking questions and making notes 
can be frightening. It is of the greatest im- 
portance to be introduced by someone who 
is trusted by potential informants, and to at- 
tempt to make one’s purpose (or part of it) 
clear and acceptable. The field worker is some- 
times helped by the fact that he is a foreigner, 
and strange behavior may be tolerated in him 
when it would not be in an in-group member. 
Essentially, the field worker must be accepted 
in some role before he can begin interviewing, 
and gaining acceptance may be a slow, arduous 
process. In a village in Afghanistan, for ex- 
ample, a field worker had to wrestle publicly 
with the village wrestling champion and take 
high hurdles on horseback before he was ac- 
cepted: 

A second major problem is that of language. 
Some field work has been done exclusively 
through interpreters. In such work, of course, 
the information received is filtered through the 
personality and status of the interpreter. How 
important this can be is apparent, Kluckhohn 
(1945) points out, when the field worker is 
forced to change interpreters in the midst of 
his work with a particular informant. After 
the change, a different sort of material is likely 
to be forthcoming, either because the new in- 
terpreter censors different things, or because of 
the informant’s tendency to put things in a 
different light before this new person. When 
the ethnographer wishes to study topics which 
are surrounded by taboos and overlaid with 
anxiety, it is especially dangerous to obtain all 
his information through an interpreter; any 
advantage that the ethnographer might have in 
receiving confidences because of his being a 
stranger is lost. 

Field workers have often worked with a “con- 
tact" language — a language which is secondary 
both to the field worker and to the informant, 
but which permits direct conversation. Pidgin 
English has often been used for this purpose. 
As Margaret Mead (1939, pp. 193-194) cogently 
points out, contact languages have generally 
grown up in the context of special kinds of re- 
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lationships between groups: the master-servant 
relationship, the conqueror-conquered relation- 
ship, etc. She says: “These contact languages 
often have a complete ethos associated with 
them, and are regarded by one or both of the 
cultures in contact as instruments of hypocrisy, 
of wheedling, ingratiating, or bullying, and a 
great many values besides those specifically lin- 
guistic may be jeopardized (by their use.)” It 
is evident that field work benefits enormously 
if the field worker can learn the native language 
of the group he is studying, although to do so 
involves an enormous burden of time and ef- 
fort. 

There are several advantages to beginning 
field work with a period devoted to learning 
the language. It is easy for the people in the 
society to understand what the field worker is 
doing when he is learning a language, and they 
are usually glad to help. Furthermore, the field 
worker learns not only the denotative value of 
words, but a great deal about the way in which 
the people think — the way they categorize 
phenomena. Such knowledge is indispensable 
when it comes to phrasing questions in such a 
way that they will be meaningful to the re- 
spondents. He may find, for example, that 
there are few words referring to time periods, 
and from this observation he may be led to the 
discovery that the society has a different time 
orientation than our own, so that questions 
phrased in terms such as "How long ago was it 
when . . .” would be difficult for the respond- 
ent to answer. 

In fact, before a field worker can do intelli- 
gent interviewing on particular topics, he must 
steep himself in the culture to the fullest pos- 
sible extent. In interviewing in our own society, 
many shared customs are implicit in the ques- 
tions we ask (we take for granted, for example, 
the fact that every seventh day is a holiday, or 
that the group of people sharing a dwelling is 
likely to be made up of a husband, wife, and 
children). The field worker in another society 
must start very far back — not only will he be 
ignorant of the shared customs and beliefs that 
everyone takes for granted, but he won't know 
how to ask about them. He must therefore 
spend a much larger proportion of his time in 
the exploratory and "scouting" phases of his 
research than would be necessary to study a 
specific topic in his own society. There is a 
certain dilemma here: even the exploratory 
phase will involve interviewing, and how shall 
questions be phrased? The best solution seems 
to be that they should be as open as possible. 
They can be phrased in the form: “Tell me 
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about . . ." and should be followed up with 
nondirective probes. One pitfall of the begin- 
ning interviewer in another society is that he 
will want to appear intelligent and under- 
standing, so as not to invite the contempt of 
the people with whom he is working. He is 
therefore likely to nod his head and say “Oh 
yes, I understand" when he doesn't understand 
or understands only in a completely alien frame 
of reference. He must learn to say easily: "I 
don't quite understand that. Would you tell 
me more about it?" 

Some field workers have used pictures to 
great advantage in the initial, most open stages 
of their interviewing. They take a picture of 
a village scene, and then ask a number of dif- 
ferent people what the people in the picture 
are doing, what they are about to do, how they 
feel, etc. This technique helps the interviewer 
to get the "feel" of the society with very little 
directive questioning on his part. 

Perhaps the most common fault of the novice 
interviewer, in this as in so many other settings, 
is to put words into the mouth of the respond- 
ent. For example, in interviewing about court- 
ship patterns, one novice interviewer was told 
that a young man had told his father he was 
interested in a particular girl. The interviewer 
said: "And then your father went to see her 
father, is that right?" This kind of interview- 
ing is particularly dangerous in other societies, 
where the interviewer's guesses about what the 
respondent has in mind are likely to be inaccu- 
rate, and where the respondent may be accus- 
tomed to showing deference toward people like 
the interviewer, so that he feels constrained to 
agree with what the interviewer says. 

As in any interviewing, the field worker in 
another society must first obtain the interest 
and cooperation of his respondents. To do so 
he needs to know a good deal about what 
motivates people in the society. Some sort of 
economic reward is often helpful in enlisting 
cooperation (Kluckhohn, 1945). The feeling 
of being misunderstood in one's own group and 
the desire to talk to someone who will be 
sympathetic is often sufficient motivation for 
cooperation, but this motive must be tapped 
with care, for it is likely to draw deviants, and 
the field worker is usually more interested in 
the individual who is in some sense "typical" of 
the people in the society. Sometimes an in- 
formant will be motivated by pride in his own 
group. For example, Indian mothers became in- 
terested in being interviewed about their child- 
rearing methods when the field worker asked 
them how they succeeded in training their 


THE INTERVIEW: 


children to be so quiet and well-behaved — 
pointing out the fact that white children cry 
much more (Chasdi, 1952) . 

Good rapport requires, of course, that the 
field worker conduct himself according to the 
rules of politeness prescribed by the society he 
is studying. These rules may be very different 
from those under which he has been brought 
up. For example, among the Navaho, it is im- 
polite to look someone in the eye — the polite 
interviewer lowers his eyes when introduced, 
and then sits beside his respondent, and both 
gaze off into the distance while talking. Or, for 
another example, in some societies it is im- 
polite, while talking to one member of a society, 
to refer to another member by name. 

During his exploratory work, the interviewer 
usually needs to discover something about what 
topics are taboo in the society. To obtain such 
information, he will find it advantageous to 
begin his interviewing with a particular re- 
spondent with topics which are emotionally 
neutral, and turn gradually to the more touchy 
subjects as rapport develops. In general, lan- 
guage, genealogy, and material culture are good 
topics to start with. Such topics as religion, 
status relationships, and attitudes toward sex 
and aggression should come later. 

As in any interviewing, the ethnographer 
must watch for signs of anxiety in the respond- 
ent, and adapt himself accordingly. In other 
societies the signs of anxiety may not be the 
same as they are in his own; for example, in 
stead of becoming restless and fidgety under 
anxiety, the respondent may become stonily 
impassive, and the interviewer must learn to 
recognize the meaning of this behavior. 

The mounting interest in cross-cultural stud- 
ies raises the question of the feasibility of 
greater standardization in field work methods 
from one society to another. Also, whenever 
the scientist is interested in comparing in- 
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dividuals within a society (in studying in- 
dividual differences), he must consider the 
advantages of standardizing his interviewing 
techniques from one respondent to another. 
Lists of standardized questions with large num- 
bers of respondents have been used within a 
nonliterate society (F. Kluckhohn, et al., 1953), 
and the authors’ own experience in conducting 
interview studies in two societies other than 
our own (both literate and both Westernized, 
however) suggest that it is entirely feasible to 
standardize questions and obtain comparable 
information from several societies. Of course, 
the problem of defining underlying variables 
which will be valid cross-culturally is very 
complex. For example, suppose one wished to 
compare the amount of aggression shown in a 
number of societies, or compare the severity 
with which children are punished for aggres- 
sion. In one society, aggression is shown by 
making a doll in the image of the enemy and 
destroying it; in another, the aggressor may 
give the enemy a white feather; in another, he 
may spit at his feet; in another, he may “damn 
with faint praise.” How to decide what will 
be included as aggression for purposes of cross- 
cultural comparison? For other kinds of prob- 
lems, getting cross-cultural comparability of 
variables is not nearly so difficult, and simple 
translation of questions from one language to 
another may be in order. When the research 
worker wishes to translate questions into a 
language which is unfamiliar to him, inci- 
dentally, one technique which has been used 
successfully is to have the questions translated 
by one bilingual person, and then have them 
translated back into the original language by 
a second bilingual person. The differences be- 
tween the research worker's original form and 
the retranslated version will reveal to him any 
serious alterations in meaning which grew out 
of the translation process. 


SOURCES OF INTERVIEWER ERROR 


Of all the methodological problems involved 
in interviewing, the one which has been most 
fully studied is the problem of interviewer 
error (bias). It has been customary to use the 
term interviewer bias for many of the kinds of 
errors which will be examined in this section. 
Properly speaking, however, the word bias re- 
fers only to systematic or nonrandom errors. 
Since many of the sources of interviewer effects 
on interview data do not tend to shift the data 
in a particular direction but rather are more 
or less randomly distributed, we prefer to use 
the more inclusive term interviewer error. No 


attempt can be made here to list all the studies 
on interviewer error. Before we attempt to 
summarize some of the findings in this area, a 
brief discussion of the meaning of the term 
may be in order. 

Interest in the problem grew out of the ob- 
servation that the set of answers reported by 
one interviewer would differ from the set re- 
ported by another interviewer, even when the 
samples were comparable. Furthermore, it ap- 
peared that some interviewers obtained answers 
which were similar to their own views, and the 
suspicion naturally arose that the interviewer 
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had somehow influenced the respondent. In a 
study where absolute levels of response are im- 
portant (how many people expect to vote for a 
certain candidate, how many own cars, etc.) , it 
would obviously be damaging to the study if all 
the interviewers were influencing their re- 
spondents in the same direction. Even if the 
influence of various interviewers counterbal- 
anced each other, error would be introduced 
into the measurement process, and intercorre- 
lation among variables would be seriously 
affected. . 

How to measure interviewer error? The 
simplest approach would be to obtain some 
independent measure against which to check 
the interviewer's data. Thus, interviewer re- 
ports of whether a set of respondents voted 
could be checked against poll records. But dis- 
crepancies between outside records and inter- 
view reports might be due to poor question 
wording, or poor memory on the part of the 
respondents, rather than to any error by the 
interviewer. Furthermore, when a study is con- 
cerned with attitudes, there is no factual base 
line against which to compare the respondent's 
answers. Suppose, for example, that a respond- 
ent expresses more liberal attitudes on race to 
a Negro interviewer than to a white interviewer. 
Which interviewer is biasing the respondent? 
Which statement represents his "true" attitude? 
Clearly, it is possible that the respondent actu- 
ally has different attitudes in the two different 
social contexts — that his attitudes are situa- 
tionally determined. 'The interview must set 
some social situation for the expression of atti- 
tudes, and the attitudes expressed will neces- 
sarily be a function of that situation. 'The 
problem of measurement arises when the inter- 
viewers vary in their appearance, their manner, 
their words, so that the measurement situation 
is not comparable from one case to another. 

The amount of interviewer error, as currently 
conceived, means the amount of variance in 
data which may be attributed to interviewers 
rather than respondents. It is studied by com- 
paring sets of answers which different inter- 
viewers obtain from similar samples, to see 
whether the distributions show more difference 
than would be expected by chance. 

Six possible sources of error in interviewer 
performance and representative research con- 
cerning these will be discussed: (1) the ap- 
pearance and manner of the interviewer, (2) 
the way in which interviewers word or deliver 
questions, ($) the interviewer's own attitudes, 
(4) the interviewer's expectations of the re- 
spondent's attitudes, (5) interviewer variations 
in the fullness of probing, and (6) interviewer 


variations in the recording of respondents’ 
answers. 

(1) Appearance and manner of interviewer. 
Katz's (1942) study showed important dif- 
ferences in the responses of persons interviewed 
by working class and white collar interviewers. 
In a study conducted by OPOR in 1942 (Can- 
tril, 1944) , Negro respondents were questioned 
on their attitudes towards German or Japanese 
treatment of American Negroes in the event of 
their winning the war. Negro interviewers ob- 
tained a far higher proportion of pro-Axis 
sentiments than did white ones. Sampling dif- 
ferences, namely, the particular respondents 
selected by white and Negro interviewers, could 
account for these differences. However, a Re- 
search Branch Study (Stouffer, et al., 1950, pp. 
718-721) with a similar design in which it is less 
likely that respondent selection could have 
caused the differences reports comparable re- 
sults. It is possible that other factors could 
account for the obtained differences. Such fac- 
tors as cheating or failure to probe, particularly 
if such factors are correlated with class mem- 
bership or race, might be partly responsible. 
However, the results are consistent with the 
hypothesis that the appearance and manner of 
the interviewer are factors biasing interview 
data. Most survey researchers feel that the 
manner and appearance of interviewers is im- 
portant, and they often attempt to achieve 
inconspicuous and relatively uniform dress 
among their interviewers, in spite of the fact 
that not much definitive research has been done 
on this problem. 

(2) Question wording by interviewers. In 
previous sections of this chapter, the effects of 
different ways of wording questions both in 
standardized and in unstandardized interview- 
ing have been discussed. In unstandardized 
interviews, the interviewer has great latitude in 
question wording, and systematic differences in 
the data due to interviewer selection of ques- 
tion wording are clearly possible. Even in 
standardized interviewing this factor can be 
important, This is particularly possible when 
probing is encouraged or permitted, either with 
open-end or closed-end questioning. In studies 
reported by Guest. (1947) and by Stewart and 
Flowerman (1949), in which interviews were 
recorded without the knowledge of the inter- 
viewers, considerable differences among inter- 
viewers occurred. In Guest's study, both 
closed- and open-end questions were used. The 
closed-end questions theoretically should pro- 
vide no opportunity for interviewers to manipu- 
late question wording. Nevertheless, he found 
word changes of both “major” and “minor” 
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sorts among his interviews. Furthermore, in 
the use of probes designed to obtain specific 
or complete answers to his open-end questions, 
he found many major differences among inter- 
viewers. 

Stewart and Flowerman found many instances 
where questions were not asked at all, although 
answers were recorded even by interviewers who 
were highly conscientious in seeking out their 
assigned respondents. Both studies reported 
that the usual quality control procedures in 
examining the interviewer's written records 
failed to reveal the nature and frequency of 
such practices. The latter investigators devel- 
oped an office procedure they call the "displace- 
ment measure” which correlated highly with the 
errors revealed on the interview recordings. 
It has been shown that a scaling procedure such 
as the Guttman scale can also reveal the pres- 
ence of such interviewer cheating (Elinson and 
Cisin, 1948) . 

(3) Effects of interviewers’ own attitudes. 
Public opinion researchers have evinced con- 
siderable interest in the problem of the possible 
biasing effects of the opinions the interviewers 
themselves have in the attitude areas being 
studied. Does a Republican interviewer tend 
to obtain more or fewer Republican opinions 
from his respondents than a Democratic one? 
Does an internationalist-minded interviewer get 
more or fewer internationalist attitudes from 
respondents than does an interviewer who is 
more isolationist or nationally minded? Typi- 
cal of the studies in this area is one reported in 
Cantril (1944). The study was a pre-election 
poll being made prior to the 1940 presidential 
elections. Each interviewer was asked to fill 
out the same ballot as the respondents were 
given in a national sample poll. The inter- 
viewers then returned this ballot with their 
completed interviews. Then the distributions 
of the interviews taken by interviewers who 
had a given attitude were compared with those 
obtained by interviewers with opposite atti- 
tudes. Interviewers who favored helping Eng- 
land obtained one and one-half times as many 
respondents favoring helping England com- 
pared with those favoring keeping out of war 
at all costs. On the other hand, interviewers 
who felt that we should keep out of war almost 
reversed this figure for their sample. Similar 
differences occurred on presidential choices. 
Since quota sampling methods were employed 
in both these studies, it is not possible to dei 
termine to what extent respondent selection is 
responsible for these differences and to what 
extent, if any, interviewers were affecting the 
expressed attitudes of their respondents. 
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It is also interesting to note that for inter- 
viewers working in large cities, no such biasing 
effects occurred. The author suggests that it 
may be more difficult for interviewers to select 
in advance, either consciously or unconsciously, 
respondents whose attitudes will correspond to 
his own in large communities, where fewer 
people are known to him. 

Katz’s study, reported above, also throws con- 
siderable light on this problem. He found 
that working-class interviewers tended more 
often to select middle-class respondents than did 
middle-class interviewers. However, the work- 
ing-class interviewers tended to elicit more 
pro-working-class attitudes from middle-class 
respondents. Even in this study, however, it is 
possible that working-class interviewers tended 
to select middle-class respondents whose views 
were more like their own than was the case for 
middle-class interviewers. Therefore, there is 
no clear demonstration of effects of interviewers 
on the expressed attitudes of the respondents. 

It remained for the comprehensive study of 
Feldman, Hyman, and Hart (1951-52) to sepa- 
rate out the relative contributions of respond- 
ent selection and interviewer effects on 
respondents’ reports. In their experiment, nine 
interviewers were assigned to each of five seg- 
ments of Denver. Respondents were randomly 
preselected for each interviewer so that samples 
were comparable from interviewer to inter- 
viewer within each sector, thus controlling the 
possible biasing effects of respondent selection. 
A predesigned schedule and uniform instruc- 
tions and supervision were employed. Whereas 
previous studies had shown biasing effects when 
very few interviewers were being compared 
(Rice, 1929; Shapiro and Eberhart, 1947), this 
study, providing five replications with nine 
interviewers each, found relatively little bias. 
Significant differences were found among in- 
dividual interviewers on but seven of the 21 
opinion closed-end questions employed and on 
but one out of seven factual questions. Some 
rather striking differences did occur, however, 
on two kinds of items: field ratings or overall 
codes of respondents by interviewers showed 
wide differences, as did the type and quantity 
of information obtained in open-end questions. 
These latter differences were not found to be 
systematically related to the interviewer's own 
attitudes, but rather to his expectations of the 
respondents' attitudes. Further evidence on this 
point is presented in (4) below. 

It should be noted that, in this experiment, 
regular experienced interviewers were em- 
ployed in much the same way they would be in 
a typical professionally conducted opinion- 
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attitude survey. These studies indicate that 
when interviewers are free to select their 
respondents, as in quota sampling, very con- 
siderable differences can occur in the distribu- 
tions of responses of allegedly comparable 
samples of respondents. However, few such dif- 
ferences have been found to occur due to the 
influence of experienced, well-trained inter- 
viewers on the respondents’ expressed attitudes. 

(4) Effects of interviewers’ expectations. Two 
published studies (Smith and Hyman, 1950; 
Wyatt and Campbell, 1950) suggest that inter- 
viewers’ expectations of the respondents’ atti- 
tudes have a greater influence on how the 
interviewer will perceive and record the opinion 
of the respondent than the interviewer's own 
attitude. Smith and Hyman used the device of 
a recorded interview in which a "planted" re- 
spondent evinced an attitude not generally 
correlated with the other attitudes he revealed. 
For instance, the respondent would generally 
take a strong internationalist position on for- 
eign affairs, and then suddenly, in answer to a 
particular question, take an isolationist posi- 
tion. Interviewers were also polled on their 
own attitudes. When the respondent took an 
unexpectedly isolationist position, no dif- 
ferences occurred between “isolationist” and 
“interventionist” interviewers. Only one-fifth 
of each group, however, coded the inconsistent 
isolationist response correctly. When the re- 
spondent took an expected interventionist posi- 
tion, many more of both groups of interviewers 
were correct: 78% of the interventionist inter- 
viewers and 61% of the isolationist interview- 
ers. Wyatt and Campbell found no significant 
biasing effects due to either source, but the 
tendency towards bias was more pronounced 
due to interviewers’ predictions of population 
distributions than due to the interviewers’ own 
opinions. 

In a recent and as yet unpublished study, in 
which highly trained interviewers and a care- 
fully designed questionnaire were used, Cannell 
(1953) found no biasing effects of interviewers’ 
expectations. Apparently both sources of bias, 
interviewer attitudes and expectations, can be 
reduced to a minimum under appropriate con- 
ditions. e 

(5) Interviewer variations in probing. In 
open-end interviewing, whether the interviewer 
attempts to record the complete answers of his 
respondents or codes them directly into pre- 
coded categories while the interview is in prog- 
ress, variations can occur in the extent to which 
he will continue to question a respondent if the 
original question is not adequately answered. 
The extent to which a respondent is pressed to 
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state an opinion may differ from interviewer to 
interviewer and even from respondent to re- 
spondent with the same interviewer. Thus, in 
the excellent large-scale study of interviewer 
effects on survey data reported above (Feldman, 
Hyman, and Hart, 1951-52), wide differences 
occurred in the numbers of answers obtained 
by different interviewers assigned comparable 
samples. Some interviewers evidently probe 
considerably more than others and thus obtain 
considerably more information. Shapiro and 
Eberhart (1947) also obtained similar results. 

(6) Interviewer variations in recording. The 
problems involved in interviewer recording of 
respondents’ answers were discussed in some 
detail in the preceding section on recording 
interviews. There are several studies relevant 
to this problem, In an investigation by Guest 
(1947), a wire recorder was concealed in a re- 
spondent’s home. Data are reported on 15 
college student interviewers who interviewed 
this one respondent. Questions of both the 
open-end and multiple-choice type were used. 
Some of these were informational and others 
attitudinal. The subject matter dealt with atti- 
tudes towards psychologists. There were thirty- 
two errors of commission involving recording 
something either not said or said differently, 
and sixty errors of omission in not recording 
something said. In addition, 41 cases of not 
differentiating probes from regular questions 
in the write-up were discovered. Unfortunately, 
the proportion of these errors to total possible 
errors is not reported, and it would also be of 
interest to know just what changes in tables 
of results these errors would cause. In any case, 
the frequency and character of the recording 
errors reported suggest that the problem of in- 
terviewer recording bias may well be a serious 
one. 

A. second experiment on this problem is re- 
ported by Stewart and Flowerman (1949). The 
authors obtained full wire recordings of 33 in- 
terviews involving 15 interviewers. These 
interviewers were trained on a fifty-item ques- 
tionnaire dealing with attitudes towards minor- 
ity groups. The $3 interviews recorded were 
with different respondents who were trained to 
give the interviewers as difficult a time as pos- 
sible. As in the previous study, considerable 
error occurred in the way interviewers recorded 
answers and, also as in the previous study, the 
ysual methods of interviewer evaluation failed 
to predict which interviewers made the most 
errors. 

Both these studies, together with the con- 
siderations listed in the previous section, sug- 
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gest that interviewer recording error is indeed 
a serious problem in the use of interview data. 

The work on interviewer error to date points 
up the seriousness of this problem in the use 
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of the interview as a research tool. Some of the 
research cited above is particularly helpful in 
that it points to ways of overcoming interviewer 
error. 


VaLiprrY OF INTERVIEW DATA 


As in the case of any measure, it is essential 
to determine the extent to which the interview 
constitutes a valid measuring instrument. The 
question of the validity of "the interview" in 
general is too broad. Validity depends upon the 
nature of the particular interview technique, 
the subject matter of the study, and the circum- 
stances in which the interview occurs But 
research workers using the interview must face 
the question of whether answers in an interview 
have any relation to actual behavior, the under- 
lying attitudes which govern behavior, or the 
factual events concerning which they are asked 
to report. We know from studies on the re- 
liability of witnesses that many people are very 
poor reporters and will misreport even if they 
are asked about recent events. Furthermore, 
with the passage of time, omissions and distor- 
tions become much more pronounced. And 
we know that respondents will want to put 
themselves in a favorable light before the inter- 
viewer, and may therefore distort or conceal 
(either consciously or unconsciously) their atti- 
tudes and behavior. If these things are true, 
how can we rely upon an interview to give us 
valid data? It is not enough that those who use 
interviewing as a method usually have a strong 
conviction that their respondents are giving 
them meaningful answers — the answers "ring 
true," they are colorful, and it seems impossible 
that respondents could invent out of whole 
cloth the experiences and feelings they report. 
There should be controlled empirical evidence. 
McNemar (1946) pointed out that few validity 
studies had been reported. What is the current 
evidence concerning interview validity? 

"There are several ways of stating the validity 
problem. In attitude measurement, the prob- 
lem is often described in terms of what predic- 
tions can be made about how a respondent will 
behave, say in the voting booth, or in the 
presence of his immediate supervisor, or to- 
wards his child, on the basis of information 
obtained from interviewing him. Another way 
of stating the validity problem is perhaps of 
no lesser importance. That is, to what extent 
are interview data measures of what they pur 
port to measure? An attitude, for example, 
towards continuing economic aid to Western 
Europe need have no particular behavioral ex- 
pression other than a verbal one. The question 


here is not so much “What acts do interview 
data on such an attitude predict?" as “To what 
extent do such data accurately reflect a re- 
spondent’s underlying attitude on this issue?" 

Before proceeding to the discussion of va- 
rieties of validity studies, it might be well to 
point out the distinction between group valida- 
tion and individual validation as applied to 
interview studies. In group validation, predic- 
tions from interview data are made as to how 
a population will behave. For example, in a 
pre-election poll, predictions may be made 
concerning what proportions of the electorate 
will vote for each candidate. An interesting 
example of a method of group validation is 
provided by the Survey Research Center in its 
studies of consumer finances. The total in- 
come, government bond holdings or bank 
deposits reported by a crosssection national 
sample of spending units, is multiplied by the 
reciprocal of the sampling ratio. (That is, if 
every 10,000th spending unit is interviewed, 
the total income of the respondents in the 
sample would be multiplied by 10,000.) The 
resulting figures should represent the total na- 
tional income of spending units, or the total 
individual holdings in Government bonds or 
bank deposits. The aggregate figures thus de- 
rived are compared with Federal Reserve Board 
estimates of these quantities based on their 
reports from banks and other financial sources. 
Thus, even though no outside data are available 
on the distribution of incomes or savings, out- 
side aggregates may be used to check the valid- 
ity of a survey's over-all findings. 

In individual validations, a given individual's 
behavior may be predicted on the basis of inter- 
view data, and his subsequent behavior can be 
observed and compared with the prediction. 
For example, in the Erie and Elmira county 
panel studies of the determinants of voting 
behavior, the same individual is followed 
through a campaign and his actual voting be- 
havior is determined by interview after the 
election. Similarly, in a Survey Research Center 
economic behavior study, check-ups were made 
the following year to determine how many of 
the individuals who reported plans to purchase 
durable items during the previous interview 
actually did so. 

Either individual or group comparisons may 
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be made in any of the four types of validation 
procedures discussed below. Four different 
varieties of validity data have been collected, 
bearing upon the above statements of the valid- 
ity problem. 

(1) Comparison of respondents’ factual re- 
ports with outside reports or objective evidence 
of the fact under consideration. Respondents 
are asked, “Did you vote in 1948?” “What 
magazines do you take?” “Do you own a car?" 
“What was your income last year?" "How often 
do you spank your child?" etc., and the answers 
are checked against other evidence. 

Hyman (1944-45) reports three such studies 
based on wartime interviewing. A list of 243 
respondents who had redeemed U. S. War Sav- 
ings Bonds were interviewed, and 42 (17%) of 
these denied having redeemed bonds. Of the 
grocers who, when interviewed, reported having 
a certain government poster on display, 42% 
did not actually have it on display, and 4% of 
workers who had been absent from work said 
they had not been absent. (The 4% figure 
was obtained on open-end interviews. When a 
closed-end question was used, the distortion was 
27%.) These studies suggest some of the most 
valuable ways of obtaining valid criterion in- 
formation. In all of the situations above, the 
distortion was in the direction of social ap- 
proval. In the absentee situation, the open-end 
question was least objectionably worded from 
this standpoint. 

A recent study reported by Parry and Crossley 
(1950) presents considerable data on the valid- 
ity of interview data on eight classes of factual 
information. It is interesting to note in this 
study that certain information is subject to far 
less response error than other information. 
When the respondent is asked for factual infor- 
mation that he is likely to know about in pre- 
cise detail, and when the correct reporting of 
this information is not likely to subject him to 
feelings of guilt, he is apparently able to report 
accurately. For example, all but 3% of those 
from whom the information was obtained cor- 
rectly reported whether or not they owned a 
car, 96% reported accurately that they either 
did or did not own a house, 9895 that they 
did or did not have a telephone. When, how- 
ever, the respondent is asked for information 
in which memory errors are likely to occur or 
in which guilt feelings are likely to arise when 
answering in a certain way, response error does 
tend to occur. For example, the more recent 
the presidential election, the more likely the 
respondent is to report accurately about his 
voting. In 1949, 86% reported correctly 
whether or not they had voted in 1948; this 
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figure dropped to 73% for the 1944 elections; 
and in both cases the error is predominantly 
in the direction of voting reported by those 
who did not actually vote. On community chest 
contributions, where social pressure to contrib- 
ute is strong and where the subjects are not 
likely to be aware of any possibility of verifying 
their answers, 3496 reported giving who did not 
give. The corresponding figure for those who 
actually contributed but reported not doing so 
is unfortunately not given. Even if allowance 
is made for some error in the criterion listings, 
the response error is no doubt substantial. It 
is at least clear from this study that the correct 
reporting by interview of factual information 
cannot always be safely assumed, and that cer- 
tain kinds of information appear subject to 
considerable response error. 

Kinsey, Pomeroy, and Martin (1948) have 
devoted considerable attention to the validity 
problem in their well-known studies of human 
sexual behavior. One of their methods is to 
compare the data obtained from husbands with 
that from their wives. They found that inci- 
dence of certain behavior patterns was more 
often independently agreed upon by both mem- 
bers of a marriage than was the frequency of 
such behavior. Kinsey and colleagues also found 
that except for the aged, recent events were re- 
called more easily than remote ones. They also 
compared interview data (obtained from prison 
inmates) on homosexual incidents with an in- 
dependent estimate by an observer of the num- 
ber and identity of men believed to be active 
homosexuals, and found about 85% agreement. 

The individual validation of questionnaire 
data against independent self-observation of 
behavior has been done by Goodenough (1931). 
She asked parents to check from a list of meth- 
ods for controlling anger in children those 
which they employed. Following this, she had 
them keep a diary on what they did each time 
the child got angry. Little correlation was 
found between the sets of data. It may well be 
the case, as is so often true in our society, that 
notions of ideal behavior were being checked 
and that actual practice fell considerably short 
of these ideals. 

Radke (1946) also attempted to validate 
parental reports of child-rearing behavior by 
obtaining independent measures from each of 
the two parents and also from parents and 
children of the same families. The correlation 
‘between parents was .64 and for parent and 
child, 42. The parents represented themselves 
to be warmer, less punitive, and more consistent 
than did their children. 

(2) Prediction of future behavior on the 
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basis of respondents’ statements of their plans 
and expectations. By far the most extensive 
source of such data is in the area of pre-election 
forecasting. There are, however, a number of 
reasons why these are not completely satisfactory 
as validity checks. 

Sampling problems, for example, are espe- 
cially complex in election forecasting. In order 
to interview a representative sample of voters, 
the pollster must be able to specify, first of all, 
the population who will actually vote on elec- 
tion day. Secondly, the campaign does not 
cease from the time interviewing is begun until 
election day, and presumably many people do 
change their minds in the course of at least 
some campaigns. In other words, even if pre- 
election polls were completely valid measures 
of the voting intentions of the population 
sampled, it would not necessarily follow that 
the group predictions based on these data would 
be accurate for election day (SSRC Comm., 
1948). When these and other factors, such as 
the undecided vote, are taken into account, 
the remarkable fact usually is how closely pre- 
election poll data do predict aggregate percent- 
age votes. In addition to group validation 
procedures available through election returns, 
various additional checks are available through 
comparing the predicted behavior of subgroups 
such as the working class or the upper class 
with the election statistics of precincts or wards 
which are known to be heavily populated by 
the members of a class. 

Students of economic behavior are often con- 
cerned with predicting the trends of consumer 
purchasing and saving, for these are closely 
related to the economic cycle. Interview studies 
have been done in which consumers are asked 
their buying and saving plans and, from their 
replies, an effort is made to predict economic 
trends in the near future (Katona, 1951). In 
several instances, trends predicted from such 
surveys have proved to be accurate, but much 
more experience is needed before the method 
can be considered fully tested. 

(3) Consistency of statements relating to the 
same underlying attitude. Internal consistency 
measures are usually considered solely as re- 
liability measures, For example, if the same 
question is asked in somewhat different form 
at two different points in the interview and 
the answers to the two questions are highly 
correlated, the measure may be considered a 
reliable one. However, suppose that a numbet 
of questions are asked which differ not only in 
form but contain substantively different subject 
matter, these different elements of subject mat- 
ter being related to each other in a predicted 
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psychologically meaningful or logical fashion. 
High correlations among these elements sug- 
gest that these questions are all tapping a 
common attitude or set of attitudes and are 
measuring what they purport to measure. If, 
for example, a respondent indicates through 
one question that he believes in sharing all our 
military secrets with Britain and in answer to 
another question says we should standardize 
our military equipment with that of the British 
and on still a third question that U. S. and 
British armies should be integrated at the 
Divisional level, the evidence begins to mount 
up that that respondent believes that there 
ought to be a close military alliance between 
Britain and the U. S. Similarly, when a group 
of items forms a Guttman scale with a repro- 
ducibility score in the nineties, not only is high 
reliability being indicated, but the fact that the 
attitudes expressed by the replies to the series 
of questions have psychological consistency 
suggests that the scale is measuring an attitude 
that can be labeled with some confidence in 
terms of its verbally expressed content. 

(4) Indirect validation: correlation of inter- 
view data with data which have a predictable 
relationship. This form of validation is per- 
haps more commonly employed than any other 
(although it is often not thought of explicitly 
as a form of validation) . On the basis of theory 
and previous research, the social scientist pre- 
dicts that a variable which he measures through 
the interview ought to be related to another 
variable (to be measured through interviewing 
or in some other way). If the relationship actu- 
ally emerges from the study, there is presump- 
tive evidence that the interview measure was 
valid. Of course, the demonstration of validity 
is not conclusive: the relationship may have 
been generated by something else about the 
interview than the variable the research worker 
thought he was measuring (sheer verbality, 
extraversion, etc), but the interview must be 
measuring something reliably to make correla- 
tions possible, and since this something is 
correlated with other variables in a predictable 
direction, the chances are good that the some- 
thing being measured is the same thing (or a 
clase cousin) as that which the research worker 
started out to measure. Perhaps a few examples 
will make this approach to validity clearer. 

In an interview with parents of young chil- 
dren, Sears et al. (1953) inquired into the 
amount and kind of punishment employed. 
They predicted that the amount of punishment 
should be related to the amount of aggression 
children display in fantasy (specifically, in con- 
trolled doll-play sessions) and in their play with 
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server, if he knows he is being observed. An 
example of this is the anecdote told by a candid 
mother, who with her child had recently been 
a subject in a behavioral observation study. The 
two had been observed from behind one-way 
screens while the child was solving a difficult 
problem. Several days later, at home, the mother 
became angry with the child, and the child 
said: "Mother, why aren't you nice to me at 
home the way you are over at that playroom?” 
It is evident that behavioral observation is not 
intrinsically more valid than the interview as 
a measure of what would occur in real life, 
although it may be so in some instances. 

What are some of the considerations which 
bear upon the choice between the two methods? 
First of all, some behavior, because of conven- 
tions about privacy or other reasons, is hardly 
accessible to observation, so that the interview 
is the only source of information available. 
This is true of many aspects of family life, al- 
though astonishingly complete observations are 
sometimes made (see One Boy’s Day, Barker 
and Wright, 1951). Perhaps a more difficult 
problem for behavioral observation is that of 
sampling the behavior which is the subject of 
study. The observer can be present only for a 
limited time and in a restricted range of situa- 
tions, and the question always arises as to how 
representative the behavior at these times and 
in these situations is of the subject’s general 
behavior, Some events occur very rarely in the 
respondent's life (perhaps only once, as in the 
case of the death of a parent), and studying 
reactions to such events through behavioral ob- 
servation is obviously difficult from the stand- 
point of timing the observations. In an 
interview, the interviewer can ask about past 
events, expectations of future events, and about 
a wide variety of situations, and the respondent 
can report what he has done or would do in 
these circumstances. In this way, the research 
worker can cover a larger sample of behavior 
than would be possible with observation, and 
can learn something fairly quickly and easily 
about how the behavior he is studying varies 
with changes in the individual's external situa- 
tion (always assuming that the respondent can 
report accurately) . b 

An important question in weighing the rela- 
tive merits of behavioral observation and the 
interview is this: to what extent are prestige 
and status involved in the respondent's activi- 
ties? The more these are involved, the more 
distortion one might expect in the interview. 
(See the discussion of validity above) Be- 
havioral observation, while subject to the same 
problem, can sometimes be more free of it, if the 
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observation process is inconspicuous and as 
much as possible of the real-life situation is 
present. How aware is the subject of his own 
behavior and motives? It is in the spheres of 
behavior where repression and other defense 
mechanisms are at work that it becomes most 
important to check interview findings against 
overt behavior. A field worker reported the fol- 
lowing event: A mother was being interviewed 
concerning her child-rearing practices. During 
the interview she held her small son on her lap. 
The child began to play with his genitals. The 
mother, without looking directly at the child, 
moved his hand away and held it securely for 
a while, meanwhile talking to the interviewer. 
Later in the interview, the mother was asked 
what she ordinarily did when her child played 
with himself. She replied that he never did this 
— he was a very "good" boy. She was evidently 
entirely unconscious of what had transpired in 
the very presence of the interviewer. Similarly, 
people may be unconscious of the ways in which 
they express aggression, or the ways in which 
they attempt to get approval from others. When 
interviewed about their reactions to a moving 
picture, people may not be able to describe the 
extent of their emotional arousal, yet managers 
of large theaters know that for particularly 
torrid moving pictures they must adjust the 
air-conditioning apparatus to handle the higher 
temperatures generated in the theater by the 
simultaneous emotional reactions of masses of 
people. It is evident that for such areas of 
research, interviewing alone is inadequate, 
although much can be done by using diagnostic 
questions whose import the respondent does not 
recognize. On the other hand, the research 
worker may be working with subject matter in 
which the verbal expressions themselves are the 
focus of interest, and in which there is no other 
behavioral criterion to which direct observation 
could be applied. For example, in studying 
ideologies, some aspects of concept formation, 
or rationalizations, the research worker will 
necessarily employ some form of the interview 
as his research tool. 

, In summary, it appears to the writers that the 
interview is currently being used uncritically in 
researches where other techniques would be 
equally, or more, efficient. On the other hand, 
some of the most impressive contributions to 
social science knowledge have been made by 
studies which employed the interview as their 
central technique. A growing body of methodo 
logical findings is enabling the discriminating 
research worker to know when to use the intel 
view and how to adapt it to his own research 
needs. 
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CHAPTER 13 


Content Analysis* 


BERNARD BERELSON 
Ford Foundation 


In the communication process a central posi- 
tion is occupied by the content. By communica- 
tion content is meant that body of meanings 
through symbols (verbal, musical, pictorial, 
plastic, gestural) which makes up the communi- 
cation itself. In the classic sentence identifying 
the process of communication — “who says 
hat to whom, how, with what effect" — com- 
munication content is the what. This chapter 
deals with the analysis of what is said in com- 
munication of various kinds. 

Since the content represents the means 
through which one person or group communi- 
cates with another, it is important for com- 
munication research that it be described with 
accuracy and interpreted with insight. Com- 
munication content is so rich with human 
experience, and its causes and effects so varied, 
that no single system of substantive categories 
can be devised to describe it. However, a scien- 
tific method has been developed — and is being 
developed further — for describing various 
facets of communication content in summary 
fashion. That method is called content analy- 
sis. It has been used to answer such diverse 
questions as these: 

How have the slogans of May Day propa- 
ganda in the U.S.S.R. changed during the 
Soviet regime? 

How is the writer's personality structure re- 
flected in what he writes? 

How do the values in American plays differ 
from those in German plays of the same time? 

What is the pattern of movement in the de- 
liberations of a small group? What is the pat- 
tern in social case work? 

How are minority ethnic groups treated in 
short stories in popular magazines? j 

How can communications suspected of sub- 
version be tested for their “propaganda” com- 
ponent? 

This report on the technique of content 
analysis attempts to survey critically the appli- 
cations of content analysis, to relate it to mar- 
ginal developments in other fields, to note the 
major types and categories of analysis, and to 
review certain technical problems. After a gen- 


eral introductory section, the report discusses 
the major uses of content analysis, the units and — 
the categories of analysis, and such technical 
matters as sampling, reliability, and modes of 
inference. 


DEFINITION OF CONTENT ANALYSIS 


What is meant by the term "content analysis"? 
Review of several definitions which have ap- 
peared in the technical literature will serve to 
identify its major characteristics: 


Systematic content analysis attempts to refine 
more casual descriptions of the content, so as to 
show objectively the nature and relative strength 
of the stimuli applied to the reader or listener. 
(Waples and Berelson, 1941, p. 2) 

A social science sentence may be called one of 
"content analysis" if it satisfies all of the follow- 
ing requirements: (1) it must refer either to syn- 
tactic characteristics of symbols . . . or to semantic 
characteristics .. . (2) it must indicate frequencies 
of occurrence of such characteristics with a high 
degree of precision. One could perhaps define 
more narrowly: it must assign numerical values 
to such frequencies. (3) It must refer to these 
characteristics by terms which are general . . - 
(4) it must refer to these characteristics by terms 
which occur . . . in universal propositions of social 
science. One may consider adding to this defini- 
tion another requirement: (5) a high precision of 
the terms used to refer to the symbol character- 
istics studied. (Leites and Pool, 1942, pp. 1-2) 

The content analyst aims at a quantitative clas- 
sification of a given body of content, in terms of 
a system of categories devised to yield data rele- 
vant to specific hypotheses concerning that con- 
tent. (Kaplan and Goldsen, 1943, p. 1) 

“Content analysis” may be defined as referring 
to any technique for the classification of sign- 
vehicles; which relies solely upon the judgments 
—which, theoretically, may range from percept 
discriminations to sheer guesses—of an analyst of 
group of analysts as to which sign-vehicles fall into 


* At the request of the editor, Dr. Berelson has 
prepared a condensation, by half, of his Content 
Analysis in Communication Research, The Free 
Press, 1952. Few changes have been made in the 
process of condensation. 
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which categories; on the basis of explicitly formu- 
lated rules; provided that the analyst's judgments 
are regarded as the report of a scientific observer. 
The results of a content analysis state the fre- 
quency of occurrence of signs—or groups of signs 
—for each category in a classification scheme. 
(Janis, 1943, p. 429) 

. . - The technique known as content analysis 

. attempts to characterize the meanings in a 
given body of discourse in a systematic and quan- 
titative fashion. (Kaplan, 1943, p. 230) 


This group of definitions provides six distin- 
guishing characteristics of content analysis: 

(1) It applies only to social science generali- 
zations: Leites and Pool. 

(2) It applies only, or primarily, to the de- 
termination of the effects of communications: 
Waples and Berelson. 

(3) It applies only to the syntactic and se- 
mantic dimensions of language: Leites and 
Pool. 

(4) It must be "objective": Waples and 
Berelson, Leites and Pool, Janis, Kaplan. 

(5) It must be “systematic”: Leites and 
Pool, Kaplan and Goldsen, Kaplan. 

(6) It must be quantitative: Waples and 
Berelson, Leites and Pool, Kaplan and Goldsen, 
Janis, Kaplan. 


As we shall see, the first and second of these 
characteristics define the field of content analy- 
sis too narrowly. The review of the literature 
will show that it has been applied successfully 
in other fields than the social sciences and for 
other purposes than the description of the 
effects of communications upon readers and 
listeners. But the other four characteristics are 
required for a proper definition of content 
analysis, 

The syntactic-and-semantic requirement is 
meant to rule out the analysis of communica- 
tion content for the pragmatic dimension of 
language (the third branch of semiotics, the 
general science of signs, as developed by Charles 
Morris). That is, content analysis is ordinarily 
limited to the manifest content of the commu- 
nication and is not normally done directly in 
terms of the latent intentions which the content 
may express or the latent responses which it 
may elicit. The results of content analysis, 
however, frequently serve as the basis for these 
“interpretations” of the latent content. 

The requirement of objectivity stipulates 
that the categories of analysis should be defined 
so precisely that different analysts can apply 
them to the same body of content and secure the 
same results, Like the first requirement, this 
ordinarily limits content analysis to the mani- 
fest content. 
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The requirement of system contains two dif- 
ferent meanings. In the first place, it states 
that all of the relevant content is to be analyzed 
in terms of all the relevant categories, for the 
problem at hand. This requirement is meant 
to eliminate partial or biased analyses in which 
only those elements in the content are selected 
which fit the analyst's thesis. 

The second meaning of "system" is that 
analyses must be designed to secure data rele- 
vant to a scientific problem or hypothesis. The 
results of a content analysis must have a meas- 
ure of general application. 

The requirement of quantification, the single 
characteristic on which all the definitions agree, 
is perhaps the most distinctive feature of con- 
tent analysis. Of primary importance in con- 
tent analysis is the extent to which the analytic 
categories appear in the content, that is, the 
relative emphases and omissions. Now this re- 
quirement of quantification does not necessarily 
demand the assignment of numerical values to 
the analytic categories. Sometimes it takes the 
form of quantitative words like "more" or “al- 
ways" or "increases" or "often." Although re- 
sults of this kind may be appropriate for certain 
studies, it should be recognized that such terms 
are just as "quantitative" as the terms 37 or 
52%; they are only less exact and precise. In 
most applications of content analysis, numerical 
frequencies have been assigned to occurrence 
of the analytic categories. h 

This review of the distinguishing characteris- 
tics of content analysis, then, results in the 
following definition: Content analysis is a re- 
search technique for the objective, systematic, 
and quantitative description of the manifest 
content of communication. 


THE HISTORY OF CONTENT ANALYSIS 


The output of content analysis studies has 
sharply increased in every five-year interval over 
the past thirty years. From about two studies a 
year in the early decades of the century, content 
analysis studies have increased to three a year, 
then four, eight, twelve, and in the most recent 
years to an average of about 25 a year. Pre- 
sumably the field is still expanding. 

Content analysis was first used by students of 
journalism (and later by sociologists) to study 
the content of American newspapers. This 
work centered in the School of Journalism at 
Columbia University, and culminated in Mal- 


* colm Willey’s study, The Country Newspaper, 


in 1926. Almost exclusively, such early studies 
employed straight subject-matter categories 
(e.g., domestic affairs, politics, labor, crime, 
divorce, sports, etc.) . 


. 
e 
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The other early concentration of content 
analysis studies occurred in the field of litera- 
ture, with the analysis of various stylistic fea- 
tures in English poetry and prose (e.g., sentence 
length, predications, conjunctions, etc.). The 
contribution of this kind of study was appar- 
ently not valued highly by students of litera- 
ture; it was many years before it was applied 
again to problems of literary style by Rickert, 
Spurgeon, Miles, and others. 

During the late 1930’s content analysis was 
given an important stimulus through the work 
of Harold D. Lasswell and his associates, as well 
as through the growing interest in propaganda 
and public opinion and the emergence of radio 
as a great mass medium of communication. 
Viewing communication problems within the 
framework of a theory of politics, Lasswell in- 
troduced new problems, new procedures, and 
new categories into the field (and thereby im- 
parted a political complexion to it). The con- 
cern of Lasswell and others with problems of 
public opinion and propaganda led naturally 
to the study of communication and thence to 


HANDBOOK OF SOCIAL PSYCHOLOGY 


content analysis. At about this time, the rise 
of radio also directed attention to content analy- 
sis, most notably under Paul F. Lazarsfeld of 
the Bureau of Applied Social Research at 
Columbia University. Also during this period, 
the professional interest in adult education 
promoted several researches into the readability 
of materials. During World War II, content 
analysis was employed by several government 
departments charged with responsibilities in the 
field of mass communication. The Experimen- 
tal Division for the Study of Wartime Com- 
munications, in the Library of Congress, the 
Analysis Division of the Foreign Broadcast Intel- 
ligence Service (Federal Communications Com- . 
mission), and the Organization and Propaganda 
Analysis Section of the Special War Policies 
Unit, Department of Justice, all employed con- 
tent analysis techniques in their investigations 
of the propaganda output of various countries, 
organizations, or individuals. In addition, con- 
tent analysis has been given certain commercial 
applications. 


Tue Uses or CONTENT ANALYSIS 


The method of content analysis has been ap- 
plied to so large and diverse a group of ma- 
terials, with respect to so large and diverse a 
set of problems, that it is not easy to order the 
uses in a single classification. However, seven- 
teen types of uses (applications, functions) of 
content analysis have been identified. 

This set of uses is presented here as a list, 
not as a classification system. That is, there is 
no claim that the uses form a logically coherent 
organization or that they are classified on a 
single dimensional base. ‘This classification has 
been induced from the major purposes of the 
studies and as such it presumably reflects the 
central problems to which the method has been 
applied. If the list is not sufficiently coherent 
and systematic, neither is the field, 

The uses of content analysis are ordered un- 
der major headings which progressively refer 
to the characteristics of communication con- 
tent, the causes of content, and the consequences 
of content. 


CHARACTERISTICS OF CONTENT: SUBSTANCE 


To describe trends in communication con- 


tent, “The most valuable use of studies of con- _ 


tent... is in noting trends and changes in 
content. Systems of classification may be inade- 
quate and unstandardized; nevertheless, if a 
system is used consistently over a time period 
valuable facts may appear” (Albig, 1938, p. 


, 


349). This quotation indicates the importance 
sometimes attached to this application of con- 
tent analysis, although it seems an overstate-- 
ment in view of the more advanced uses of 
content analysis. The classification into a single 
set of categories of similar samples of commu- 
nication content taken at different times pro- 
vides a concise description of content trends, in 
terms of relative frequencies of occurrence. Such 
descriptions of trends are often useful in them- 
selves; in addition, they provide data which can 
be correlated with corresponding changes on 
the part of the communicator or the audience. 

The obvious importance of this application 
of content analysis has led to many such studies. 
The contributions can be illustrated by a few 
examples: 

In thirty Minnesota weekly papers, the per- 
centage of shop-set news originating ins the lo- 
cality and trade area rose from 28% in 1860 
to over 80% in 1890 and thereafter. Personal 
news increased from less than 6% in 1860 to 
almost 40% in all years after 1900 (Taeuber, 
1932) . 

There was relatively little difference in the 
extent to which economic, nationalistic, mili- 
taristic, or internationalistic motivations were 
employed in the McGuffey readers and in 3 
sample of school readers in 1930, but there was 
a considerable difference in the relative use of 
religious motivations (Estensen, 1946) . 
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The propaganda content of May Day slogans 
in the Soviet Union from 1924 to 1943 moved 
from "universal revolutionary" symbols to “na- 
tional" symbols (Yakobson and Lasswell, 
1949) . 

Whereas only about half the articles on child 
development in the Ladies Home Journal and 
Good Housekeeping in 1904 and 1910 were 
based on “scientific authority" rather than “per- 
sonal opinion," the proportion in 1940 had 
risen to fully 98% (Ojemann, 1948). 

As compared with World War I, propaganda 
in World War II was less emotional, less 
moralistic, and more truthful (Kris and Leites, 
1947) . 

Such trend studies provide a valuable histori- 
cal perspective against which the current con- 
teht of the communication media can be more 
fully understood. Indeed, some students have 
suggested the organization of continuing trend 
studies of the major media on various topics 
in order to provide a background against which 
particular studies could be interpreted. In this 
connection, it is important to point out that 
most of the existing trend studies have dealt 
only with subject-matter categories, although 
the addition of other categories (e.g., direction, 
authority, emotionality) would make such 
studies even more valuable. 

To trace the development of scholarship. 
The preceding section has discussed content 
trends in the general media; this section deals 
with the special case of content trends as indices 
to the development of scholarly interests and 
activities. Content analyses of the literature in 
various scholarly and scientific fields have been 
used to describe their development. An analy- 
sis of the development of physics in England, 
Germany, and France during the 18th and 19th 
centuries produced certain “wave-like fluctua- 
tions of creative productivity,” decades in dura- 
tion (Rainoff, 1929). The changing nature of 
botanical research from 1890-1930 is revealed 
in the study of the content of a group of Ameri- 
can journals in that field (Stevens, 1932). 
Similatly, the development of sociological in- 
terests in the United States over the past fifty 
years is indicated by the distribution of articles 
in the American Journal of Sociology (Shanas, 
1945) . In recent years the Journal has increased 
its attention to problems of population, races 
and nationalities, and methods of research at 
the expense of papers dealing with the theory 
and history of sociology and with social reform.* 
Such analysis not only describes the develop- 
mental trends of a discipline, but also supplies 
the definition or composition of the field at 
any particular time. A similar analysis of psy- 


chological journals was carried out by Allport 
and Bruner (1940). 

The particular assumptions involved in this 
use of content analysis must be recognized and 
validated. Most such studies rely upon a single 
source of data, or a small group of sources, over 
a relatively long period of time, and the con- 
tinued adequacy and consistency of the data 
may require special attention. For example, a 
few general journals may not contain a valid 
sample of the research literature of a given 
field over a long time because of the later addi- 
tion of more specialized journals representing 
branches of the field. Or, again, changes in the 
content of a relatively few journals in a field 
may reflect changes in editors or editorial poli- 
cies rather than genuine, broadly recognized 
shifts of interest within the field itself. In some 
cases, results may be distorted by the national 
origin of the source materials, as, for example, 
in analyses of "great men" by means of space 
measures in the Encyclopaedia Britannica 
and/or the Americana. And trend studies may 
be distorted by the tendency of sources to give 
more attention to the work of the period im- 
mediately preceding their own. Even changes 
in classification policy may be a disturbing in- 
fluence; for example, articles on the borderlines 
between physics and mathematics or between 
sociology and political science may have been 
assigned to different fields at different times. 
However, precautions can be taken against most 
of these difficulties, and then content analysis 
becomes a useful tool for describing the com- 
position and development of scholarly and 
scientific fields. 

T'o disclose international differences in com- 
munication content. Systematic analysis can 
disclose differences in the communications avail- 
able in various countries. What news of com- 
mon interest and relevance is carried in 
newspapers or on the air, and how is it treated? 
How different are international versions of the 
same event, such as wars or international crises 
in which nations have equal investments? What 
are the central differences between, say, Ameri- 
can, British, French, and Russian films? Such 
questions can be answered through comparative 
content analyses which will reveal differences 
in the focus of attention of national audiences. 

An intensive analysis (though not particu- 
larly systematic or rigorous) revealed the differ- 
ing treatment of this country's wars in American 
history textbooks and in the textbooks of the 
former enemy countries (Walworth, 1938). 
The versions of the Revolutionary War in 
American and British texts or of World War I 
in American and German texts are compared in 
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order to reveal the common or divergent views 
of history presented to the youth of different 
countries. The relevance of such presentations 
for international attitudes is clear. So is the 
record: in virtually every case, the divergencies 
are predominant. Take, for example, the 
American and British versions of the Revolu- 
tionary War. In simplified form, the situation 
is described thus (Walworth, 1938, pp. 3-20) : 


American 


King George III is a blundering bigot, obstinate 
and narrow-minded. 


America had contributed to British prosperity. 


The “Intolerable Acts” passed by the British Par- 
liament were the beginning of hostilities. 


The Colonies were fighting the cause of traditional 
English liberty by their action. 


(omitted) 


The American defeat at Bunker Hill was a moral 
victory, for it showed that the raw Colonial troops 
could face the regulars without flinching. 


In the one case, the literature of the Hitler 
Youth and that of the Boy Scouts of America 
were analyzed in terms of goals and justifica- 
tions (Lewin, 1947). There was no difference 
in emphasis upon such ends as physical fitness, 
discipline, or achievement, but there was more 
stress in the German material on national loy- 
alty and identification and less on altruism and 
creativity. In addition there was considerable 
difference in the bases for recommending de- 
sirable ends to the members of the two organi- 
zations. The Hitler Youth recommendations 
were justified primarily on the basis of member- 
ship in the national community (66%), while 
the Boy Scouts justified ends in terms of the 
individual's own satisfaction (47%) . 

In the other study, essentially the same thing 
was done for the most popular dramas in the 
two countries (McGranahan and Wayne, 1948). 
Whereas the major themes in American plays 
were love (60%) and morality (36%), in Ger- 
man plays they were ideals (44%) and power 
(33%). In American drama the “level of ac- 
tion" was almost exclusively personal (96%) ; 
in German drama it was evenly divided between 


Although this particular study is based upon 
comparative quotations culled from selected 
works, with only a few rough references to 
central tendencies, there is no question but that 
systematic techniques of analysis can be applied 
to this problem. 

Two unusual studies of German-American 
differences come to similar conclusions, al- 
though they involve widely differing materials. 


British 


King George was a man of moderate ability with 
a firm will, real courage, and a high sense of royal 
obligation to the public. 


The colonies were a liability rather than an asset 
to the British Empire. 


Acts of violence committed by the radical element 
in the colonies made military action necessary. 


(omitted) 


King George made a last attempt at reconciliation. 


-.. The small English army suffered as heavy 
losses with as steady courage as has ever been shown 
on prouder fields of honour. 


ideological and personal affairs (51% and 
47%). Now while both these studies can, of 
course, be used to illustrate and perhaps docu- 
ment differences in national character, they 
also can be taken as descriptions of what was 
presented to similar groups in the two countries. 

To compare media or “levels” of communi- 
cation. Different media and different “levels” 
of communication not only attract different 
audiences but they also treat the same topics 
in different ways. For both of these reasons, 
the comparative analysis of communication 
content is among the most promising applica- 
tions of content analysis, and a number of in- 
teresting studies have appeared in recent years. 

For example, from a detailed study of 24 
“good” novels and the films adapted from them 
are developed the major types of changes found 
in such adaptations — with respect to dialogue, 


character, action, scene, setting, sequence — and 


the (inferred) reasons for them (Asheim, 1949). 
Simply the amounts of attention given to the 
central plot components in book and film serve 
as an index to the fidelity of the transcription 
from one medium to the other: 
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The Elizabeth-Darcy romance 

The Wickham Story 

The Jane-Bingley romance 

The story of Mr. and Mrs. Bennett 
The character study of Collins 

Minor characters 

The character study of Lady Catherine 
Pure description 


The story of Hepzibah and Clifford as brother 
and sister 

The story of Hepzibah and Clifford as lovers 

The story of Phoebe and Holgrave 

The story of Judge Pyncheon’s plot against Clifford 

The story of Judge Pyncheon and the counterplot of 
Clifford 

Commentary 

General descriptions 

Lesser characters 


In addition, the study disclosed the probable 
bases for various content changes which were 
made in many adaptations. Thus, for example, 
the technical requirements of film-making pro- 
duced an “impersonal omniscient” viewpoint in 
23 of the films, whereas it had occurred in only 
nine or ten of the books. 

Another study is distinctive for its compari- 
son of different “quality levels” dealing with 
American history; it investigated the extent to 
which the basic findings of research mono- 
graphs were incorporated in secondary school 
texts on American history (Blythe, 1932). 
About thirty new viewpoints, emphases, and 
discoveries in historical scholarship from 1893 
to 1928 were checked, e.g., Turner's interpreta- 
tion of the significance of the frontier, Mahan's 
emphasis on sea power in the War of 1812, 
Beard's economic interpretation of the Consti- 
tution, Becker's view of the American Revolu- 
tion as a double revolution, and so on. The 
typical finding was incorporated in only one- 
third of fifteen then-recent texts, and individual 
texts included only from five to nineteen of 
the findings. On the average, it took about 
six years for a development from historical re- 
search to find its way into history textbooks for 
secondary schools. Because different groups of 
people read and listen to different kinds o| 
communications on the same subject, the com- 
parative application has considerable potential 
value as a method for describing the differences 
in content to which they are exposed. In addi- 
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Mosr FAITHFUL 
Pride and Prejudice 


Book. Film 
4896 41 96 
23 14 
13 12 
12 20 
12 14 

9 9 

6 13 

5 4 


Least FAITHFUL 
The House of the Seven Gables 


Book Film 
30% 0% 
0 43 
28 21 
19 0 
0 35 
16 5 ` 
12 4 
7 8 


tion, it is particularly useful in comparing 
different "levels" of communication content. 

To audit communication content against ob- 
jectives. Every communication outlet has an 
objective or a set of objectives, whether implicit 
or explicit. One measure of the quality of the 
content is the extent to which it faithfully ex- 
presses such objectives. 

Not many studies have been done primarily 
with reference to this function of content analy- 
sis. A large number have compared content 
emphases with the standards of the analyst. (to 
be discussed in the next section), but only a 
small number have checked content against the 
communicator's own objectives. Sometimes such 
analyses are made by "outsiders" on the basis 
of the assumed objectives of the communicators. 
Thus an analysis of the "educational" content 
of television was carried out in early 1951 by a 
group interested in promoting such programs. 
On the assumption that the television industry 
would argue, in F.C.C. hearings, that it was 
already fulfilling this purpose adequately, the 
total New York output for one week was 
analyzed for type of program and the findings 
became an impressive exhibit (Smythe and 
Horton, 1951). For example, public issues re- 
ceived 2% of the total program time, public 
events 1%, and information programs 345, as 
compared with 10% for crime drama and 14% 
for variety. 

Content analysis can be employed to correct 
the constant, often inadvertent, and perhaps 
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inevitable omissions from and mis-emphases in 
communication content, in terms of the objec- 
tives and directives of the communicator. 

To construct and apply communication stand- 
ards. Content analysis can describe communi- 
cations but it cannot, per se, evaluate them. 
Evaluation necessitates the acceptance of a 
standard, or standards, with which the com- 
munication content is then compared by means 
of content analysis. 

Content analysis has been used in three meth- 
ods of evaluating communications: (1) evalua- 
tion of performance against such a priori 
standards as “balance” or "social purpose"; (2) 
evaluation of performance by comparing one 
body of content with another; and (3) evalua- 
tion of performance by comparing content with 
a noncontent source. In each case, the standards 
are advanced by someone other than the com- 
municator himself — which distinction dif- 
ferentiates this use from the audit function of 
content analysis. 

Perhaps the major recent application of a 
priori standards to communication content was 
made for the Royal Commission on the Press 
(1949). This British group utilized a content 
analysis, of subject and form mainly, ", . . in 
order to ascertain how great was the extent of 
suppression, distortion, and inaccuracy ..." (p. 
107). Partly on the basis of this research, re- 
ported in several tabulations in the appendix 
to the report, the Commission concluded “. . . 
that the political factor in the selection and 
presentation of news is apparent in all the na- 
tional papers . . ." (p. 109). Other examples 
have dealt with the treatment of labor in radio 
news programs (Sussman, 1945), the treatment 
of the Taft-Hartley Act in 50 periodicals (Ash, 
1948), and the incidence of local news in 
county weeklies (Willey, 1926) . 

The second basis of evaluation involves the 
construction of standards internally, i.e., within 
the content itself, in which case comparisons are 
made between one body of content and another. 
For example, in a study of “ethical and unethi- 
cal behavior by representative newspapers," the 
performances of several papers were evaluated 
against the average “socialization-sensational- 
ism" profile of a group of papers highest»in 
“socialized news attention" (Kingsbury and 
Hart, 1937). The profiles were secured by the 

analysis of front-page headline space devoted 
to three “highly social" subject matters (foreign 
problems, U.S. international, and citizenship) , 
and three “extremely sensational” ones (money- 
sensational, sex, and money-sex). By the sub- 
traction of the latter from the former, a single 
"socialization-sensationalism score" was secured 
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for each of some forty newspapers ranging from 
the United States Daily (32.8) and the Chris- 
tian Science Monitor (30.3) to the New York 
Daily News (—14.2) and the New York Daily 
Mirror (—17.3). Thus, the papers were com- 
pared on this standard with one another as 
well as with the norm of the "better" news- 
papers. 

Finally, the third method of evaluation — 
against an outside noncontent standard — was 
used in assessing the frequency with which 
members of different ethnic groups appeared 
in magazine stories (Berelson and Salter, 1946). 
In this case, the distribution of the fictional 
characters was compared with the distribution 
of the actual population. "Census data only 
accentuates the differential treatment accorded 
‘natives’ and ‘minorities’ in the stories . . . 
Every ‘minority’ group appears less frequently 
in the stories than in the country. Only "The 
Americans’ appear more frequently" (p. 173). 

Another such comparison, useful in evaluat- 
ing newspaper performance, was employed to 
determine the extent to which British papers 
reported the major facts on an important 
event, the first year's progress of. the National 
Coal Board (Royal Commission, 1949). The 
analysts constructed a master list of 33 major 
facts — "what actually happened" — against 
which they checked the coverage of the papers. 
The scores for nine papers ranged from 2314 
for the Times to 3 for the Daily Worker. 

The discussion of standards for mass commu- 
nications will almost certainly increase in com- 
ing years, and in this discussion the use of 
content analysis can be extremely valuable in 
evaluating the media. Once the nature of the 
standards is clearly and explicitly formulated, 
content analysis can be usefully applied to meas- 
ure the extent to which the specified standards 
are met. Potentially, at least, content analysis 
can provide objective data on performance to 
compare with the agreed-upon norms stated in 
the form of communication standards. 

To aid in technical research operations. To 
Some extent, content analysis has been used as 
one step in a series of research operations. The 
major use here involves the coding of quali- 
tative materials secured through sample survey 
or experimental methods. 

In recent years there has been considerable 
development in the methods of collecting vari- 
ous kinds of information, opinions, and inter- 
action data by means of personal interview or 
recorded observation. One such method is the 
intensive or open-end interview, in which the 
respondent is asked a question without being 
given a checklist or other structured form in 


CONTENT ANALYSIS 


which to answer. By means of probes and other 
devices, the respondent is encouraged to reply 
at some length. For certain purposes, this type 
of question is indispensable. However, its use 
raises the problem of the systematic and ob- 
jective analysis of the answers. “It can safely 
be said that the proponents of the open-end 
interview technique have made much more 
progress in the conduct of the interviews than 
in their statistical analysis” (Lazarsfeld, 1944) . 

Strictly speaking, the quantification aspect of 
content analysis is seldom applied to such inter- 
views — partly because the coding rarely re- 
quires such intensive analysis, partly because 
the materials would not ordinarily support it 
(i.e., they are too far from a verbatim account 
of what the respondent actually said), and 
partly because the meaning of the quantifica- 
tion would rest upon dubious assumptions 
about repetition, extension of remarks, etc. The 
coding procedure is similar to content analysis 
in that it involves the codification of verbal ma- 
terials which can take many diverse forms and 
still "mean" the same thing, in terms of the 
categories under which they are subsumed. But 
there the similarity usually ends. Thus coding 
is based upon the presence or absence of the 
categorized material, not upon its frequency — 
upon whether, not how much. 

However, in one form of this material a vari- 
ant of content analysis has been used to classify 
the subjects on the basis of their total response 
to the interview, with consideration of the point 
in the interview at which certain responses were 
made. In a study of ethnic prejudices of a 
group of veterans, the respondents were classi- 
fied into the intense (anti-Semitic or anti- 
Negro), the outspoken, the stereotyped, and the 
tolerant according to whether they made spon- 
taneous restrictive remarks, elicited restrictive 
remarks, stereotyped responses, or explicit ref- 
erences of a tolerant nature (Bettelheim and 
Janowitz, 1950). In another case, the category 
of "attitude toward political parties in general" 
was coded on the basis of the pros and cons 
securéd in answer to several questions dealing 
with political matters of various kinds. 

More direct application of content analysis 
as a research aid is found in some recent anal- 
yses of group behavior (Chapter 10). A “profile 
of group interaction" has been developed by 
the quantitative analysis of statements made 
by group members, in terms of such categories 
as these: shows solidarity, shows tension re 
lease, agrees, gives suggestion, gives opinion, 
gives orientation (and their counterparts). Hy- 
potheses on group behavior under varying con- 
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ditions are tested by means of this use of content 
analysis (Bales, 1950) . 


CHARACTERISTICS OF CONTENT: FORM 


To expose propaganda techniques. Content 
analysis is often identified with propaganda 
analysis. In one sense, a great number of con- 
tent studies — certainly the majority of them 
— can be considered propaganda analyses, if by 
propaganda is meant the deliberate attempt to 
influence attitudes or behavior on controversial 
public issues. Even the study of the treatment 
given education in newspaper editorials be- 
comes an analysis of propaganda, and its cate- 
gories become “propaganda techniques." In this 
way, then, almost every category used in content 
analysis can be considered a “propaganda tech- 
nique" or a category of "propaganda analysis," 
since almost every category has been used at 
one time or another in analyses of material 
designed to influence attitudes on controversial 
issues. 

As the field is usually identified; however, 
certain "techniques" can be illustrated from the 
"propaganda analyses" presented in the litera- 
ture. The first type of category that appears in 
many such analyses is the theme or, as it is often 
called, the "appeal" Now "appeal" is a term 
which ordinarily refers not to characteristics of 
the content, but to characteristics of either the 
intent or the effect of the communication; the 
corresponding term for content is probably 
"theme." Various content formulations, or 
themes, may carry the same "appeal"; similarly, 
the same theme may carry different "appeals" 
for different audiences. As already indicated, 
it is preferable to analyze communications in 
content categories rather than in terms of pre- 
sumed intents or effects, and then to make the 
latter type of interpretation on the basis of 
analytic results. An illustration of this process 
is provided by a study of propaganda techniques 
used in World War I (Lasswell, 1927). 'The 
analysis of the themes used by the opposing 
sides illustrated the four major objectives of 
wartime propaganda: to mobilize hatred against 
the enemy, to preserve the friendship of allies, 
tq, preserve the friendship and if possible to 
secure the cooperation of neutrals, and to de- 
moralize the enemy. Within this framework 
the effectiveness of the propaganda was evalu- 
ated: “The British were amazingly successful 
in the development of humanitarian war aims. 
'The Germans aroused much resentment and 
suspicion abroad by talking about a war of 
German Kultur, and by underplaying the hu- 
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manitarian ideal . . . British public men be- 
gan to talk about a war to end war long before 
the German statesmen learned this vocabulary 
. . . The Germans were never able to efface 
the initial impression that they were aggressors. 
This was due in part to the stupidity of their 
own appeals” (Lasswell, pp. 196-197). A more 
molecular and quantitative approach to the 
analysis of political communications is provided 
by White (1942, 1949) . 

Another set of categories used in propaganda 
analysis — perhaps the set most frequently asso- 
ciated with the term — deals with the "tricks 
of the trade.” They are the various special 
devices of which the subject is not ordinarily 
aware. Perhaps the most appropriate illustra- 
tions of such analysis are found in the work 
of the Institute for Propaganda Analysis. Seven 
such devices are described and illustrated in 
a study of the speeches of Father Coughlin (Lee 
and Lee, 1939). This list was later expanded 
to eleven “tricks of the trade”: 


A. Techniques of basic procedure 
1. Selecting the issue 
2. Case-making or card-stacking 
3. Simplification 


B. Use of omnibus words (basic propaganda 
shorthand) 
4. Name-calling 
5. Glittering generality 


C. Techniques of identification 
6. Transfer 
7. Testimonial 
8. Plain folks 
9. Bandwagon 


D. Strategic techniques 
10. Hot potato 
ll. Stalling 


Although such devices may be useful in making 
people propaganda conscious, it should be 
clearly recognized that they do not constitute 
the “whole” area of propaganda analysis. Little 
has been done to define these categories pre- 
cisely, and there has been no effort to correlate 
systematically the appearance or frequencies of 
these categories with the effectiveness of :he 
material. 

The source of assertions is another propa- 
ganda characteristic which has been dealt with 
in content analyses. This category was included 
in the study of German short-wave propaganda 
through a comparison of certain prestige de- 
vices in British and German propaganda deal- 
ing with the same event, Le, the Altmark 
incident (Bruner, 1941). As it happens, the 


two countries used authoritative symbols quite 
similarly in this case. 

This brief review of some categories that have 
been used in "propaganda analysis" cannot 
omit the characteristic which is often associated 
with the term "propaganda," namely, the emo- 
tional content of the material. Some attempts 
have been made to measure systematically the 
relative emotionalization of various propa- 
ganda, usually by counting the frequency of 
emotional, or sentimentalized, terms. An illus- 
tration of this kind of propaganda analysis is 
provided by a study of the 1940 presidential 
campaign, which revealed that significantly 
more emotional content appeared in the ma- 
terial dealing with Roosevelt than in that deal- 
ing with Willkie (Waples and Berelson, 1941). 
The same procedure revealed that the emo- 
tional content of newspaper material on the 
campaign was higher than that of radio or 
magazine material and that the emotional tone 
of propaganda during the last month of the 
campaign was twice as great as that during the 
relatively quiet period of early August. 

The analysis of "propaganda techniques" 
may take a large number of forms and employ 
a large number of different categories. It should 
not be considered as restricted to the emotional 
component or to the "tricks of the trade." Just 
as effective propaganda for various audiences 
may necessitate many and complex appeals, so 
must content analysis designed to identify the 
efficient influences within it be varied and com- 
plex. 

To measure “readability.” One of the two 
main uses of content analysis by educators is 
the measurement of the “readability” of printed 
materials; textbook analysis is the other main 
use. What are the concrete factors that dis- 
tinguish material that is easy or hard to read 
and comprehend? What are the stylistic dif- 
ferences, in these terms, between True Story 
and The Atlantic Monthly? The answers to 
such questions are used by educators in the 
selection and even the production of appropri- 
ate books for readers of varying ability. 

In a sense, the development of readability 
measures is a story of increasing differentiation. 
In the beginning of such analysis the “word” 
was focused upon. The first studies of readabil- 
ity in the mid-1920's classified materials simply 
on the basis of the proportions of "easy" and 
"hard" words they contained, as indicated by a 
standard word list (usually the Thorndike list) . 
In the following decade additional factors were 
investigated. — sentence length, simple sen- 
tences, indeterminate clauses, prepositional 
phrases. Then followed a thorough investiga- 
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tion of no fewer than 82 assumed elements of 
difficulty, of which five were finally selected as 
correlating best with tests of comprehension: dif- 
ferent words, different hard words, personal 
pronouns, prepositional phrases, and sentence 
length (Gray and Leary, 1935). This set of 
content characteristics was next reduced to 
three (hard words, prepositional phrases, sen- 
tence length), and they were included in a 
formula which directly expressed readability 
in terms of the years of schooling required for 
easy reading of the passage (Lorge, 1944). At 
about the same time another widely used for- 
mula contained a different set of three factors 
— sentence length, personal references, and af- 
fixes (Flesch, 1943) . And another formula based 
on the number of words not on a certain list (of 
3000 words) and on average sentence length 
was constructed (by Dale and Chall, 1948) to 
overcome criticisms of Flesch’s formula, which 
was itself later divided into two facets of read- 
ability — “reading ease” and “human interest” 
(Flesch, 1951, pp. 6, 10). 

There is an important limitation to the ap- 
plicability of readability analyses. They have 
been validated against reading comprehension 
tests of general readers representing different 
educational groups. Hence, they apply to the 
general reader with general interest, and they 
do not take into account the effect upon read- 
ability of the reader's own special interests. For 
example, a poorly educated mechanic could 
comprehend some technical material which 
would prove too difficult for a welleducated 
businessman. Readability is a function of the 
reader's interest and experience, and this may 
seriously qualify its mechanical application. 
Furthermore, the formal categories in terms of 
which readability is normally measured do not 
take into account such factors as organization 
of the material, nature of the content, format, 
etc. (for a general criticism in these terms, see 
Kearl, 1948). 

To discover stylistic features. 'The problems 
of literary and oratorical style have attracted 
students since ancient times, and countless 
volumes have been written about them. What 
are the distinctive stylistic features of poetic dic- 
tion? How do writers differ in literary style? 
How have rhetorical devices changed over a 
period of time? When were particular works 
written, and by whom? Such questions have 
traditionally been analyzed by students of lit- 
erature in so-called "qualitative" terms. Buy 
Írom time to time, venturesome persons have 
experimented with the quantitative analysis of 
style. Such analysis has had an uneven recep- 
tion in the field, but on occasion it has made 
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important contributions to stylistic analysis and 
for some less complicated problems it is more 
or less standard. Consideration will be given 
first to its application to language and litera- 
ture, and then to rhetoric and oratory. 

Language and literature. There are four ma- 
jor areas in which content analysis is applied 
to differentiate stylistic patterns. The first is 
an entire language. Various linguistic studies 
have been made of different features of a whole 
mode of speech. For example, there is the analy- 
sis of the grammatical structure of American 
English, in both the standard and vulgar forms 
(Fries, 1940) . Certain recommendations about 
the teaching of English grammar are based up- 
on the frequencies with which patterns of usage 
appear in writing and speech. 

The second area of application differentiates 
the stylistic patterns of various periods. This 
has been a central concern of one of the most 
recent, and most extensive, applications of 
quantitative analysis to literary style, and spe- 
cifically to problems of poetic vocabulary (Miles, 
1943, 1946, 1951). The ratio of verbs to sub- 
stantives (nouns, plus adjectives) is taken as 
the indicator of a significant trend; “fairly 
steady rise toward an extreme, marked by ad- 
jectival quantity, and then a slower decline, 
moving toward moderation and not far beyond 
it..." (Miles, 1951, p. 520). ; 

The third area inwolves the differentiation of 
style by types of discourse. Certain stylistic 
features of types of written discourse have been 
investigated by means of quantitative analysis. 
For example, substantive-verb ratios are used to 
characterize different types of poetry (Miles, 
1951). The claim is, of course, that such crude 
measures are only indicative of more basic fea- 
tures of the poetry. 


“Substantival 
dominance” Type Examples 
2 to 1 “Metaphysical” Wyatt, Donne, Emer- 
son, Millay 
3 to l "Romantic" Wordsworth, Poe, 
Arnold, Yeats 
4 to 1 "Classical" Spenser, Dryden, 


Crane 


“The fourth area is the most particular, the in- 
dividual author. Content analysis has been ap- 
plied to the substantive meanings in literature 
as well as to its structural elements. Classifica- 
tion of the images in Shakespeare's plays illus- 
trates the wide range of figures upon which 
he drew and also provides a comparison with 
his contemporaries (Spurgeon, 1936); in addi- 
tion, the imagery of particular plays or groups 
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of plays clarifies the way the poet achieved his 
effect, thus: 

Play 
Early historical plays 


Dominant image 
growth and deterioration in 
a garden 


Romeo and Juliet light in a world of darkness 


Hamlet disease and sickness 

Macbeth “ill-fitting garments” 

Othello “animals in action, preying 
upon one another” 

Lear “human body in anguished 


movement” 


Two special cases of the quantitative analysis 
of formal stylistic elements, usually in poetry, 
are reasonably well accepted by literary schol- 
ars. One is the problem of disputed authorship 
and the other that of establishing the correct 
chronology of an author's works, These prob- 
lems are attacked through the analysis of such 
stylistic characteristics as type of rhyme, femi- 
nine endings, size of vocabulary, compound 
words, etc. Thus one scholar assigns the play 
Edward III to Shakespeare on the basis of the 
relative frequency of compound words (Hart, 
1984). Another, on the basis of vocabulary and 
sentence-length evidence, concludes that "the 
results . . . almost exclude the possibility of Ger- 
son as the author (of De Imitatione Christi) , 
but are entirely consistent with the authorship 
of Thomas à Kempis” (Yifle, 1941, p. 278). 

Rhetoric and oratory. Content analysis has 
been applied less frequently in the study of 
rhetorical and oratorical style. However, it is 
unexpectedly found in a study of the appear- 
ance in the Attic orators of types of rhetorical 
arguments from Aristotle's Rhetoric (Palmer, 
1934). The number of illustrations of the 
twenty-eight types ranges widely; for example: 


Type of Argument Number 

a fortiori several hundred 
the appeal to authority, the result more than a 

of a decision, consideration of hundred 

inducements and deterrents, and 

the juxtaposition of inconsisten- 

cies 
the argument from division, argu- 30 or more 


ment by induction and resulting 
deduction, and argument from 
motives 

exposure of inconsistencies, the re- from 10 to 20 

versal of an earlier choice, and 

the denial of an interpretation 


argument by identification and as- less than 10 
sociation, and the probability of 
improbability 

argument through ambiguity, and none 


play upon names 
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Other applications in this area have been 
made to political materials. One study of 145 
speeches made in presidential campaigns from 
1884 to 1920 disclosed that nearly 40% of 
them contained some class appeal (most fre- 
quently to labor), that about 25% of them 
contained an appeal to party loyalty, and that 
more time was spent in attacking the opposition 
than in pointing out the achievements of the 
speaker's own party (Hayworth, 1930). An in- 
tensive analysis of the style of Woodrow Wil- 
son's political speeches, after-dinner addresses, 
and sermons showed the relative frequency of 
rhetorical figures, with metaphor, rhetorical 
questions, and personification appearing most 
often (Runion, 1936). A more sophisticated 
and clever analysis of Wilson's style shows how 
his personality was expressed in such stylistic 
traits as "aristocratic affectations," “learned ad- 
dictions,” etc. (Hale, 1920) . 


PRODUCERS OF CONTENT 


To identify the intentions and other charac- 
teristics of the communicators. This has been 
a major use of content analysis. The basic logic 
of this application of content analysis is this: 
the content has such-and-such characteristics, 
therefore the communicators have such-and- 
such intentions. As in the case of cultural 
patterns, the relevant characteristics of the 
communicators are studied indirectly by this 
method only because the content is more readily 
available, 

Analysis of correspondence in content may 
support inferences about collaboration in in- 
tent. During the war, official government analy- 
ses, were made in order to reveal the extent 
to which the German and Italian propaganda 
ministries were cooperating, if at all (Berelson 
and de Grazia, 1947). The correspondence be- 
tween the Berlin and Rome radios was measured 
in several ways — by the amount of attention 
given to particular events, by the broad alloca- 
tion of radio time, by the treatment of news 
and comment, by the similarity and the 'adop- 
tion of propaganda themes, and by identifica- 
tions of the self, In every case, inferences about 
collaboration were negative. “There was neither 
general nor specific, neither directed nor imi- 
tative correspondence. The strong probability 
is that no collaboration existed . . . While 
these content analyses did not prove the exist- 
ence of noncollaboration (a feat logically 
impossible on the basis of available data) , they 
did make it clear that a close, operating rela- 
tionship between the German and Italian 
propaganda ministries was extremely unlikely" 


| 


—— 


CONTENT ANALYSIS 


(Berelson and de Grazia, p. 253). As a matter 
of fact, information secured later by occupation 
officials established the accuracy of this in- 
ference. 

Different images of Stalin held by top mem- 
bers of the Soviet elite, as expressed in birthday 
speeches of December, 1949, may be interpreted 
to reveal something of their attitude toward 
him (Leites and others, 1951). Actually there 
was a sharp difference between the conceptions 
of the old Bolsheviks and those of other mem- 
bers. 


STALIN: LENIN’S 
PUPIL OR EQUAL? 


Bolshevik Popular 


SPEAKERS Image Image 
Molotov, Malenkov, Beria 22 3 
Shvernik, Voroshilov, 

Mikoyan, Andreyev 7 17 
Bulganin, Kosygin, 

Khrushchev, Kaganovich 1 19 

Although the communicators are much 


“closer” to the production of content than the 
“cultural patterns” of another section, never- 
theless it is quite possible for inferences about 
them to be incorrect when based on the results 
of content analysis alone. Interpretations of 
motive are always difficult, and there is nothing 
in the. procedure of content analysis which 
guarantees against mistaken interpretations. A 
single piece of content may be — and ordi- 
narily is — consistent with more than one 
hypothesis about the characteristics or inten- 
tions of the communicator. There have been 
few studies which have attempted empirically 
to establish the relationship between the com- 
municator and the content on the basis of 
independent material, i.e., to test the inferences 
based upon content analysis. 

To*determine the psychological state of per- 
sons and groups. A basis for inferences dealing 
with the psychological state of individuals and 
groups is provided by analysis of symbol ma- 
terials produced by the persons about whom 
inferences are drawn. (This should be clearly 
distinguished from the analysis of symbol ma- 
terials designed for the persons about whom 
inferences are drawn.) This involves the analye 
sis of the social work or the therapeutic 
interview, of personality tests, of fictional crea- 
tions, or of letters, diaries, or other personal 
documents. 
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In recent years there has been an increase in 
the application of quantitative methods to the 
analysis of psychological documents. According 
to a review of the literature in this field, a study 
quantifying “the structure of the personality 
of one individual life on the basis of a collection 
of personal documents . . . is of considerable 
importance . . . It sets forth a new conception 
of a ‘population’ for statistics: a population of 
events and traits within the boundaries of one 
person. If the initial promise of this method is 
fulfilled it will supply an important bridge be- 
tween the statistical and the clinical points of 
view” (Allport, 1942, pp. 35-36) . 

One clear application — although the quan- 
tification is not always systematic, or even ex- 
plicitly present — is in the field of projective 
tests (Rorschach, TAT, word association, etc.). 
The personality traits of the subject are inferred 
from the communications produced by them in 
response to ambiguous stimuli (an ink blot, a 
picture, an incomplete sentence). The atten- 
tion given by the subject to form rather than 
color, or to the whole rather than the separate 
parts, or to aggressive rather than peaceful 
plots provides the analyst with the data on the 
basis of which he characterizes the subject in 
psychological terms. Usually only the presence 
or the absence of the content score is necessary 
to support the inference, but on occasion quan- 
tities are also brought into play. Thus it can 
be significant if the subject returns to certain 
topics (e.g., sexual references) , or if he sees the 
same things in different stimuli, or if he elabo- 
rates at some length on a particular matter (for 
example, see Lindner, 1950). In any case, a 
psychological description of the subject is based 
upon analysis of the communications produced 
by him, The reader may obtain an excellent 
view of systems of content analysis currently 
applied to the Thematic Apperception Test in 
a recent volume edited by Shneidman (1951). 
Some impression of the use of content analysis 
with other projective techniques may be secured 
by consulting Abt and Bellak (1950) or Ander- 
son and Anderson (1951). 

An early suggestion for objectifying the psy- 
choanalytic interview through content analysis 
lived a representative set of categories into 
which the content might be classified and 
“which it is predicted will aid in the orderly 
comparison of interview data" (Lasswell, 1938, 
p. 199). The broad headings of this proposal 
are: 


Self Self 
Pro- or anti- { Interviewer by {Interviewer 
Other Other 
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Some of the possible combinations are illus- 
trated by these formulations: 


"I am proud of my rec- — Pro-self by self 
ord” 


“You insulted me” — Antiself by inter- 


viewer 
"He said you were a — Anti-interviewer by 
fraud" other 


"He loved her" 


For psychiatric purposes, the basic classification 
is thus stated in terms of self and nonself, ap- 
proval and disapproval. In addition, refine- 
ments are suggested by forms of indulgence and 
deprivation and by patterns of internalized and 
externalized action. "Interviewers grow accus- 
tomed to expect characteristic constellations 
when there are strongly marked peculiarities in 
the associative style of the subject" (Lasswell, 
p. 204). No application of these particular 
categories to psychoanalytic interviews has 
been published, and thus their utility remains 
an open question. At the same time, it is clear 
that many psychoanalytic propositions are based 
upon data which can be analyzed by content 
analysis. 

Less formal (and usually nonquantitative) 
studies of fictional materials to reconstruct the 
personality structure of the author have been 
based primarily on psychoanalytic categories. 
Assuming that the writer cannot fail to express 
deepest feelings in his art creations, the analyst 
proceeds to uncover them by attention to al- 
legedly revealing particulars in the fictional 
works themselves. 

The novels of Charlotte Bronté show evi- 
dence of an Oedipus theme; the chief concern 
of her sister Emily was with the emotional 
relations of children. Charlotte's novels, with 
their decrease of subjectivity and reduction of 
conflict, indicate a process of emotional inte- 
gration which Emily did not achieve (Mc- 
Curdy, 1947). The latent fascism of Knut 
Hamsun was betrayed by the authoritarianism 
of his novels, the acceptance of natural forces 
as unchangeable, the combination of brutality 
and sentimentalism, the motherly and house- 
wifely role ascribed to women, the importance 
of the law of rhythm (Lowenthal, 1949). The 
novels of D. H. Lawrence reveal three stages 
of sexual development — Oedipal relations to 
his mother in the early novels, then homosexual 
relations, and finally the achievement of hetero- 
sexual relations (McCurdy, 1939). Dostoevsky's 
treatment of women in his novels reveals his 
oral character structure and his aggressiveness 
toward women (Kanzer, 1948). Charles Kings- 
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— Pro-other by other 
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ley's Water Babies constitutes a renunciation of 
the physiological process of procreation and 
indicates his need for purification (Deutsch, 
1947). 

Occasionally such analysis has been applied 
to single poems (as in the Wormhoudt, 1949, 
investigation of the unconscious sources used 
by the romantic poets in England) and even 
to the explanation of the motives of artistic 
characters (e.g., Jones, 1947, on Hamlet; and 
Kris, 1948, on Prince Hal). Of such investiga- 
tions, students of literature point out the in- 
validity of treating the work of art in any but 
its own terms — i.e., in terms of its own require- 
ments and forms, rather than the author's 
biography — as well as the tendency to over- 
took the effect of the social milieu and literary 
traditions of the times. Content analysts point 
out the unsystematic and highly selective nature 
of the evidence usually presented to support the 
central hypotheses, and the absence of a sen- 
sible comparative control. Occasionally, how- 
ever, such studies do develop stimulating 
insights which may enrich one's appreciation 
of the total art product of an author. 

Some analytic applications have been made 
directly to psychological materials, For ex- 
ample, one study in "personal structure analy- 
sis" classified a series of letters written by a 
single person into several topical (e.g. self, 
money, death, religion) and attitudinal (e.g., 
moral, favorable, lazy) categories, and investi- 
gated their frequencies and particularly their 
contiguities (Baldwin, 1942). The resulting 
data indicated certain correspondences which 
allegedly represented the principal ideational 
clusters in the subject's life. In validating this 
quantitative analysis, by comparing its results 
with the impressionistic interpretations of 
trained psychologists, the author concludes: "If 
used in conjunction with other information 
about the case, the method confirms judgments 
already formed, adds some new insights, and 
presents problems whose solution leads to fur- 
ther insight. Even if used by itself, the table 
of correspondences is able to communicate 
considerable information to an observer" (p. 
181). Thus this method of analysis not only 
"offers a technique for studying the unique 
pattern of the individual personality, an aspect 
heretofore not subject to objective analysis, 
(but) it points the way to a quantitative, sm 
tematic, objective method of clinical interpre- 
tation" (p. 183). The same basic method, 
though with certain statistical refinements, was 
applied to the analysis of the personal diary 
of a girl adolescent. (Andrews and Muhlhan, 
1943). From a frequency-contingency table 
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including some twenty categories, the authors 
matched those frequency patterns with high 
degrees of correspondence. These groups were 
then taken to represent “congruent idea pat- 
terns” — a goal-identification factor, an in- 
terest-world factor, a socio-sexual immaturity 
factor, and so on. “We do not believe the 
technique to be of such power that other as- 
pects of analyzed material should be ignored, 
but we do believe that the technique offers a 
device through which new and important in- 
sights into the hidden meaning of personality 
structures can be obtained” (p. 109). 

This procedure is not limited to the analysis 
of personal documents in this exclusive sense; 
it has been applied, for example, to autobio- 
graphic materials of some literary standing. 
Thus Richard Wright's book Black Boy, an 
eloquent statement of the growth of a sensitive 
Negro boy in the American South, has been 
subjected to a detailed value analysis which 
attempts to systematize and categorize the pic- 
ture of the world presented (White, 1947) . 
Presumably under the tacit pressure of justify- 
ing the application of such methods to this 
material, the author actually organizes his re- 
port in terms of the findings which support 
the impressions of reading and those which 
modify them. Among the first, for instance, are 
the high frustration-satisfaction ratio and 
Wright's hostility to his father and ambivalence 
toward his mother. Among the latter are the 
primary emphasis upon physical safety, the lack 
of emphasis upon social goals, and the large 
amount of 100% disapproval of the people 
around him. In most studies of this kind, deal- 
ing with a relatively limited body of material, 
analyses of such a precise, quantitative, cate- 
gorical nature probably add little to attentive 
reading and nothing to careful study. In most 
cases, the formal content analysis is not worth it. 

Finally, the method has been useful in sub- 
stituting systematic and objective data for “im- 
pressions” and “interpretations.” One example 
is the application to the analysis of group inter- 
actiors, mentioned above (Bales, 1950). An- 
other is the application to the case records of 
social work clients. 

The change in tension level throughout the 
period of a client's treatment was charted in 
one case on the basis of the “DRQ,” that is, the 
discomfort-relief quotient (Dollard and Mow- 
rer, 1947). This quotient is simply the figure 
resulting from the division of the number of 
discomfort words by the number of discomfort 
plus relief words combined, computed at dif- 
ferent stages of the case. With a technique of 
this general kind, it may be possible not only 


to identify the specific points of high or low 
tension on the part of the client but also to 
measure the general trend in his tension levels 
from the beginning to the end of the case. In a 
similar case, social work records were analyzed 
primarily in order to compare the problems 
presented at different offices of the same agency 
(Berelson, 1947). The case records were classi- 
fied in some 80 distinct sets of categories 
referring to various aspects of the “causes,” 
conditions, and consequences of the case. From 
this analysis, for example, it appeared that the 
more intensely the client feels about his prob- 
lem, the more likely he is to despair of a solu- 
tion. In the same way, the analysis established 
a positive correlation between the client's atti- 
tude toward agency service and his degree of 
confidence in the solution of his problem. 

From experience with this use of content 
analysis, it appears that the procedure is more 
difficult and time-consuming when “deeper” or 
more complicated symbol materials and mean- 
ings are under analysis, and thus in many cases 
is not worth doing. That the social work case 
record has been usefully treated in this manner, 
most students would agree (and so would many 
social workers). But there would be less agree- 
ment about the value of studies of subtle and 
complex psychological problems through quan- 
titative symbol analysis. 

To detect the existence of propaganda (pri- 
marily for legal purposes). The analysis of 
propaganda techniques, wherever they occur, is 
one thing: the analysis of the existence of 
propaganda is another. This application of 
content analysis, particularly associated with 
Harold Lasswell, was used during World War 
II by the Department of Justice in its investiga- 
tions of allegedly subversive organizations and 
their publications. 

At bottom, the procedure here is quite simple. 
Comparison of the content under study with 
that of other content explicitly identified with 
a certain propaganda position, or in some cases 
with explicitly formulated standards, serves to 
reveal the existence of propaganda. During the 
war, for example, parallels between allegedly 
native-fascist propaganda and the output of the 
Nezis were admitted in evidence by the courts 
in connection with sedition trials, and they con- 
tributed to convictions, At the same time, 
"many 'suspect' papers and radio stations were 
absolved from suspicion by the results obtained 
when issues and broadcasts were quantitatively 
analyzed. They were protected from the all too 
frequent practice of condemning an entire pub- 
lication on the basis of a few colorful examples 
of bias in news handling or editorial comment" 
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(Lasswell, 1946, p. 76). In the same vein, the 
admission of content analysis data in courts of 
law is said to stem from “the possibility of error 
arising out of . . . assumptions and subjective 


reactions and predispositions . . .” (Content 
Analysis, 1948, p. 911) . 

This application of content analysis is based 
upon several “tests” described in Lasswell, 1949 
(pp. 177-178) : 


Test 


Avowal 


Parallel 


Consistency 


Presentation 


Source 


Concealed source 


Explanation 


"Explicit identification with one side 
of a controversy" 


"The content of a given channel is 
compared with the content of a known 
propaganda channel" (by themes) 


"The consistency of a stream of com- 
munication with the declared propa- 
ganda aims of a party to the contro- 
versy” 


“The balance of favorable and unfa- 
vorable treatment given to each symbol 
(and statement) in controversy” 


“Relatively heavy reliance upon one 
party to a controversy for material” 


“The use of one party to a contro- 
versy as a source, without disclosure” 


Distinctiveness “The use of vocabulary peculiar to 
one side of a controversy” 
Distortion “Persistent modification of statements 


on a common topic in a direction favor- 
able to one side of a controversy” 


Illustration 


Of 243 self-advertisements of the Mos- 
cow News, about half referred to the 
authoritativeness of the publication 


In Pelley’s publication The Galilean, 
there were 1195 statements consistent 
with Nazi themes and only 45 against 
them 


The percentages of editorials and ar- 
ticles in the Auhagen publications 
Today’s Challenge and Forum Observer 
consistent with the Nazi aim "to exposé 
the weakness and decadence of the ene- 
mies of Germany" were 71% and 57%, 
respectively (as compared with 6% for 
the Readers Digest and 1095 for the 
Saturday Evening Post) 


In the Moscow News there were 1908 
items favorable to the U.S.S.R. and 3 
unfavorable, 69 favorable to other coun- 
tries and 328 unfavorable 


About 2095 of the material in the 
Moscow News came from acknowledged 
governmental and party sources 


(No general evidence) 


Among the most frequently used po- 
litical terms in the Moscow News were 
such distinctive symbols as commissariat, 
Soviets, masses, Stakhanovism, etc. 


Of the items in Transocean but not 
in the New York Times, 58% were con- 


Such tests, then, state explicitly the characte;s- 
tics which are used to detect the presence of 
propaganda. 

This application of content analysis is not 
without its critics and questioners. On the legal 
side, the problems of clarity, reliability, subjec- 
tivity, and significance are raised (Content 
Analysis, 1948). The logical difficulties in- 
herent in distinguishing correspondence (in 
content) from collaboration (in intent) are 


sistent with Nazi views and only $96 
were contradictory. Of the Transocean 
omissions, only 16% were consistept and 
48% were contradictory 


formidable, and there may be serious misin- 
terpretations in getting from one to the other 
(on this problem, see Berelson and de Grazia, 
1947) . 

To secure political and military intelligence. 
Stimulated particularly by the requirements of 
World War II, there has been a good deal of 
effort devoted to securing intelligence data of 
various kinds from the study of communication 
content. How is the enemy population reacting 
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to our propaganda? What is the state of their 
morale? What are the enemy's expectations as 
to our next military moves? What are his 
evaluations of our military situation? What 
will he do next? Answers to such questions, 
often on quite crucial political and military 
matters, have been sought by means of content 
analysis. 

Strong claims have been made for this func- 
tion of content analysis: “By analyzing the 
content of what appears in the channels of com- 
munication in totalitarian powers, we may im- 
prove our predictions about the next move to 
be made by the controllers of policy in those 
states. Already some spectacular successes have 
been claimed by students of communication. 
Some of the analysts connected with the British 
Broadcasting Corporation have reported that 
they were successful in making certain predic- 
tions about the military plans of Germany” 
(Lasswell, 1942a, pp. 110-111). However, until 
very recently — until the forthcoming impres- 
sive review of this function of content analysis 
by Alexander George, which deals with the war- 
time experience of the Analysis Section of the 
Foreign Broadcast Intelligence Service of the 
Federal Communications Commission — there 
has been little scientific scrutiny of this whole 
field. 

The record of governmental work during 
World War II, as analyzed by George, contains 
a number of successful predictions of major 
military actions. Perhaps the most dramatic 
of all was the case of the British predictions 
about the German V-weapon. At a time (fall, 
1943) when there was some doubt whether the 
Germans had a secret weapon and, if so, what 
it was, the British analysts made a series of 
remarkably accurate inferences not only specify- 
ing the possession of the weapon but also its 
general type and even a succession of dates 
crucial in its development and use. The bases 
for their analysis were the generalizations pro- 
vided by prior studies that “. . . German (home) 
propaganda never deliberately misled the Ger- 
man apts in questions involving an increase 
of German power” and the reasoned assump- 
tion that “. . , Goebbels would be careful not 
to give the German public a promise of re- 
taliation too far ahead of the date on which the 
promise could be fulfilled” (lest it boomerang 
into popular disillusionment). On the other 
occasions, content analysts correctly inferred 
that the Germans did not plan to move intó 
Spain and Spanish Morocco in March, 1943; 
that they did not plan a major offensive on 
the Russian front in 1943; that they planned 
to withdraw from the Kuban bridgehead on the 


Eastern front; that they did not expect a high 
level of success in submarine warfare in 1948. 


AUDIENCE OF CONTENT 


To reflect attitudes, interests, and values 
(“cultural patterns") of population groups. 
Content analysis has been used to get at “the 
spirit of the age." On the assumption that com- 
munication content "expresses" or “reflects” 
some sort of Zeitgeist, some students have used 
it as an index to attitudes, interests, values, 
mores of the population. Thus the problem of 
discovering the topics people were interested in 
during a certain period or what they thought 
about various matters is studied by determining 
the nature of the content of popular communi- 
cations at the time, and changes in such cultural 
patterns are studied in the same way. 

"There are two characteristics of this use of 
content analysis which need to be clearly recog- 
nized. The first is that inferences about popu- 
lation groups are made on the basis of content 
produced for them and not by them. Thus 
cultural patterns are derived from analysis of 
communications widely distributed in the com- 
munity. The second point is that such problems 
are investigated in this way — i.e., indirectly — 
precisely because it is not possible to attack 
them directly. If we wish to study attitudes and 
interests and values today, we have certain sci- 
entific tools reasonably adequate for the job — 
or at any rate, more adequate than this second- 
level approach. But we cannot use them to 
study a historical people or a foreign people 
from whom we are barred; for such groups the 
indirect approach is necessary. However, the 
fact that it is the only way to get at the problem 
should not obscure the other fact that it is at 
best a substitute for the real thing. Both these 
points, incidentally, raise problems of validity 
to which we shall return later. The reader will 
find in Chapter 26 a discussion of the difficulties 
encountered in the study of national character, 
many of which are related to the present topic. 

"There are any number of examples of such 
studies in the literature. The "changing social 
attitudes and interests" in the United States 
from 1900-1930 were identified through selected 
aralyses of popular magazines (Hart, 1933). 
For example, the general decline in the status 
of religion and religious sanctions during this 
period is documented not only by the amount 
of attention given to the topic but also by the 
decline of favorable references to organized re- 
ligion. Such trends in the treatment of religion 
in popular magazines are taken to reflect a gen- 
eral weakening in the position of religion among 
the population at large. 
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A shift in popular concern, during this cen- 
tury, from the “idols of production” to the 
“idols of consumption” is allegedly indicated 
by the distribution of the subjects of biographi- 
cal sketches in certain popular magazines (Low- 
enthal, 1943). Not only was there an increase 
in entertainers at the expense of politicians, but 
the entertainers themselves were drawn less and 
less from the serious arts. Substantially the 
same procedure has been used to infer the state 
of public morale from the communications de- 
signed by propagandists who have both totali- 
tarian control of the media and, presumably, 
accurate and direct knowledge about morale 
itself. “Enemy communications are continually 
used for the purpose of estimating the state of 
morale among enemy peoples. This is the study 
of content as a basis of inference about the 
changing state of attitude toward war, and espe- 
cially the intensity of the determination to 
carry on to victory" (Lasswell, 1946, p. 80). 
However, despite the great amount of work 
done on wartime propaganda, no quantitative 
index based upon formal categories has been 
devised which will reliably reflect popular 
morale when applied to propaganda materials. 
Totalitarian propaganda has also been used as 
a basis for the construction of a certain per- 
sonality syndrome (the ‘‘compulsive character” 
of psychoanalytic theory) as a “distinctive type 
of character structure in the Nazi variant of 
German culture” (Kecskemeti and Leites, 1947, 
1948a, b). Such psychological “traits” as revolt 
and submission, order and disorder, guilt and 
self-righteousness, doubt and dogmatism were 
documented from the propaganda material on 
the assumption that it could validly be used as 
an index of character structure. 

Such studies, then, employ communication 
content as an index to various cultural patterns 
which they find “reflected” in the content. But 
how valid is this procedure? Can communica- 
tion content properly be taken as a popular 
mirror of the time? Certain cautions on this 
application of content analysis must be men- 
tioned. 

In the first place, it is important to be com- 
pletely clear about just what is being “reflected” 
in the communication content. Is it the gen- 
eralized cultural and psychological level of the 
total adult population which somehow finds 
expression through the media? Or do the in- 
ferences from content refer only to the audience 
to the particular communications under analy- 
sis? If the latter, then the problem of changes 
in the composition of the audience in trend 
studies of content must be faced: that is, the 
changes in content may reflect changes in audi- 


ence membership rather than changes in psy- 
chological properties of constant population 
groups. Along the same line, do the results actu- 
ally refer to the psychological categories pro- 
posed or to some other less subtle aspects of 
reality? For example, does the change in the 
biographical subjects of popular magazines 
reflect a change in the public's attitude toward 
popular heroes any more than it does a change 
in the occupational distribution during a period 
in which commercialized sports and entertain- 
ment became big business in this country (in 
which case content data are much less satisfac- 
tory than census data on occupations)? Such 
questions must be answered in the particular 
study for clarity not only of interpretation but 
also of the analysis itself. 

The basic rationale for the use of content 
data to permit noncontent inferences of this 
kind has been mentioned in the literature. 
Here, for example, are two such arguments: 


The common day-dreams of a culture are in 
part the sources, in part the products of its popu- 
lar myths, stories, plays, and films. Where these 
productions gain the sympathetic response of a 
wide audience, it is likely that their producers 
have tapped within themselves the reservoir of 
common day-dreams. Corresponding day-dreams, 
imperfectly formed and only partially conscious, 
are evoked in the audience and given more defi- 
nite shape. (Wolfenstein and Leites, 1950, p. 13) 


Our first assumption in this study is that popu- 
lar drama can be regarded as a case of "social 
fantasy" — that the psychological constellations in 
the dramatic work indicate sensitive areas in the 
personalities of those for whom the work has 
appeal; their needs, assumptions, and values are 
expressed ("projected") in the drama. The suc- 
cessful play must be attuned to the audience. 
There can be no claims that dramatic material 
reflects the total personality of the individual who 
MEN it. (McGranahan and Wayne, 1948, p. 

d 


Thus this use of content analysis assumes that 
the nature of the audience about which the 
inferences are made is at least a, and perhaps 
the, major determinant of the content which is 
being analyzed. 

But this is clearly not always, or perhaps not 
usually, the case. Content is produced by par- 
ticular agents (writers, producers, directors, 
editors, etc.) and it is often their conception 
of audience attitudes and values, biased as it 
may be, which determines what appears. Not 
only that; communication content is also sub- 
ject to the pressure of special interest groups 
which may have enough weight to determine, or 
at least influence, the nature of the content. 
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For example, a behind-the-scenes campaign by 
the Catholic Church to “purify” popular publi- 
cations on sex issues could have effected a major 
change in the appearance of such material with 
no corresponding and certainly no prior 
changes in popular attitudes. This kind of limi- 
tation is recognized by some writers in this 
area (e.g, Hart, 1933, p. 386) and directly 
argued by others: 


In part the recurrent patterns . . . may be con- 
sciously contrived . . . But both the positive 
sanctions of the box office and the negative ones 
of the code are probably indicative of more or 
less widely diffused feelings and attitudes in 
American culture. In part also, the selection of 
certain themes and omission of others would seem 
to be less calculated, expressive of deeper and less 
conscious emotional tendencies. Within the limit 
of a code or a general plot formula there remains 
the choice of a considerable range of possibilities. 
(Wolfenstein and Leites, 1950, p. 14) 


For several reasons, then, the mechanical ap- 
plication of content analysis to problems of 
this kind should be avoided. It is difficult to 
know under what conditions inferences can be 
validly drawn about the total population or 
only about the particular audiences; to what 
extent they refer to the audience proper or to 
the producer's conception of the audience or 
to the producers themselves as (atypical) mem- 
bers of the audience; whether they legitimately 
apply to psychological variables in this sense; 
whether they are correctly based upon a concep- 
tion of audience characteristics as a source of 
the content or as an effect of it or both; whether 
popular values are somehow identifiable, how- 
ever roughly, as a direct quantitative function 
of content emphases. In short, the whole rela- 
tionship between the content and audience 
characteristics allegedly "reflected" in it is far 
from clear. At the least it is far from a one-to- 
one relationship, and this fact in itself is a 
cautionary note against the over-interpretation 
of content data. 


H 
EFFECTS OF CONTENT 


To reveal the focus of attention. Among the 
clearest effects of the mass media in modern 
society has been their extension of people's 
attention areas, Whereas in simpler societies 
people were limited to matters which came to 
their or their neighbors' direct attention, the 
proliferation of technical means of communic#- 
tion has resulted in bringing the whole world 
into the ordinary man's focus of attention. In 
a sense, every content study is an analysis of 
what comes to people's attention. Some studies, 
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however, are more or less limited to this use of 
content analysis. They have dealt with dif- 
ferent media, with widely differing problems, 
and with different analytic methods. 

One simple form of such a study, particularly 
associated with Lasswell, is illustrated in his 
World Attention Survey, which deals with the 
occurrence of selected political symbols in vari- 
ous important newspapers. Analysis of the 
relative frequency of such symbols reveals, it is 
claimed, that "the stream of public attention 
is related to policy . .. In the Voelkischer 
Beobachter . . . we see less and less attention 
paid to the Soviet Union, previously a target of 
bitter hostility. This indicated that Germany 
was clearing the path for a sudden change in 
diplomatic orientation, as was learned when 
the (Soviet-German) pact was announced" 
(Lasswell, 1941, p. 457) . 

Another form of content analysis has to do 
with the subject matter distribution of what 
appeared in the selected media of communica- 
tion. For example, the average percentage of 
foreign news in American morning newspapers 
in 1927 was shown to range from a low of about 
2% to a high of about 9% (Woodward, 1930) . 
Even the personal conversations of a group of 
people were distributed in subject-matter cate- 
gories in order to identify the major differences 
in what men and women talk about: the former 
concentrated upon business affairs and the lat- 
ter upon personal affairs (gossip) and clothes 
(Landis and Burt, 1924) . 

Such analysis is not limited simply to the 
relative distribution of different symbols or sub- 
ject matters. Some studies have added direction 
Oe, approval or disapproval) of the material 
as well. For example, the relative approval or 
disapproval of various educational topics in 
newspaper editorials indicates the areas in which 
public education is given most public praise 
and public criticism, Thus although the pub- 
lic's responsibility for supporting education is 
almost everywhere accepted, there was wide- 
spread disapproval of costs involved (Foster, 
1938) . 


PERCENTAGE OF 


t EDITORIAL COMMENT 
Unfavor- 
Favorable Neutral able 
Public responsibility for 
supporting education 90 2 
Cost of education 8 15 77 


Another relevant category in the study of 
attention areas is the source of communication 


e LI 


506 


material. For example, a study of the origin of 
war news in the New York Times during an 
early period of the first World War revealed 
that the Central Powers supplied less than five 
percent of the front page stories as against fully 
seventy percent supplied by the Entente Powers 
(Foster, 1937). Thus there was nothing like a 
balance in the war news provided American 
readers in terms of the sources of the material 
coming to their attention. 

Content analysis has also been applied to the 
determination of attention areas in fictional ma- 
terials in the mass media. For example, the 
major characters in popular motion pictures 
have been described by these procedures, About 
46% of the major characters in feature films are 
either wealthy or well-to-do, and only about 
17% of them are poor or destitute (Jones, 
1942). Similarly, about 68% of the characters 
wanted love as a major value as against only 
about 9% who wanted “rightness” or some 
other social value. And almost two-thirds of 
the characters have all of their desires fulfilled 
at the end of the picture. A more intensive 
and a more imaginative analysis of motion pic- 
tures included such categories as the general 
themes of the films, personal characteristics of 
the cast, kinds of clothes, circumstances of 
meeting and lovemaking, crime, vulgarity, rec- 
reations, and goals (Dale, 1935). In the same 
way, the world as it appears in the daytime 
serial has been described in great detail. Here 
again, for example, the major characters enjoy 
a high occupational status (much higher than 
that of the typical listener) and the major 
themes deal overwhelmingly with problems of 
personal relations rather than public affairs 
(Arnheim, 1943) . 

In any general analysis of the role of commu- 
nications in forming people's opinions and atti- 
tudes, it is important to stress the factor of at- 
tention. The self-selection of what will be read 
or listened to means that attention is not auto- 
matic and hence not always in correspondence 
with communication content. In this kind of 
investigation, content analysis plays its part in 
presenting an objective and systematic picture 
of the communication content which is avail- 
able for public attention, whatever the actval 
exposure. 

To describe attitudinal and behavioral re- 
sponses to communications. The culminating 
factor in the study of communications is their 
effect upon individuals, groups, institutions, 
society. Effects can be short-run or long-run; 
they can be immediate or remote; they can 
reinforce or convert; they can deal with in- 
formation, knowledge, opinions, taste, skill, 
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"behavior." Under certain conditions, content 
analysis can contribute to the investigation of 
effects. 

There are three ways in which content analy- 
sis has been employed in the study of effects. 
The first involves the analysis of communica- 
tions which are themselves expressions of the 
effect of previous communications. In other 
words, the analysis deals directly with the ma- 
terial as response, instead of, as in most cases, 
with communications as stimuli. 

Although there are relatively few examples 
of this type of effect analysis, it is worth noting 
as a possibility for further exploration. Two 
cases in which it was applied dealt with the re- 
ception of scientific work by the intellectual 
community. The acceptance of Freud's dream 
theory, as an instance of his total contribution, 
was a good deal less than general in the 350-or- 
so psychology textbooks analyzed (Herma, 
Kris, and Shor, 1943). And the writers who 
accepted the theory identified it as Freud's less 
than those who rejected it. In the other case, 
the reception of The American Soldier in 
scholarly and popular journals was analyzed 
into a typology of "affective responses" descrip- 
tive of the scholarly images and allegiances 
elicited by that work (Lerner, 1950). The rea- 
son there are not more such direct content 
analyses of effect is obvious: except in such lim- 
ited forms as textbooks and book reviews, the 
response material is simply not available. 

The second type has also had limited appli- 
cation as yet, largely because of technical diffi- 
culties involved in its use. It is the direct 
correlation of content data and response data, 
i.e., studies in which the frequencies with which 
certain categories appear in the content are 
related statistically to the frequencies of re- 
sponse in the audience. Such a relationship was 
attempted in an investigation of the effect of 
the content emphasis of selected political argu- 
ments upon the recognition and the acceptance 
of the arguments (Berelson, 1942). Although 
frequency in the content did determine to a 
large extent the recognition of the arguments, it 
was by no means the major influence upon 
agreement. The correlation between content 
emphasis on the arguments and agreement with 
them was small because of the role of predispo- 
sitions in determining agreement, However, it 
was demonstrated that content emphasis de- 
creases indecision and that it tends to regulate 
relative agreement among arguments “within” 
the influence of predispositions. 

Thirdly, there is the direct inference of effects 
from content data, without reference to data 
on response itself. This is probably the most 
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frequent application of content analysis to ef- 
fect problems. In fact, there are a large number 
of content analyses which simply rephrase their 
results in effect terms — mistakenly, in view of 
the role of predispositions and other factors in- 
volved in the situation. More sophisticated 
studies of content interpret responses against 
the psychological characteristics of the audi- 
ence. For example, the effectiveness of Com- 
munist propaganda in Chicago during the 
depression of the early 1930's was evaluated on 
the basis of a systematic analysis of its themes 
and appeals (Lasswell and Blumenstock, 1939) . 
The authors point out the relatively high pro- 
portion of key symbols which were distinctive 
to Communist propaganda; “thus the nucleus 
of Communist propaganda was composed of 
symbols which were counter-mores in America” 
fp. 352) . As time went on there was a declining 
emphasis upon certain distinctive symbols and 
thus a somewhat greater degree of adaptiveness 
in Communist propaganda which presumably 
increased its effectiveness. Again the content of 
Communist propaganda was not particularly 
skillful in combating nationalistic and indi- 
vidualist symbols. “Despite some favorable 
circumstances, Communist propaganda was 
blocked by American nationalism and individu- 
alism and thus contributed more to the cathar- 
sis of insecurities than to the readjustment of 
American symbols and practices. It made only 
a modest contribution even to the appearance 
in America of the revolutionary pattern of the 
epoch" (p. 357). 

Here the content analysis of Communist 
propaganda provided the basis for a psychologi- 
cal interpretation of its effect upon the re- 
cipients. 'This same method was used in a 
"sociological" analysis of the effectiveness ot 
the round-the-clock bond drives by Kate Smith 
during World War II (Merton, 1946) which 
puts the content data in a framework of re- 


sponse data. Fully half of the Smith propa- 
ganda to promote sales was composed of the 
sacrifice theme (sacrifice on the part of the 
serviceman, the civilian, herself) ; and the in- 
terpretation of Smith's effectiveness derives 
partly from the prominence of that theme. “The 
triangulation of sacrifice . .. developed in many 
listeners a strong sense of unworthiness and 
guilt. They felt compelled to do something 
more to keep self-esteem . . . After the pur- 
chase, there was strong evidence of catharsis. 
Duty had been done and guilt swept away” (pp. 
54-55). 

In a psychological context, finally, the effect 
upon third-grade school children is implicit in 
an analysis of the values presented in their 
readers (Child, Potter, and Levine, 1946). The 
types of behavior rewarded in the books is 
assumed to have a guiding effect upon the be- 
havior of the child, who may model himself, 
consciously or unconsciously, upon this ma- 
terial. 


Percentage in 


Mode of acquisition which rewarded 


Effort 93 
Buying, selling, trading 80 
Asking, wishing, taking what is 

offered 68 
Dominance, aggression, stealing, 

trickery 4l 


Just as the effect area in communication re- 
search is of major importance (see Chapter 28), 
so this application takes on central importance 
in the development of content analysis. More 
refined procedures and careful establishment of 
relationship between content and the specific 
type of effect under investigation are necessary 
in order to achieve desirable advances in the 
scientific study of the effect of communications. 


Tue Units or CONTENT. ANALYSIS 


By definition, content analysis calls for the 
quantification of content elements. As a basis 
for counting relevant aspects of the content, 
certain standard subdivisions of it must be used. 

Before specific units are discussed, three tech- 
nical distinctions must be mentioned. First is 
the distinction between the recording unit and 
the context unit. The recording unit is "the 
smallest body of content in which the appezt- 
ance of a reference is counied" (a reference is 
a single occurrence of a content element). The 
context unit is “the largest body of content that 
mar be examined in characterizing a recording 


unit." For example, the recording unit may be 
a single word or term, but in order to note 
whether the particular term was treated favor- 
ably or unfavorably the analyst will be in- 
structed to consider the entire sentence in which 
the term appears (the context unit). Or a sen- 
tence may be the recording unit and the para- 
graph the context unit; thus the paragraph 
would be taken into account in coding the sen- 
tence (Lasswell, 1942, p. 16). Second is the 
distinction between the unit of classification 
and the unit of enumeration. This involves 
simply the distinction bétween the basis on 
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which the content is analyzed or classified and 
the basis on which it is tabulated or reported. 
Thus an analyst may classify an entire news 
story into a given category, and then report the 
result in terms of column inches. Here the 
news item is the unit of classification and the 
column inch is the unit of enumeration (Wood- 
ward, 1934). Third is the related distinction 
between levels of analysis, in which a prior 
analysis of a subsidiary unit characterizes the 
larger (parent) unit for later analysis. For ex- 
ample, classification of sentences may serve to 
characterize political articles as “biased,” de- 
fined as meaning greater than 75-25 distribution 
of support to one side. Once the sentence 
analysis was completed, the articles themselves 
could be quantified for comparisons by source, 
channel, etc. Here the preliminary analysis by 
sentence provides the basis for the subsequent 
classification by article. 

In the literature of content analysis five ma- 
jor units of analysis have appeared (for verbal 
symbols) : words, themes, characters, items, and 
space-and-time measures. 


WORD 


The smallest unit generally applied in con- 
tent analysis is the word. This unit (which 
includes word compounds like phrases as well 
as single words) is identical with the “symbol” 
or “unit symbol” of Lasswell (1942) and the 
"term" of Leites and Pool (1942). Its applica- 
tion results in a list of relative frequencies of 
selected words or selected categories of words. 

This unit has been employed most extensively 
in three sets of studies. The first of these is the 
political symbol analysis associated with Lass- 
well. In his World Attention Survey, for ex- 
ample, he studied the relative occurrence in 
the world press of a list of “key symbols” in 
modern politics: freedom, liberty, constitu- 
tional government, fascism, national socialism, 
communism, and so on (Lasswell, 1941, 1942) . 
The second application of the word (or word- 
compound) as a unit is made in some analyses 
of literary style. Thus, a series of studies report 
the major terms employed by a wide range of 
English poets (Miles, 1951). The third int 
portant use of word units in content analysis 
occurs in the studies of readability, to deter- 
mine the ease or difficulty with which communi- 
cation material can be read and comprehended. 
Such studies construct readability formulas on 
the basis of personal references, prepositional 
phrases, different hard words, words with pre- 
fixes and suffixes, and so on (Gray and Leary, 
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1935; Lorge, 1944; Dale and Chall, 1948; Flesch, 
1951). 


THEME 


The next larger unit of content analysis is 
the theme. In its most compact form, the theme 
is a simple sentence, i.e., subject and predicate. 
In other words, a theme is an assertion about 
a subject matter. Thus, it is a sentence (or 
sentence-compound) , usually a summary or ab- 
stracted sentence, under which a wide range of 
specific formulations can be subsumed. Again, 
the theme has been given various names in the 
analytic literature: “assertion” and “statement” 
by Leites and Pool (1942); "proposition" by 
Lasswell (1942); "idea" by Kris and Speier 
(1944) ; "issue" and "argument" by Waples and 
Berelson (1941); and "thema" by Child, Pot, 
ter, and Levine (1946). The specific nature of 
analytic themes will depend, of course, upon 
the particular content and problem at hand. 

Thematic analysis has been done on both 
political and nonpolitical materials. Analyses 
of Axis propaganda in World War II identified 
such themes as “The U.S. is internally corrupt" 
(Lasswell, 1949) or, more specifically, “England 
does not fight herself, but gets her Allies to do 
her fighting” (Berelson and de Grazia, 1947) . 
A set of themes describing “social optimism” 
(i.e., emphasis on social rather than individual 
responsibility and "social pessimism” (i.e. 
affirmation of man’s need for mysticism) was 
analyzed in Protestant sermons (Hamilton, 
1942). The content of Kate Smith's broadcasts 
on a marathon bond drive was also analyzed 
by themes — the themes of sacrifice, participa- 
tion, competition, facilitation, and familial and 
personal appeals (Merton, 1946) . 

The theme is among the most useful units 
of content analysis, particularly for the study 
of the effect of communications upon public 
opinion, because it takes the form in which 
issues and attitudes are usually discussed. But 
it is at the same time among the most difficult 
units of analysis from the standpoint of re- 
liability, especially if it is at all complicated 
(i.e, more than just a simple sentence). One 
elaborate, complex method of attacking this 
problem is to break the theme down into its 
components, to analyze by them, and then to 
reconstruct the theme by a “mechanical” process 
(Waples and Berelson, 1941). Thus the theme, 
“Roosevelt's New Deal, by having the right man 
in the right position, has saved the farmer and 
the working man from ruin," was analyzed into 
its several (relevant) components: 
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Record of the Democratic ad- 
ministration, domestic policy, 


SUBJECT MATTER: 


general 
Objects: labor (workers) , farmers 
Values: economic security, high 
indulgence 
Method: personnel selection 


"These components, rather than the theme itself, 
were coded on punch-cards and the themes were 
later reconstructed by running the appropriate 
combinations on a punch-card machine. It was 
believed that the detailed method of coding 
would be more reliable in this case. Such analy- 
sis has been called "atomic" as against the 
"molecular" analysis of the theme as a whole. 


H 
CHARACTER 


Use of the fictional or the historical character 
as a unit is appropriate in analyses of stories, 
dramas, and biographical sketches. In cases 
where a narrative is woven around particular 
persons, they are the most convenient units on 
which to base an analysis. The entire story or 
sketch provides the basis for specific classifica- 
tions, and the entire item must be read before 
any coding decisions are made. There are ex- 
amples in the analysis of magazine fiction 
(Berelson and Salter, 1946), motion pictures 
(Jones, 1942), radio serials (Arnheim, 1943), 
and biographical articles (Lowenthal, 1943) . 


ITEM 


The unit most frequently used in content 
analysis is the item, i.e., the whole "natural" 
unit employed by the producers of symbol ma- 
terial. The item differs among the different 
media: it may be a book, a magazine article 
or story, a speech, a radio program, a letter, a 
news story, an editorial, or any other self-con- 
tained expression. This unit classifies material 
in the large, so to speak, and for this reason it 
has been frequently employed, especially in the 
case of subject matter analyses for which it is 
Particularly suitable. Analysis by the entire 
item is appropriate whenever the variations 
within the item are small or unimportant. 
Thus, for example, news stories can often be 
reliably classified on a subject matter basis 
(eg. domestic politics, international affairs, 
crime, labor, etc.). But if detailed categories 
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are added which introduce variations within 
items, the item unit may become inappropriate. 


SPACE-AND-TIME MEASURES 


Rather than analyze broadly by items, some 
studies have classified content by such physical 
divisions as the column inch or the page or the 
line or the paragraph (for print) , or the minute 
(for radio), or the foot (for film), thus allow- 
ing for somewhat more precise distinctions 
than are possible with the item unit. Thus, a 
news story containing two subject matters may 
be assigned in toto to the dominant one under 
item analysis, whereas under this system, say, 
eight inches will be assigned to one topic and 
two inches to the other. 


THE INTERRELATION OF UNITS 


There is no reason, of course, why a particu- 
lar study must use only one of the possible 
units of content analysis. The choice of the 
appropriate unit depends upon the problem 
and the content under investigation, and this 
may necessitate the use of different units within 
the same study. For example, in an analysis 
of magazine stories, both the character and the 
item were used to test different hypotheses 
about the content (Berelson and Salter, 1946) . 
Again, in an analysis of letters the theme was 
used in one part of the study and the letter as 
a whole in another (Wyant and Herzog, 1941a, 
b). 

oe unit should be selected for a particular 
study? No general answer can be given. At 
this stage in the development of content analy- 
sis, the best suggestions seem to be these: 


(a) More than one unit of analysis can 
profitably be employed in a single study. 

(b) Word counts and large units (items and 
space-time measures) are most appropriate for 
studies of straight subject-matter emphases. 

(c) Large units ordinarily provide as valid an 
analysis of direction (i.e., approval-disapproval) 
as small units. 

(d) Large units take less coding time than 
small units, for the same categories and ma- 
terials, 

(e) Since themes add another dimension to 
subject matter, they are usually harder to ana- 
lyze than symbols or large units, i.e., they take 
more time for the same reliability. 

(f) The theme is appropriate for studies of 
meanings and of relationships among meanings. 
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the communication is made or to its structural 
components. Because of this specificity of the 
category, it has usually been applied on a 
sentence-by-sentence basis. 

In Lasswellian formulations, the standard 
classification for this category is threefold (Lass- 
well, 1942): fact statements (e.g., “The Com- 
munists want to rule the world.”) ; preference 
statements (eg, “The Communists ought to 
rule the world.”); identification statements 
(e.g. "I am a Communist.”) . 

Other forms of these basic categories can be 
distinguished. For example, fact statements can 
refer to the present or to the future (in which 
case they are expectation statements). Prefer- 
ence statements can be simple preferences or 
they can be formulated as imperatives for ac- 
tion (in which case they are demand state- 
ments). The generality of the categories and 
their lack of specific content have made them 
of relatively little use, except to reveal changes 
over a long period of time (Kris and Leites, 
1947). 

Intensity. "This category, sometimes called 
“emotionalism” or sentimentalization" (Waples 
and Berelson, 1941) , refers to the "strength" or 
"excitement value" (Jacob, 1942) with which 
the communication is made. In many cases the 
analyst is interested in gauging the relative in- 
tensity of different contents, particularly be- 
cause this factor is believed to be especially 
effective in influencing readers and listeners. 


TECHNICAL 


After the units and the categories of the 
analysts have been selected, there remain several 
technical problems to which attention must be 
given. Although they are all general problems 
of social science research, and not distinctive 
to this particular method, their application to 
content analysis presents some special problems. 


WHEN TO COUNT CAREFULLY 


Content analysis can be a time-consuming and 
expensive operation. Someone must go through 
the chore of carefully counting the occurrence 
of categories, on many of which difficult deci- 
sions must be made. This job is not always 
simple or easy, and although the real costs are 
usually obscured by the free labor, voluntary 
or enforced, of graduate students in the univer- 
sities, they are nevertheless high. In research, 
as elsewhere, such costs must be reckoned 
against the returns. 

Review of a number of so-called “qualitative” 
analyses suggests similarities and differences 
between them and so-called quantitative anal- 


HANDBOOK OF SOCIAL PSYCHOLOGY 


Thus the “emotional component” of commu- 
nication content is a relatively popular category 
for analysis. Despite interest in the category, 
attempts to devise generally satisfactory indi- 
cators have not been altogether successful, 
Although such methods have “worked” in spe- 
cific cases (Ge, produced meaningful results) , 
they have not been generalized into indicators 
of intensity applicable to a variety of problems 
in content analysis (Waples and Berelson, 
1941; Jacob, 1942; Wright and Nelson, 1939) . 

Device. This category refers to the classifica- 
tion of a body of content on the basis of its 
rhetorical or “propagandistic” character. The 
category usually involves a whole complex of 
content, and is thus difficult to analyze objec- 
tively. Such propaganda "tricks" as name-call- 
ing, glittering generality, card-stacking, and 
bandwagon (Lee and Lee, 1939) are examples 
of this device category. Analysis of orations 
according to the type of argument used (e.g., 
argument by authority, argument from divi- 
sion, etc, as in Palmer, 1934) or according to 
the figures of speech employed . (e.g. personifi- 
cation, rhetorical questions, metaphor, etc., as 
in Runion, 1936) also illustrate the category. 
In the field of literary analysis, this category is 
illustrated by the investigation of the frequency 
with which the pathetic fallacy was used by 
English poets over the period of a century 
(Miles, 1946) , 


PROBLEMS 


yses (which are spelled out in detail in the 
author's book) : (1) much “qualitative” analysis 
is quasi-quantitative; (2) "qualitative" analysis 
is often based upon the presence or absence of 
particular content rather than relative frequen- 
cies; (3) "qualitative" analysis is done on small 
or incomplete samples; (4) "qualitative" analy- 
sis usually contains a higher ratio of known 
content to content statements than quantitative 
analysis; (5) "qualitative" analysis is relatively 
less concerned with content as such than with 
the content as a “reflection” of “deeper” phe- 
nomena; (6) “qualitative” analysis employs 
less formalized categorization than quantitative 
analysis; (7) “qualitative” analysis utilizes more 
complex themes than quantitative analysis. 
One may ask what are the conditions under 
which one should do a careful counting job. 
When can quantification add toa “qualitative” 
analysis? The following seem to be the condi- 
tions under which quantification is warranted. 
1. Count carefully when a high degree of 
precision and accuracy is required in the results. 
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Here, as elsewhere, precise measurement is jus- 
tified only when considerable precision and 
accuracy are needed in the results. In content 
analysis they may be needed to choose between 
alternatives, to “nail down” important conclu- 
sions, to settle impressionistic disagreements, to 
distinguish small differences, etc. Ordinarily 
this means that a comparison or other relation- 
ship is involved, implicitly or explicitly. But 
only as much precision should be bought as is 
necessary to the purposes of the study. It is the 
pretentious counting of the obvious or the un- 
necessary which gives content analysis a bad 
name. 

2. Count carefully when a high degree of ob- 
jectivity is required in the results. This is re- 
lated to but not identical with the preceding 
maxim. Objectivity (ie., the process of secur- 
ing data which are independent of the character 
of the observer) is ordinarily prerequisite to 
the attainment of precision or accuracy. If the 
counting process is carried out with some care, 
the descriptive results can be accepted with a 
higher degree of confidence than those produced 
by impressionistic, subjective, nonsystematic 
procedures. It is true, of course, that under 
some conditions results may be spuriously ob- 
jective. If a set of categories is so complex that 
it cannot be reliably analyzed, then the numeri- 
cal values which purport to represent their in- 
cidence in the content are no better than 
impressionistic data — and perhaps they are 
somewhat worse, since they carry an aura of ob- 
jectivity without the substance. 

3. Count carefully when the materials to be 
analyzed are representative enough to justify 
the effort. If the materials under analysis are 
fortuitous, irregular, uneven, or otherwise un- 
representative of whatever universe of content 
they are meant to represent, then the effort of 
counting does not seem to be worth while. Any 
numerical results must be so partial and diluted 
that the additional margin of error introduced 
by impressionistic measurement is not critical. 
By and large, counting is not worth the costs in 
such* circumstances, nor is it justified by the 
results. 

4. Count carefully when the materials to be 
analyzed are so many as to be unmanageable 
without such a summarizing procedure. If the 
content is of such magnitude that it cannot be 
adequately handled without counting, then, of 
course, counting is necessary. For example, 
some trend studies of newspaper or radio cot 
tent would be completely unmanageable with- 
out some form of counting, however crude. 
One mind simply could not review page after 
page of newspaper content and produce trend 
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data of any consequence without the counting 
process. 

5. Count carefully when a high degree of 
Specifications of categories is both possible and 
necessary or desirable. The very business of 
counting demands that categories be precisely 
defined so that their incidence in the content 
can be correctly ascertained. If the study of the 
German film had been quantified (Kracauer, 
1947) , it would have required clear definitions 
of such content categories as “championing or 
even visualizing the cause of liberty,” “testify- 
ing to the existence of a state of paralysis,” “re- 
vealing the inner workings of the paralyzed 
collective soul,” etc. If it is impossible or ex- 
tremely difficult to specify categories sufficiently 
to allow careful counting of a reliable nature, 
then obviously, the results do not carry any 
guarantee of their validity. 

6. Count carefully when the categories ap- 
pear with relatively large frequencies. No mat- 
ter how large the body of content, systematic 
counting may not be justified if the relevant 
categories occur only infrequently. In that 
case subjective procedures may be completely 
serviceable for the purpose at hand. For ex- 
ample, if one were concerned, as some Congress- 
men might be, with the degree to which 
American movies present left-wing political 
ideas, it would not be necessary to undertake a 
full-scale quantification, despite the large body 
of materials under analysis. This maxim is 
particularly applicable to those analyses which 
depend upon the presence or absence of the 
categories for their conclusions or interpreta- 
tions. 

7. Count carefully when the content data are 
to be statistically related to numerical non- 
content data. This is by no means always true, 
but it is often the case that counting is called 
for in order to provide numerical content data 
for correlation with other numerical data. This 
involves the use of content analysis for the con- 
struction of general propositions about com- 
munication and not simply for descriptive or 
historical purposes. For example, content 
trends on public issues might be correlated with 
attitude trends, or the intensity of the content 
with behavioral data, or the readability score 
with measurement of actual readership, or the 
“movement” in the content of social work cases 
with other indices of therapy. In none of these 
cases are numerical values for content necessary 
for establishing a relationship, but they may be 
more readily applied in such studies. 

As in other scientific investigations, content 
analysis should be done not as precisely as pos- 
sible but rather as imprecisely as possible — 
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that is, as roughly as the circumstances of the 
study will allow (in order to minimize costs 
relative to returns). This implies that under 
normal conditions careful counting should not 
be done unless it is quite necessary. It is prob- 
ably not too strict to insist that all the above 
conditions be met, or rather all of the first six, 
since the seventh is a special case not always 
applicable. If the study does not deal with a 
large and representative body of material to be 
analyzed in terms of a set of highly specifiable 
categories which appear with substantial fre- 
quencies, in order to produce objective and 
precise results — if these conditions are not 
met, careful counting is probably not war- 
ranted, 


THE PROBLEM OF RELIABILITY 


By definition, content analysis must be ob- 
jective. The analyst’s subjectivity must be 
minimized in the effort to obtain an objective 
description of the communication content. The 
necessity to achieve objectivity, or near-objec- 
tivity, for content analysis raises the problem of 
reliability. 

Reliability is used here in the standard statis- 
tical sense of measuring the degree to which 
the data are independent of the measurement 
instrument. Regardless of who does the analysis 
or when it is done, the same data should be 
secured under similar conditions. Thus the 
analysis of communication content rests upon 
two kinds of consistency: (1) consistency among 
analysts — that is, different coders should pro- 
duce the same results when they apply the 
same set of categories to the same content; and 
(2) consistency through time — that is, a single 
coder or a group of coders should produce the 
same results when they apply the same set of 
categories to the same content but at different 
times. The acceptance of content analysis as 
an instrument for scientific research depends 
to a large extent upon the achievement of high 
reliability on both counts. What has been the 
record of content analysis in this regard? How 
reliable has it been in actual studies and in 
specially devised reliability experiments? 

First, whatever the actual state of reliability 
in content analysis, the published record is less 
than satisfactory. Only about 15-20% of the 
studies report the reliability of the analysis 
contained in them. In addition, about five 
reliability experiments are reported in the 
literature. In brief, then, the simple fact is 
that the whole problem of reliability in content 
analysis has been given insufficient attention for 
the proper development of the field. The re- 
liability of a research technique can hardly be 


established on the basis of reports from only 
one out of every five or six studies. Nor does 
the situation appear to be improving: there 
are no more reliability reports in recent than in 
earlier years. 

Secondly, the reports on reliability which do 
appear are uniformly high. Review of some 30 
studies and experiments reveals a range of cor- 
relation coefficients between about .78 and .99 
with a concentration at about .90, and a range 
of percentage agreements between 6695 and 
96% with a concentration over 90%. However, 
such magnitudes do not guarantee the reliabil- 
ity of content analysis. Many attempts at con- 
tent analysis may have been abandoned because 
of low reliability in the preliminary results. In 
addition, the published content analysis studies 
which do not report on reliability may be pre- 
sumed to have had less satisfactory results, or 
perhaps not to have included reliability tests 
at all. In short, the high reliabilities which are 
reported are more in the nature of encouraging 
indications than absolute guarantees, 

"Third, most of the reported reliability results 
apply to relatively simple versions of content 
analysis. The reliability of a list of symbols 
(e.g, Grey, Kaplan, and Lasswell, 1949) may 
be high, but the reliability of a complex seman- 
tic analysis is another matter. Presumably the 
use of complicated and sophisticated categories 
creates serious problems in reliability. "This is 
the problem of the balance between reliability 
of the procedures on the one hand and the 
richness of the categories on the other. What 
does it matter that we gain reliability if in the 
process we lose all our insights? This critical 
area of content analysis, i.e., the reliability of 
complex categories, still needs to be adequately 
handled. 

Fourth, there is a reasonable likelihood of the 
underestimation of reliability in many studies, 
arising from the measurement of reliability on 
detailed categories which are later subsumed 
into more general categories for reporting pur- 
poses. This procedure, as a matter of fact, is 
recommended by one of the experimental stud- 
ies of reliability (Kaplan and Goldsen, 1949). 
The distinction is made between “gross reliabil- 
ity,” applying to all the coded categories, and 
“net reliability,” applying only to those cate- 
gories, often cumulations, used in the report 
(Reiss, 1949). 

Fifth, there are differentials in the reliability 
€f various kinds of analysis. Reliability is 
higher under these conditions: the simpler the 
categories and the unit, the more experienced 
and better trained the coders, the more precise 
and complete the set of coding rules, the fuller 
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the illustrations. In one case, agreements be- 
tween coders ranged from 67% on inclusion of 
items to 90% and higher for direction (Allport 
and Faden, 1940). In another, coder instruc- 
tion increased agreement from 10-15% (Kaplan 
and Goldsen, 1949). In still another, an ex- 
periment designed to test the greater efficiency 
of a classification scheme in which the analyst 
performed only “one psychological operation” 
at a time, by making a step-by-step series of 
decisions on a complex category, actually 
showed that this method was a little less reliable 
than the traditional process of deciding at once 
whether the content fitted the (combined) cate- 
gory (Schutz, 1950) . 

In sum, the problem of reliability, on which 
a good deal in content analysis depends, has 
been successfully handled in a number of par- 
ticular studies, but it is by no means a "solved" 
problem for the field as a whole. 


THE PROBLEM OF SAMPLING 


"Fhé problem of sampling in content analysis 
is complicated by the fact that as many as three 
decisions about the sample are necessary for 
many studies. There are three universes from 
which samples must be drawn, and in any par- 
ticular problem the methods of sampling can 
differ among the three. First, there is the prob- 
lem of sampling titles (i.e., specific newspapers 
or radio stations or magazines) ; then there is 
the problem of sampling the issues or dates of 
the titles; and finally, there is the problem of 
sampling the relevant content within specific 
issues of specific titles. 

Titles. As has been indicated, several studies 
have relied upon samples composed of the 
largest or the most readily available channels 
or media units However, in some studies 
samples have been selected on the basis of vari- 
ous correlates relevant to the problem under 
analysis. On occasion the selection is made 
in terms of geographical area. Newspapers 
from different regions of the country were repre- 
sented in a study of the editorial treatment of 
educatión (Foster, 1938); country weekly 
newspapers in Minnesota were chosen on the 
basis of their distance from the Twin Cities 
(Taeuber, 1932) ; for a study of foreign language 
broadcasts listening posts were set up "through- 
out the United States” (Arnheim and Bayne, 
1941). The selection may be made in terms of 
types (or presumed audience). Four types of 


magazines (general weekly, general monthly, * 


Women’s, confessionals) were included in an 
analysis of short stories (Berelson and Salter, 
1946) . Other studies have used as the criterion 
for inclusion editorial position. In a study of 
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local treatment of press association dispatches 
on foreign news, "isolationist" and "interven- 
tionis" newspapers were selected (Rogers, 
1941). Other studies have selected their sample 
in terms of size or "importance." In the study 
of editorials on education, newspapers were also 
selected by circulation. (Foster, 1938); in the 
study of country weekly newspapers the earliest 
one established in each zone was taken (Taeu- 
ber, 1932) ; in the study of radio programming, 
stations of different power and network affilia- 
tions were selected. (Baker, 1949). Another 
criterion that has been used is time of issue. 
In a study of news breaks and news repetition, 
morning and evening newspapers were included 
(Buchanan, 1942) . 

This brief list contains the major correlates 
by which samples of titles have been selected in 
content analysis studies. It serves to illustrate 
the first sampling problem which must be met, 
namely, which newspapers or magazines or ra- 
dio stations (or other units) are to be selected. 

Issues. Once the titles have been selected, 
the problem arises of how many issues of the 
title to include. Now this matter of sampling 
the title through time should not, of course, be 
identified with the matter of defining the period 
under study. That is, the period for analysis 
is selected as part of the research design, but 
the problem of selecting issues of the titles 
within the period (or periods) still remains. 
For example, an analysis of the treatment of 
minority groups in magazine fiction was done 
Íor the years 1937 and 1943 in order to test the 
effect of the war, but the problem of selecting 
specific issues of the designated magazines with- 
in those years still had to be faced (Berelson 
and Salter, 1946) . 

The problem of securing representativeness 
in a sample of issues (or dates) is conditioned 
by certain factors. For example, the relevant 
content may be subject to seasonal variations, 
and if the problem necessitates representation 
of the various periods of the year, samples must 
be chosen with this in mind (e.g., Berelson and 
Salter, 1946; Taeuber, 1932; Willey, 1926) . At 
least one study went completely amiss because 
of inattention to this detail: an analysis of the 
weddings reported in the society pages of the 
Sunday New York Times in June found no 
Jewish marriages (Hatch and Hatch, 1947). A 
critic pointed out that ". . . Jewish marriages 
are not performed in the seven weeks between 
Passover and the Feast of Weeks and in the 
three weeks preceding the day of mourning 
for the destruction of the Holy Temple in Je- 
rusalem. Almost invariably, June falls either 
into the one or the other period" (Cahnman, 
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1948, p. 97). Again there is the problem of 
what constitutes "average" or "normal" times 
for certain sample purposes. Since many anal- 
yses have demonstrated the crucial importance 
of events in determining communication con- 
tent, it has often seemed necessary to “control” 
this factor. Thus, an experiment devised to 
test reliability included material from three 
periods — one of which was characterized by 
favorable events, one by unfavorable, and one 
by neutral (Grey, Kaplan, and Lasswell, 1949) . 
Another study was based entirely upon a series 
of events involving the USSR in both positive 
and negative contexts — eg, Soviet aid to 
Loyalist Spain and the Soviet treason trials, 
1937; the battle for Stalingrad and the Second 
Front discussion, 1942, etc. (Kriesberg, 1947) . 
The most intensive experimental investigation 
of sampling methods in content analysis dealt 
with the selection of issues of Pravda over a six- 
month period in 1941 (Mintz, 1949). The 
general conclusions were that weekly samples 
were only as efficient as three-day samples for 
the month and that every-fifth-day and every- 
second-day samples were "distinctly better." 

Content. And even after the titles and the 
issues have been selected, there remains the 
problem of which sections or how much of the 
relevant content to include. After certain issues 
of certain magazines have been chosen for 
analysis, it may still be possible to satisfy the 
study's objectives by taking, say, only half the 
stories. Or, once certain newspapers on certain 
dates have been selected, the analysis may be 
restricted to a particular part of the paper, such 
as the front page, rather than extended to the 

aper as a whole. Sampling techniques are 
applicable on this level of content selection as 
well. 

The last illustration raises the question of 
whether particular sections of communication 
channels are representative of the whole. We 
know that the front page is not representative 
of the "news" in the entire newspaper. Is it 
necessary in sampling the relevant content to 
take into account certain physical factors which 
can affect the distribution of important cate- 
gories? Each of the major channels has its own 
prominent devices, as follows: x 


Newspaper Radio Film 
Space Length Length 
Location Order Order 
Page placement Intonation Color 
Size of headline type Sound Intonation 
Type face Music Sound 

Music 


Use of color 
Use of illustrations 
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Such prominence devices may be correlated with 
relevant characteristics of the content. In fact, 
the most systematic study of this problem has 
shown that they are (Stewart, 1943) . 

In sum, then, sampling procedures in content 
analysis must often be applied to three different 
universes: the titles, the issues of the titles, and 
the relevant content of the issues of the titles. 
And, of course, the same sampling methods 
need not be applied in all three cases. In fact, 
in most cases it will be appropriate to design 
different samples and use different sampling 
methods for the three universes. In some cases — 
and this has been true in several actual studies 
— it is appropriate to sample only one or two 
of the levels and take the universe for the 
other (s). This procedure has resulted in quite 
different, but still suitable, research designs. 

In a few cases different samples have been 
drawn for different purposes in the same study. 
This procedure is particularly applicable to 
studies which investigate problems of both an 
extensive and an intensive nature. Thus a 
study of German broadcasts to North America 
included one sample for general analysis over 
a seven-month period and in addition took 
every reference to a specific event for two weeks 
within that period (Bruner, 1941). The latter 
body of content was subjected to much more in- 
tensive analysis than the former. 


THE PROBLEM OF INFERENCE 


What content analysis as such does is to pro- 
vide a more or less precise description of the 
content in terms meaningful for the problem 
at hand. Yet some content analysis studies 
present conclusions and interpretations of 
wider application than the content itself. In 
the large, then, there are two points at which 
a content study can go wrong. First, it can go 
wrong in applying the techniques of analysis, 
in which case the description of the content 
itself is mistaken, and so is anything which 
follows from it. Secondly, it can go wrong in 
applying the techniques of interpretátion or 
inference to the correct content description. 
Some studies are open only to the first source 
of error, some to both sources. This raises the 
problem of inference in content analysis. 

In a great many studies there is no real prob- 
lem of inference at all. This is true for all those 
content analyses in which the description of 


"content itself is the primary objective. Such 


studiés can be said to contain implicit infer- 
ences about the causes or the consequences of 
the content — and some contain them explic- 
itly — but such inferences are in the nature of 
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addenda to or reformulations of the basic data. 
Thus a trend study of newspaper content can 
be considered material for inferences about the 
changing character of press controls and/or 
about the changing character of public atten- 
tion. But such “inferences” are usually nothing 
but reformulations in other terms of the result 
of the content analysis itself. Or take the case 
of the treatment of ethnic groups in magazine 
fiction: after the basic content data were de- 
scribed at some length, a few hypotheses about 
the probable reasons for and effects of the 
content were proposed, but these were ap- 
pended to the main job and were clearly 
presented as such; they could be eliminated 
without doing violence to the substance of 
the study. In short, where the central concern 
is with the content as such — with the content 
a» the category, not the indicator — then the 
problem of inference is virtually nonexistent. 

Nevertheless, it is useful to inquire briefly 
into the types of generalizations which charac- 
terize such studies. Mainly of a historical or 
descriptive nature, they base their conclusions 
upon these four kinds of relationships: 

Trend comparisons. 'These studies contain 
conclusions about changes in the frequency of 
the categories through time, many of which 
have already been described. Some studies 
simply present the changes continuously over 
a long period of time (e.g. Taeuber, 1932) ; 
others present them at two selected points in 
time (eg. Lowenthal, 1943) ; and still others 
present them for periods of different lengths 
which have themselves been selected with refer- 
ence to important external criteria (e.g. Leites 
and Pool, 1949). In all such cases, however, the 
description of content along a time continuum 
provides the basis for the conclusions. 

Intracontent comparisons. Here are the 
studies whose major conclusions rest upon re- 
lationships between two or more characteristics 
or parts of the same body of content. For 
example, the comparative treatment of majority 
and minority groups in magazine fiction on a 
range of selected factors is the basis for generali- 
zation (Berelson and Salter, 1946). The com- 
parative emphases in popular jokes dealing 
with Jewish, Negro, and Irish characters reveal 
the stereotypes operating in such material 
(Barron, 1950), The technical comparison of 
the treatment of Negro-white relations in dif- 
ferent parts of Negro newspapers is another, 
case in point (Stewart, 1943). In short, the 
conclusions here rest upon comparisons within 
the same body of content. 

Comparisons of different bodies of content. 
This is the related application in which con- 
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clusions rest upon comparisons between or 
among different bodies of content. Compari- 
sons of the content of Swiss and British news- 
papers (Guttinger, 1939) ; of the treatment of 
a number of wars in American and enemy text- 
books (Walworth, 1940) ; of the correspondence 
in propaganda output by two national radios 
(Berelson and de Grazia, 1947) ; of the motion 
picture versions of “good” books with the origi- 
nals (Asheim, 1949) ; of the sensationalism in 
radio and newspaper treatments of the same 
event (Gottlieb, 1947) — these are examples 
of this form of generalization. Ordinarily it 
requires the analysis of the different contents 
by the same set of categories. 

Comparison of content with a standard. In 
this application the relation between the con- 
tent and the standard which it is supposed to 
meet serves as a basis for the conclusion. The 
standard may be implicit and assumed or it 
may be explicit and established. For example, 
in analyses of newspaper and radio treatment 
of controversial issues, the conclusions are based 
upon the nonconformity of the content to a 
presumed standard of “balance” or “fairness” 
(e.g, Sussman, 1945; Gabel and Gabel, 1946a 
and b). Prior work in establishing criteria for 
readability levels, translated into content terms, 
is applied to subsequent analyses in this field. 
A considerable number of content analyses base 
their conclusions upon this form of relation- 
ship, although it is not often completely stated 
in these terms. 

A good deal of loose interpretation appears 
in content analyses because the exact nature 
of the inference is hidden in a cloud of am- 
biguous terms, which often turn out to be the 
verbs which bind the content subject to the 
noncontent predicate. "Typical terms are "re- 
flects," "relates," "expresses," "indicates," "pro- 
jects,” "suggests," "means," etc. To the extent 
that such terms have value, it lies in their 
assertion that the two sets of factors have some- 
thing to do with each other; their deficiency 
lies in the vagueness of the assertion. How, 
concretely, does the German Zeitgeist, whatever 
it is, come to determine the content of the 
German film — through the conscious, delib- 
efate exploitation of the public's attitude by 
discerning producers? Through the uncon- 
scious expression of public feelings by the pro- 
ducers, as members of the society? Through 
their conscious expression of such feelings? 
What part of the public? What group of movie 
makers? 

On occasion it will be impossible to base 
at least parts of the inference upon a previ- 
ously established generalization relevant to the 
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problem. For example, the British predictions 
about the German possession of V-2 weapons 
were based upon the historically correct proposi- 
tion that the Nazi propagandists did not arouse 
popular expectations of such action without 
real reason. A similar proposition served as 
the basis for the American analyst's inference 
that Goebbels’ threat of “counter terror" meant 
that the Nazis were considering renewed anti- 
Jewish measures. Where such supports for the 
chain of inference exist they are invaluable. 

Basically it might not be far wrong to suggest 
that there are two types of inferences drawn 
from content data — the direct and the inverse. 
In the direct method, essentially, the content is 
taken at face value. What it says is, in effect, 
translated into an inference about noncontent 
matters. If Fritzsche says that the German 
people should not expect continued successes 
from submarine warfare, the inference is that 
the Nazi elite does not expect such successes. 
If popular magazines treat subject 4 with high 
frequency and subject B with low, the infer- 
ence is that people are more interested in the 
former topic. 

In the inverse method, the opposite of the 
content statement is taken as the inference. If 
the Nazi radio insists that the Italians are full 
of fight, the inference is that the Nazi elite 
thinks that the German people do not think 
well about the sticking power of their allies. 
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If the American movies depict a situation in 
which the hero is loved unconditionally by the 
heroine while he remains doubtful in his affec- 
tion, the inference becomes a “reversal of the 
experience of American boys with their exacting 
mothers . . . In this case the relation between 
a movie theme and one of its probable real life 
sources is a reversal rather than a reproduc- 
tion" (Wolfenstein and Leites, 1950, p. 306) . 

The basic problem, then, is to decide which 
inference to make — the direct or the inverse. 
Little progress has been made toward differen- 
tiating the conditions under which the one or 
the other is most appropriate. Suppose an 
analyst in 2051 notices that American novels of 
1951 contain a great many sexual references. 
What is he to infer about sexual mores in this 
country — that they were free and open (as 
revealed by the attention given them in litera- 
ture) or that they were repressed and inhibited 
(as revealed by the same)? 

In sum, the problem of inference in content 
analysis is no different from similar problems 
in other areas of social research. Ultimately 
the inference itself must be tested by methods 
directly applicable to it. To that’end, full and 
unambiguous statements of the detailed. infer- 
ence itself, of the line of reasoning that went 
into it, and of the assumptions contained there- 
in are desirable. 


CONCLUSION 


'This, then, is content analysis. Not all the 
studies reported or described in this review were 
well conceived or well executed. Some of them 
are quite pretentious, some obvious, some 
pedestrian, some clever but poorly done. No 
brief is made here for all the studies used for 
illustrative purposes; in any case, this is what 
content analysis currently amounts to. 

How the contributions of content analysis 
measure up against those of other methods in 
social science research is difficult if not impos- 
sible to say. Because of the relative availability 
of the raw material, many sins have been cem- 
mitted in the name of content analysis simply 
because something else was harder or more in- 
convenient to do. At the same time, however, 
the method has produced a number of useful 


studies which combine original ideas with 
sound, careful documentation. This is, of 
course, the heart of the matter. 

In content analysis, as anywhere else in social 
research, it is important to start in the right 
way. Simply going on a fishing expedition 
through some common communication material 
is almost certain to be unrewarding. Unless 
there is a sensible, or clever, or sound, or reveal- 
ing, or unusual, or important notion underlying 
the analysis, it is not worth going through the 
rigor of the procedure, especially when it is so 
arduous and so costly of effort. Content analy- 
sis, as a method, has no magical qualities — you 
rarely get out of it more than you put in, and 
sometimes you get less. In the last analysis, 
there is no substitute for a good idea. 
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CHAPTER 14 


The Cross-Cultural Method 


Jonn W. M. WHITING 
Harvard University 


The cross-cultural method utilizes data col- 
lected by anthropologists concerning the culture 
of various peoples throughout the world to test 
hypotheses concerning human behavior. Some 
of the hypotheses tested have been derived from 
theories of cultural evolution, others from 
theories concerned with the integration of cul- 
tyre, and still others, particularly in recent 
years, from theories of individual and social 
psychology. 

The cross-cultural method was first used by 
E. B. Tylor (1889), who presented at the meet- 
ings of the Royal Anthropological Institute of 
Great Britain a paper entitled “On a method 
of investigating the development of institutions; 
applied to laws of marriage and descent.” It is 
interesting that Sir Francis Galton, one of the 
fathers of modern statistics, presided at this 
meeting. Although the intent of the study was 
to support Tylor's particular view of cultural 
evolution, its importance was that in the paper 
and the discussion which followed most of the 
basic assumptions of cross-cultural research were 
touched upon. 

It was not until twenty-five years later, how- 
ever, that the method was again used and, as 
before, to test evolutionary theory. Hobhouse, 
Wheeler, and Ginsberg (1915) published a 
monograph in which the frequency of occur- 
rence of certain social institutions was deter- 
mined for each stage of economic development 
from lower hunters to advanced agricultural- 
ists. Essentially similar (but more methodo- 
logically and theoretically sophisticated) studies 
folloWed after another lapse of nearly twenty- 
five years with Simmons (1937) and Murdock 
(1937) . 

In the last fifteen years, the cross-cultural 
method has not only become more popular but 
has changed in its theoretical orientation, It 
has drawn upon the theory of general behavior 
science rather than that of cultural evolution. 
Ford's (1939) study of the frequency of eran 
types of behavior presumed to be closely related 
to biological needs was the first to represent 
this change, even though it was concerned with 
the development of an appropriate conceptual 
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system and the establishment of cross-cultural 
norms rather than with the testing of hypoth- 
eses. His later work (1945), and with Beach 
(1951), is also primarily conceptual and nor- 
mative. 

The first study to be published which was 
concerned with the testing of hypotheses de- 
rived from behavior science was that of D. G. 
Horton (1943). He used the cross-cultural 
method to investigate the relationship between 
the drinking of alcoholic beverages and anxiety, 
and showed that alcohol reduced inhibition in 
all societies unless special measures were taken 
to prevent it. 

Murdock’s Social Structure, published in 1949, 
represents a landmark in cross-cultural research. 
In this study he used a sample of 250 societies 
distributed over the world to test hypotheses 
concerned with the correlation between kinship 
terminology and other aspects of social struc- 
ture; particularly, residence rules, the form of 
marriage, and rules of descent. He draws spe- 
cifically upon behavior theory to explain the 
origin and generalization of the incest taboo. 

Studies with a behavior science orientation, 
but of more limited scope than Murdock's, in- 
clude B. B. Whiting's (1950) study of the rela- 
tion between sorcery and social control, 
McClelland and Friedman's (1952) study of 
the relation between child training practices 
and achievement motivation, Wrights (1952) 
study of the relation between child training and 
aggression as expressed in folktales, and Barry's 
(1952) study of the relation between child 
training and certain formal characteristics of 
decorative art. 

The most elaborate use of psychological 
theory in cross-cultural research is that of 
Whiting and Child (1953). In this study cer- 
tain hypotheses relating to the Freudian 
mechanisms of fixation, displacement, projec- 
tion, and the development of guilt are tested. 
The antecedent variables in this study consist 
of child-training practices, and the consequent 
variables consist of magical beliefs and practices 
concerning the cause and cure of illness. Thus, 
ethnographic data from primitive societies have 
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been used not only to test theories at the level 
of culture and society but also theories which 
are derived essentially from individual psychol- 


It might be asked at this point why ethno- 
graphic material should be drawn upon to test 
psychological principles. Why not stick to 
materials gathered in one’s own society, where 
the language and culture are familiar and 
where more adequate control of the process of 
data collection is possible? 

The advantages of the cross-cultural method 
are twofold. First, it ensures that one’s findings 
relate to human behavior in general rather than 
being bound to a single culture, and second, it 
increases the range of variation of many of the 
variables, 

With respect to the first point, that of gen- 
erality, one might ask questions such as the 
following: Is the Oedipus complex a universal 
phenomenon or is it peculiar to middle-class 
western Europeans? Does the relation between 
frustration and aggression hold in societies 
other than our own? 

Since most social psychological studies are 
done within the framework of western European 
culture, one can never be certain whether the 
discovered relationships are valid for all man- 
kind or whether they are an artifact of some 
limitation or special circumstance of the cul- 
ture in which they have been discovered. An 
example will serve to illustrate this point. Sears 
and Wise (1950) found as a result of a study 
of a sample of eighty children living in Kansas 
City that there was a positive relationship be- 
tween the age of weaning and the degree to 
which the infant gave indications of emotional 
disturbance. That is, the later a child was 
weaned the more disturbance he showed. This 
relationship reached the 1% level of signifi- 
cance. 

One might ask whether this relation between 
age of weaning and emotional disturbance is 
a general human trait or whether it is peculiar 
to the culture of Kansas City. Anthropologists 
are all too eager to find exceptions to any rule. 
Thus, Beatrice Blackwood (1935) reports that 
the Kurtatchi of the Solomon Islands do not 
wean their children until they are over three 
years old and that Kurtatchi children show no 
signs of emotional disturbance. Does this single 
case disprove the rule? More than two cases, 
e.g, Kansas City and Kurtatchi, are needed to 
answer this question. 

Fortunately, the data are available. Whiting 
and Child (1953) in a cross-cultural study of 
child training and personality collected ma- 
terial, on the same variables as those used by 
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Sears and Wise, from a sample of seventy-five 
societies distributed over the world. Evidence 
was available so that judgments could be made 
for age of weaning and emotional disturbance 
for thirty-seven societies. The mean score for 
emotional disturbance on a seven-point scale 
for the seventeen societies in which the onset 
of weaning was at two years or less was 3.5, 
whereas the mean score for the twenty societies 
in which weaning took place later than two 
years was 2.8. 

It will be seen from this that in this instance 
the Kurtatchi are not a single exception but 
are typical of a tendency for late weaning to 
result in low rather than high emotional dis- 
turbance and thus there really seems to be 
contradiction between the Sears and Wise find- 
ings and the cross-cultural evidence. If the 
methods of measuring these two variables are 
accepted as reliable and comparable in the two 
sets of data, one might conclude that different 
psychological principles were operating in Kan- 
sas City than in the rest of the world. Closer 
examination of the data, however, reveals a 
much more plausible explanation and illus- 
trates the second advantage of the cross-cultural 
method, namely, that of increased range. 

The effective range of age of weaning for the 
Kansas City sample was 0-7 months (only five 
of the eighty cases were later than seven 
months) , whereas for the cross-cultural sample, 
with the exception of two cases, it was from 
twelve months to six years. Thus, there is prac- 
tically no overlap between the two distributions 
and, as is shown in Table 1, the two findings 
complement each other to indicate a curvilinear 
rather than a linear relation between the two 
variables. Due to lack of range, neither study 
tells the whole story. 

The preceding case illustrates that, with re- 
spect to the age of weaning at least, psycholo- 
gists would be exceedingly hampered by lack of 
range in developing a theory of its effects which 
would be applicable the world over. There is 
considerable evidence that this lack of range 
is not restricted to the age of weaning. “Thus, 
although much of the accuracy of measurement 
and experimental control generally found in 
psychological studies on individuals in our so- 
ciety is lacking in cross-cultural studies, this 
spread in the variables provides a useful check 
upon and supplement to psychological research. 

Even though extreme cases on some variable 
may be found in our society, these are deviant 
from the cultural norms, In cross-cultural re- 
search the norm for a whole society is taken as 
a score, Thus no matter how extreme a case 
may be from our point of view, the individuals 
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TABLE 1 


RELATION BETWEEN AGE AT ONSET OF WEANING AND AMOUNT OF EMOTIONAL DISTURBANCE SHOWN 
BY CHILD. 


Comparable data from eighty individual children from Kansas City (Sears & 
Wise, 1950) and from thirty-seven societies (Whiting & Child, 1953) are pre- 
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who manifest this behavior do not perceive 
themselves to be deviant. To return to the 
age of weaning, a Kurtatchi mother who does 
not wean her child until he is between three 
and four years old is doing just what other 
Kurtatchi mothers do. An American mother 
who nursed her three-year-old child, however, 
could not help but be aware that she was devi- 
ating from the usual practice — a factor which 
might have considerable effects on her and 
might therefore contaminate one's findings if 
one were interested in the consequences of late 
weaning rather than of deviant behavior. 
Furthermore, the cross-cultural method, by 
studying cultural norms, holds individual varia- 
tion constant. Psychological studies of irftli- 
viduals in a single society do just the opposite, 
in that cultural norms are held constant and 
individual variations are studied. A combina- 
tion of these two methods should supplement 


and correct each other in the development of a 
general theory of human behavior. 

Granted that the cross-cultural method is use- 
ful in that it provides an increased range for 
certain variables and a test for the generality 
of hypotheses, there are some basic assumptions 
involved in the use of the method which should 
be considered. First, the method rests on the 
assumption that the customs of a society are 
truly comparable to the habits of an individual. 

*There are those who argue that culture is essen- 
tially independent of psychological laws. White 
(1949) , for example, holds that there is a sci- 
ence, which he calls culturology, that concerns 
itself with the evolutionary development, dif- 
fusion and transmission of culture, and that 
the principles which govern these processes are 
on a different level of abstraction from that of 
psychology. Another view, represented by Ford 
(1939), Gillin (1948), Murdock et al. (1950), 
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Whiting and Child (1953), and Whiting et al. 
(1953) , while recognizing the limitation which 
social interaction imposes upon culture, has 
defined a custom (the basic unit of culture) as 
a special case of a habit. Thus, the psychologi- 
cal principles which apply to habits should, by 
this definition, apply to customs. 

To be more explicit, a custom may be thought 
to refer to the behavior of a typical individual 
in a given society. Thus, the Kurtatchi custom 
of weaning the child between three and four 
years of age, which has been referred to above, 
may be understood to refer to a typical or aver- 
age Kurtatchi mother. Put in another way, the 
thirty-seven societies presented in Table 1 repre- 
sent thirty-seven typical parents and their chil- 
dren, one in each of the thirty-seven societies. 
The problem of defining and measuring a cus- 
tom in this way is discussed in detail in Whiting 
et al., Chapter IV. 

If one makes the assumption that a custom 
is the habit of a typical member of a society, 
then it becomes important to scrutinize the 
nature of ethnographic data and the methods 
by which they are collected. Were the societies 
chosen by anthropologists as large and complex 
as ours, to describe a custom would involve ex- 
ceedingly elaborate sampling techniques, such 
as are described in other chapters of this hand- 
book. It is the general practice of anthropolo- 
gists, however, to study small and relatively 
homogeneous societies, or when large and more 
complex societies are studied, to limit the re- 
port to a single village or band. 

My own experience with the Kwoma (1941) 
is not atypical of ethnographic practice. The 
Kwoma is a tribe situated in central New 
Guinea, The total population of some nine 
hundred people are divided into four subtribes. 
Each subtribe is in turn divided into sibs and 
each sib into hamlets. Except for a brief trip 
to the other subtribes and a few visits to other 
hamlets, virtually all of my observations were 
made on Rumbima hamlet of the Hayamakwo 
sib of the Hongwam subtribe of the Kwoma. 
Rumbima hamlet consisted of thirty-two adults 
and thirty-two children. Since my house was 
built in the hamlet and I lived there for nine 
months, the customs I report are concerned, 
with typical people of various statuses within 
this hamlet. Whether the customs were repre- 

sentative of the Kwoma as a whole or of the 
Hongwam subtribe or of only the Hayamakwo 
sib I cannot say. For cross-cultural purposes, 
however, Rumbima hamlet — not the Kwoma 
— should constitute the case. 

It is unfortunately the practice, as far as cross- 
cultural work is concerned, for anthropologists 
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all too frequently to use in the titles of their 
monographs or articles some group larger than 
the one which they studied, just as I, m 

did. (Becoming a Kwoma rather than “Be- 
coming a Rumbima.”) This is misleading, for 
it suggests that the culture which they describe 
is representative of the larger group, an assump- 
tion which they are generally not warranted in. 
making. Even more important, several eth- - 
nographers may each report on a given society 
but study quite different subgroups whose cul- - 
ture may vary to an unknown degree. If it 
were the case that one ethnographer described. 
child-training practices in detail and another . 
had a fine collection of folktales it would be 
very tempting, if one were studying the effect 
of child training on projective systems, to con- 
sider both reports to refer to a single case — _ 
tempting but dangerous. It is especially da 
gerous when the ethnography refers not even - 
to a single society, but to a linguistic stock such 
as the Sioux or the Zulu. 

Fortunately, anthropologists generally iden- . 
tify the group on which they actually gathered 
their material. This is usually in the introduc- 
tory chapter. As a caution, to a novice under- 
taking a cross-cultural study — choose your 
sample from this identification and not from. 
the title. An excellent discussion of this prob- 
lem may be found in an article by Murdock 
(1953) . } 

Although a small homogeneous society such 
as Rumbima hamlet presents no problem as - 
far as sampling goes, it does raise a question. 
of the stability of one's observation. Thus, in 
Rumbima hamlet only two children were being 
weaned during the nine months when I was 
present. Since both of these children were be- 
ing observed I obtained a 100% sample of 
weaning practice at Rumbima in 1937. Two - 
instances, however, do not provide a very stable — 
measure. Individual variation in the two chil- 
dren could obviously introduce considerable 
bias. In fact, in the present instance, one of 
the children observed was quite sickly and his 
response to weaning may have been influtnced | 
more by this fact than by the actual age of wean- 
ing. 
There is, however, an anthropological field ` 
technique that compensates to some extent for ` 
this defect. Anthropologists generally do not 
rely alone upon the observation of individuals 
or the questioning of respondents about their 
own experience, but they also use informants, — 
who tell them what "the people" do (cf. Chap- 
ter 12). Thus, when I asked my Rumbima ` 
informants when they (the people of Rum- 
bima) weaned their children, their answer 
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represented not only the two children whom I 
saw but their opinion based upon all the chil- 
dren which they had observed being weaned. 
Although this greatly increases the number of 
individuals observed, it introduces another 
difficulty — that of the bias of the informant. 
Needless to say, this is a problem that is always 
with the social scientist who uses verbal report 
as a technique for gathering data. To minimize 
this type of bias, anthropologists generally check 
all information with a number of informants 
and they pay particular attention to instances 
of disagreement. They also check instances of 
disagreement between the statements of in- 
formants and their own, i.e., the anthropolo- 
gists’ observation. As an example of this last 
point, my Rumbima informants’ report about 
weaning did not correspond to my observation 
of the sick child. When confronted with this 
fact, my informants agreed that this was due to 
the boy's illness. 

In general, then, a reasonably reliable esti- 
mate of the typical behavior of people in small 
homogeneous societies or subgroups through- 
out the world can be made on the basis of eth- 
nographic data. 

There is another basic assumption, however, 
which underlies the cross-cultural method. This 
assumption is that customs can be compared 
from one society to another. There are many 
who argue that every culture represents a com- 
plex whole such that a custom which may look 
the same as a custom in another culture really 
means something quite different. This same 
problem is quite similar to that of the psycholo- 
gist who compares individuals. Child psycholo- 
gists express the same view when they insist on 
considering the “whole child.” 

It is, of course, true that any individual case, 
be it a person or a society, is unique and, as a 
whole, incomparable with any other individual 
case. Scientific investigation, however, is based 
on the assumption that attributes of the whole 
can be abstracted and compared. Thus it is 
said that no two snowflakes are identical, and 
yet it is also a valid statement that snowflakes 
are hexagonal, white, and will melt in tempera- 
tures above 32° Fahrenheit. Some investigators 
may be concerned with the beauty and intricacy 
of the crystalline pattern of each individual 
snowflake; others with discovering the general 
truths about snowflakes. Both are legitimate 
enterprises, although one might not think so if 
one listened to the heated arguments betwgen 
proponents of these two approaches. e 

Those who are concerned with the descrip- 
tion, explanation, or diagnosis of a single case 
generally use rather complex analytic units 
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which combine a very complicated cluster of 
attributes or variables. Clinical psychologists 
are wont to speak of the Oedipus complex or 
the oral syndrome. Similarly, anthropologists 
in describing a single society speak of the politi- 
cal aspect, the religious system, the social struc- 
ture, the economic institutions, or the ethos. 
These abstractions are so complex and multi- 
dimensional that they are of little use for cross- 
cultural research. If no further analysis is 
made, the ethos of one culture can scarcely be 
compared with the ethos of another culture. 
If one views cultures, or individuals, at this 
complex level of abstraction, one is left with 
the feeling that they are indeed unique and 
incomparable. 

Comparison demands analysis into simple 
enough units so that certain attributes may be 
seen to vary along a single dimension, Com- 
parative anthropologists, whether they have 
been concerned with historical or functional 
problems, have tended to use much simpler 
units of analysis than those who have been 
more concerned with the description of a single 
society. For example, Malinowski, concerned 
chiefly with a complete description of Trobriand 
culture, used the institution as his basic analytic 
unit, whereas Kroeber in his early historical 
comparative work used the concept of culture 
trait. 

The concept of culture trait, although it is 
more promising than units of greater complex- 
ity, has, nevertheless, a weakness as far as testing 
hypotheses derived from psychological theories. 
It refers both to behavioral and material events. 
Cradleboards and methods of child care, spears 
and hunting techniques, drums and dance forms 
are all listed as culture traits. Cradleboards, 
spears, and drums are a different order of events 
from child care, hunting techniques, and dance 
forms. Principles of physics and chemistry ap- 
ply to the former; principles of psychology and 
sociology to the latter. 

The concept of a custom, discussed above, 
has the advantage of being both psychologically 
meaningful and simple. It is appropriate here 
to add to the definition of a custom, which has 
been defined above as a habit of a typical in- 
dividual in a society, its essential attributes. 

"They consist of (1) an agent or a subject (2) 
performing an act (3) in given circumstances. 
To distinguish a custom from an individual 
habit it is required that the agent, the act, and 
the circumstances be generally recognized as 
categories by the members of the society. Thus, 
the subject of a custom would be a mother or 
a farmer rather than Mary Smith or John Jones. 
Similarly, the circumstances and acts would be 
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recognized categories such as “at dinner time" 
or "driving a car" rather than events that are 
relevant only to a particular individual or to 
some experimental setup. A unit with essen- 
tially these attributes was first suggested by 
Ford (1939), and developed by Murdock et al. 
(1950) and by Whiting and Child (1953). 

Not only does the concept of custom have the 
advantage of simplicity but, since it is defined 
as a special kind of habit, psychological prin- 
ciples such as those of learning and extinction, 
generalization and discrimination, and motiva- 
tion and conflict can be drawn upon to suggest 
hypotheses about culture. To do this, of course, 
customs must be measured in the same manner 
that habits are. 

Many anthropologists agree that a culture 
can be analyzed into its component customs, 
but are still concerned about the comparison 
of customs across cultures. They point out that 
actions which are formally identical have quite 
different meanings in different societies, Belch- 
ing may be complimentary or rude; thumbing 
the nose insulting or meaningless. 

This problem is not so difficult as it at first 
seems. Cross-cultural comparison requires 
equivalence in meaning rather than formal 
equivalence. In other words, if customs are de- 
fined in terms of universal panhuman dimen- 
sions they can be compared across cultures. An 
insult is an insult whether it be accomplished 
by thumbing the nose, biting the thumbnail, 
or shouting “karaganda yikafa" ("little baby” 
in Kwoma). As Kluckhohn (1953, p. 517) puts 
it, ". . . biological, psychological, and socio- 
situational universals afford the possibility of 
comparison of cultures in terms which are not 
ethnocentric . . .” 

Motives based upon social interaction such 
as aggression, dependence, affiliation, achieve- 
ment, and the like provide a set of universal 
dimensions by which customs that are of par- 
ticular interest to psychologists can be cross- 
culturally compared. McClelland and Friedman 
(1952) , Wright (1952), and Whiting and Child 
(1953) have already used such a device in cross- 
cultural research. 

But how does one go about doing cross-cul- 
tural research? What practical problems are 
involved? The method is not essentially dif-^ 
ferent from any correlational study. The sample 
must be selected, the variables chosen, scales 
defined, judges trained, reliability established, 
care taken to prevent bias in judgments, etc. 
There are a few special precautions and some 
useful short-cuts, however, which should be 
mentioned. 

First there is a precaution as to the choice 
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of a sample. A criticism which has been applied 
to cross-cultural research ever since Tylor (1889) 
presented his pioneering paper is that of the 
independence of cases. These critics argue that 
if two societies derive from a common origin 
or have extensively borrowed from each other, 
they should not be counted as two instances but 
only as one. As Boas (1927, p. 120) stated it: 


In order to make a statistical method a success 
it is essential that the phenomena counted must 
be independent of one another. If a number of 
them go back to the same historical sources they 
cannot be considered as separate units. 


Proponents of cross-cultural research. consider 
this an extreme statement, and take the view 
that as long as two societies are politically dis- 
tinct they are as independent as two individuals 
in a single society. This also is an extreme 
view. The problem is complex and not as yet 
completely solved. All empirical evidence so far 
supports the view, however, that Boas was over- 
cautious. B. B. Whiting (1950) and Whiting 
and Child (1953) have each used somewhat 
different methods to correct for this possible 
error. B. B. Whiting checked her results by 
recalculating her major correlations on a sample 
which included only one society from each ma- 
jor culture area in the world. Whiting and 
Child used the technique of testing some of 
their hypotheses independently in each of the 
five continental and oceanic areas of the world. 

Even though such corrective devices are avail- 
able, one should bear the problem of inde- 
pendence of cases in mind in selecting the 
original sample. A simple precaution is to 
avoid using two cases which are known to have 
derived from a common origin within the re- 
cent past — that is, within such a short time 
that their cultures have not had a chance to 
change. A more cautious rule would require 
choosing no more than one case from a linguis- 
tic area in which the languages were still 
mutually intelligible, Still another method 
would be to get a fairly equal representation 
of cases from the five major areas of the world 
— North America, South America, Eurasia, Af- 
rica, and Oceania. 

A second methodological precaution relates 
to the reliability of sources. The report of a 
traveler who has visited a society for a few 
days and writes an account of its culture clearly 
should not be given as much credence as the 
report of a trained ethnographer who has lived 
with the people for a year or more and learned 
to speak their language. To illustrate the type 
of report one sometimes finds in travel ac- 
counts, there is a persistent story told on the 
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coastal towns of New Guinea by the Australian 
plantation owners to uncritical visitors. The 
story runs that there is a tribe of natives living 
in the interior who have long tails and live in 
houses on piles. In order to sleep comfortably 
at night, they have holes bored in the flooring 
through which they stick their tails. Unfortu- 
nately this makes them an easy prey to their 
headhunting neighbors who creep beneath their 
dwelling and tie knots in their tails before at- 
tacking. Clearly, there is considerable range in 
the reliability of sources and some assessment 
of sources should be made before using them. 

A third precaution for a novice undertaking 
cross-cultural research relates to the identifica- 
tion of a society. This problem has already 
ben touched upon above in discussing the prob- 
lem of sampling. In this context, however, it 
should be mentioned that the same society may 
We called by a different name in different 
sources, Thus the Teton tribe is also referred 
to as the Western Dakota and the Western 
Sioux. 

A fourth precaution relates to the treatment 
of societies in which the variables under con- 
sideration depend upon a judgment of the 
presence or absence of certain beliefs or prac- 
tices, Thus, the belief that animal spirits have 
the capacity to cause illness might be predicted 
to occur in societies with a high degree of ag- 
gression anxiety. Whiting and Child (1953), in 
attempting to test this hypothesis, found that 
this belief was reported to occur in thirty-eight 
out of their sample of seventy-five societies. In 
the remaining thirty-seven societies no mention 
of this belief could be found in the ethnographic 
sources, In none of these latter societies, how- 
ever, was there the explicit statement that this 
belief was absent. This is by no means an 
unusual state of affairs. If ethnographies con- 
sisted of a listing of all the potential beliefs and 
practices which did not occur, as well as those 
which did, the bulk of the monograph would 
consist of negative evidence, since any one so- 
ciety has but a small fraction of the total num- 
ber of customs that may be found throughout 
the world. 

How should societies for which there is no 
mention of a particular trait be treated? Can 
it be assumed that the trait is absent if it is 
not mentioned, or should these societies be omit- 
ted on the basis of inadequate information? 
The latter solution is the more conservative 
and, as a rule of thumb, should generally be 
followed if both the antecedent and consequent 
variables depend upon the presence or absence 
criterion, Otherwise one may obtain a spurious 
positive correlation which reflects only differ- 
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ences in the adequacy of ethnographic coverage. 
That is, both traits would tend not to be re- 
ported more frequently in the poorly covered 
tribes than in the well-covered tribes. Such a 
spurious correlation will not result when at 
least one of the variables depends upon some 
variation in the degree of a trait rather than 
its presence or absence, and when such variation 
in degree does not depend upon the amount of 
information. To assume that no information 
means absence in such an instance, although it 
does not introduce bias, does, however, increase 
the amount of error variance, and for this rea- 
son should be avoided if possible. 

The practice of always omitting cases with 
no information, however, often reduces the 
number of cases so that an adequate test cannot 
be made. To overcome these difficulties it is 
suggested, in instances where the presence or 
absence criterion is used, that a distinction be 
made between those societies in which no men- 
tion is made of a particular custom but the 
context in which such custom normally occurs 
is described in detail, and those societies in 
which the appropriate context is not described. 
The former can be scored as inferred absence 
whereas the latter can be scored as not ascertain- 
able and the case omitted. "Thus, for example, 
if one is interested in “bloodletting” and finds 
no mention of this practice in an ethnography 
which has a complete and detailed description 
of medical theory and therapeutic techniques, 
it could be fairly safely assumed that the prac- 
tice was either absent or quite unimportant. If, 
however, there was no discussion of the medical 
practices at all, it would be much safer and 
more appropriate to omit the case. 

Fortunately for cross-cultural research, many 
of the practical problems of selecting a sample, 
working up a bibliography, and identifying a 
tribe are greatly facilitated by the Human Re- 
lations Area Files. This is a cooperative enter- 
prise among sixteen universities to collect and 
process the ethnographic material on a world- 
wide sample of societies. Processing consists 
of working up a complete bibliography of the 
major sources on a society, translating those 
which are in a foreign language, classifying or 
coding each paragraph according to a standard 

“outline (Murdock, et al., 1950) , duplicating the 
paragraphs on 5 x 8 cards, and distributing 
them in the appropriate categories in a file. 
Each member university has a complete set of 
files. Of particular importance for the problems 
which have been discussed above is the fact that 
the files aim at world coverage, with each so- 
ciety carefully identified and each source evalu- 
ated. By using these files, a cross-cultural study 
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can be done in a small fraction of the time it 
would otherwise take. 

The selection of cases by the Human Rela- 
tions Area Files has another advantage for 
cross-cultural research. The analysis of a sample 
drawn from these files will be available for the 
next study based upon the same or an overlap- 
ping sample. This is particularly valuable if 
the analysis includes scalar judgments done with 
due care for independence and reliability. For 
the information of those who may wish to take 
advantage of scores on numerous variables on 
overlapping samples, they are available in the 
following sources: Horton (1943), Murdock 
(1949), B. B. Whiting (1950), Ford and Beach 
(1951), and Whiting and Child (1953). 

The following example will demonstrate the 
cumulative value of cross-cultural research 
based on overlapping samples of societies. Ford 
and Beach (1951, p. 219) report on the dura- 
tion of the taboo restricting coitus following 
childbirth, Whiting and Child (1953, pp. 841- 
343) have a scale measuring the severity of 
childhood sex training on twenty-nine overlap- 
ping societies, On the hunch that the length 
of the taboo might be related to sex anxiety 
induced during the child-training process, these 
two variables were correlated. The results, pre- 
sented in the following table, show that the 
societies with high sex anxiety are more likely 
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to have long postpartum taboos than those with 
low sex anxiety. 

Thus far, cross-cultural research based upon 
available ethnographic literature has been un- 
der consideration. There is, however, another 
method being developed that is of particular 
interest for psychologists. This method consists 
of gathering material in a sample of societies 
for the particular purpose of testing a hypothe- 
sis or a set of hypotheses. Such a method obvi- 
ously cannot be undertaken by a single person. 
To learn a language, establish rapport, and 
adapt instruments and then gather material is 
a long and arduous task. For one person to 
gather material on enough societies for an ade- 
quate sample would take a lifetime or more. It 
is feasible, therefore, only through cooperative 
efforts. 

An example of a cross-cultural study with the 
above type of orientation is the values project, 
“A Comparative Study of Values in Five Cul- 
tures,” directed by Kluckhohn, Roberts, and 
Vogt of Harvard University. In this project five 
southwestern communities in the same environ- 
ment but with differing cultures are being 
studied. As a part of the research design of this 
project, the same variables relating to an over- 
all theory of values have been studied as they 
manifest themselves in each society. 

This type of project not only has the advan- 


TABLE 2 


RELATION BETWEEN THE DEGREE OF SEX ANXIETY INDUCED DURING SOCIALIZATION AND THE DURATION OF 
THE POSTPARTUM TABOO RESTRICTING COITUS 


Duration of the post-partum taboo restricting coitus 


A few weeks A month One year to 
to a month to a year three years 
EE OIN e Hoo Codd tae m NN 
Abipone 
Azande 
Chiricahua 
High sex Dahomeans 
anxiety Dobuans 
Manus Chagga Kwoma 3 
Sanpoil Chamorro Tiv d 
Tanala Riffians Wogeo 
Chenchu Kwakiutl Bena 
Chewa Hopi Lesu 
Lamba Thonga Trobrianders 
Low sex Lepcha 
anxiety Marshallese 
Marquesans " 
Murngin 
Siriono 
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tage of increasing the comparability of data 
gathered from different societies, but it also 
permits the testing and retesting of hypotheses 
based upon individual differences in a number 
of different cultural settings. For example, a 
number of hypotheses relating to techniques of 
child rearing and the internalization of moral 
values were tested on a sample of individual par- 
ents and children in each of three of these cul- 
tures by Faigin (1952) and Hollenberg (1952) . 
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This method bids fair to greatly increase the 
power of cross-cultural research, since a particu- 
lar theory can be tested within a society at the 
same time that an additional case is being added 
to the cross-cultural sample. 

In sum, then, the cross-cultural method, al- 
though it is still in its infancy, shows promise 
of being a useful adjunct to other research 
methods designed for the development of a gen- 
eral science of human behavior. 
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CHAPTER 15 


The Social Significance of Animal Studies 


D. O. Hess AND W. R. THOMPSON* 
McGill University 


We do not restrict ourselves in the present 
chapter to making a review of studies of social 
behavior in subhuman animals; in fact, not 
even a thorough review of this kind can be 
attempted. Our aim instead is to deal with 
the animal work that may help in understand- 
ing human society. In doing so, we must 
emphasize intellectual processes, and we shall 
give a large proportion of our space to emotion 
and motivation in the individual animal. Thus 
we neglect some important studies of social 
organization, especially in the ant and the bee 
(but also in birds and mammals when it does 
not seem that anything in particular is added 
to the understanding of mań). Our treatment 
is largely determined by the look of open as- 
tonishment that we have often seen at the idea 
that the study of animals has any interest for 
social psychologists. We cannot do justice in 
one chapter to the literature of comparative 
social psychology, but we do hope to convince 


the reader that he should read further, and 
that he need not think it a social error to be 
found with a copy of the Journal of Compara- 
tive and Physiological Psychology in hand. 

The study of animal behavior is significant in 
social problems for the same reason as in the 
problem of learning; we shall, in fact, try to 
show that social psychology must be dangerously 
myopic if it restricts itself to the human litera- 
ture. Our aim is thus to treat here the sub- 
human-animal data which relate to the prob- 
lems, methods, and theories of human social 
psychology. After looking briefly at the logic 
of this relation, our discussion will concern it- 
self first with intellectual factors in communi- 
cation and cooperation and will then go on to 
motivational factors, although we recognize 
fully that this is an artificial separation. Finally, 
we shall try to add all this up in a theoretical 
approach to human society from the compara- 
tive point of view. 


VALIDITY OF ANIMAL STUDIES 


There is a common opinion that understand- 
ing human behavior will result from the study 
of human behavior only, on the ground that 
proving something true of an animal does not 
prove it for man. The latter part of this state- 
ment is both correct and irrelevant. It implies 
that an empirical science proceeds by proof and 
disproof, which is a most misleading idea. 

No scientific theory can ever be proved true; 
nor indeed can we ever be finally sure of having 
proved one wrong. Too often in the history of 
science the demolished hypothesis has turned 
up again full of new life, following some slight 
modification of its assumptions. Lavoisier's oxy- 
gen theory of combustion was early disprovod 
(Conant, 1947); Newton's corpuscular theory 
of light, once discarded on the assumption that 
light must be either corpuscular or undulatory, 
but not both, is now in good standing again in 
somewhat modified form. As for proving that 
a hypothesis is true, it is notorious that ten 
thousand confirmations cannot prevent some 
spoilsport from thipking tomorrow of a new 
experiment that upsets it at one blow. 


? H 
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Thus the statement that animal experiment 
proves nothing for man does not go far enough: 
finding that something is true of man up till 
now, and for the limited sample of men that 
one has actually been able to test, cannot logi- 
cally establish that it will be true of other men 
in the future, as Hume noted long ago. Finding 
that a psychotic behaves in a certain way does 
not prove anything about normal persons, but 
no one would argue from this that clinical 
studies have not helped in our understanding 
of the normal. In principle, study of animals 
has exactly the same status. 

All very fine, says the social psychologist, but 
when in fact has animal psychology lived up to 
this billing, and helped in the way that abnor 
mal psychology certainly has? As a first answer 
we choose an example that may seem to have 


*We gratefully acknowledge the critical advice 
received from graduate students at McGill Univer 
sity and from Drs. D. Bindra, D. Krech, A. $ 
Luchins, H. W. Nissen, T. C. Schneirla, D. C. 
Williams, and J. P. Zubek. 


532 


THE SOCIAL SIGNIFICANCE OF ANIMAL STUDIES 


little bearing on social psychology — partly 
because it does have a bearing, and partly be- 
cause it clears up the logic of the way in which 
animal experiment contributes to a knowledge 
of man. 


STUDIES OF ANIMAL INTELLIGENCE 


The example is from the study of individual 
problem solving in the rat. There is a confused 
and extensive literature on the relation of en- 
vironment and heredity to the development 
of human intelligence. One group of studies 
has seemed to show that environment can have 
little to do with intelligence; another that it 
must have a great deal. In either case the argu- 
ment is somewhat indirect and not clearly de- 
cisive, and the literature itself has not led to a 
resolution of the apparent contradiction, In- 
deed, both Goodenough (1946), and Jones 
(1946) , in reviewing the literature, express dis- 
satisfaction with the ambiguities of human ex- 
periments on the subject. Studies on the effects 
of nursery school training on I.Q., for example, 
do not agree whether there is any increase in 
measurable intelligence. And the few case 
studies available on single individuals who 
have been severely restricted in infancy (Mason, 
1942; Hill and Robinson, 1929; Davis, 1940, 
1947) usually involve so many uncontrolled 
variables that interpretation is exceedingly dif- 
ficult. The situation obviously calls for a sys- 
tematic human breeding experiment together 
with a radical variation of the environment 
during growth. Of course no such experiment 
can be done. And of course if it is done with 
animals it would prove nothing for man. How- 
ever, the animals experiments have been done, 
and the significance they do have, vis-à-vis the 
human problem, is to show that the entire con- 
tradiction may stem from implicit assumptions 
about man for which there is no factual sup- 
port. The animal experiments, furthermore, 
suggest that they are wrong. 

Experimental studies have shown clearly that 
when fhe postnatal environment is held con- 
stant the level of learning capacity and problem 
solving in the rat is innately determined 
(Heron, 1935; Tryon, 1940; Thompson and 
Bindra, 1952) ; that is, one can breed rats that 
will be brighter, or duller, when brought up 
in the same environment. But they show 
equally clearly that when heredity is held con- 
stant between experimental groups, by the splite 
litter method, infant experience determines 
the level of adult problem solving (Hymovitch, 
1952; Bingham and Griffiths, 1952; Forgays and 
Forgays, 1952). The rat brought up in a “free 
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environment," or even with almost no freedom 
of bodily movement but with wide and exten- 
sive perceptual experience, is the better problem 
solver at maturity. For the rat, then, adult 
intelligence depends both on heredity and on 
the stimulating action of the postnatal environ- 
ment. 

Now the implicit assumption has been made 
in interpreting the human data that the effect 
of learning on intelligence can be estimated at 
any time during growth; if special experiences 
at the age of 10 or 12 have little effect on the 
I.Q., if a radical change of environment at age 
6 has only a slight effect on the I.Q. by age 7, 
it is concluded that the kind of experience the 
child has, at any time during his growth, is not 
an important factor in his intellectual develop- 
ment. This is, in effect, the assumption that 
the learning process is the same at all stages of 
growth, as is also the generality of transfer 
effects. 

But we have no evidence to show that this is 
so. 'The animal evidence, furthermore, indi- 
cates that the assumption is wrong. If instead 
we assume for man, as for the rat, that experi- 
ences early in development may have a widely 
generalized and lasting effect which they do not 
have at a later age, the contradictions of the 
human data disappear. It is still necessary to 
see how the principle is to be applied to man 
in detail. We still must discover at what ages 
the various aspects of his experience have their 
lasting effects, and what these experiences are. 
But finding that the adult LO. is unmodifiable, 
or that putting the underprivileged 6-year-old 
into a good school does not greatly raise his 
I.Q. in the next year or so, is no longer incon- 
sistent with the idea that the man reared from 
birth in an impoverished environment has a 
lower LO, than he would otherwise. 

Thus the first point we make in citing this 
work is that animal experiment may clarify a 
human problem without "proving" anything. 
It may draw attention to facets of human be- 
havior one has not noticed; it may point to a 
troublemaking but implicit assumption; it may 
suggest a new principle of human behavior. 
Furthermore, animal experiment in the past 
hes repeatedly shown that the treatment of 
some human problem or other has been over- 
simplified. But in all these cases, the relevance 
of the animal work is strictly dependent on 
whether, when applied to man, it does clarify 
the human problem. 

The closer one stays to man's behavior the 
less one can hope to see its broader outlines. 
One takes for granted too easily the things that 
all men have in common "without seeing how 
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extraordinary some of them are. We concede 
that animal psychology has had its own myopic 
errors, but these have usually arisen when study 
is concentrated on some one laboratory animal, 
rat or cat, regarded simply as a convenient sub- 
stitute for man and not as the object of com- 
parative study. For effective recent discussions 
of the distinction between animal psychology 
and comparative psychology see Beach (1950) 
and Schneirla (1952) . 


THE EARLY ENVIRONMENT 


The second point we wish to make, in con- 
nection with the experiments cited in the pre- 
ceding section, has to do with the role of the 
early environment in the development of social 
intelligence and social attitudes. The problem 
is on all fours with the “mental hygiene" prob- 
lem of emotional development. What childhood 
experiences make for an ability to "understand 
people," to be sensitive to another's mood, to 
know how a contemplated action will affect 
someone else? We can no longer assume that 
the insensitive person simply fails to inherit the 
necessary social intelligence. He may or may 
not. The social problem of education is not 
limited to the guidance of emotional develop- 
ment and the establishment of desirable atti- 
tudes toward others. 

Mental hygiene in this sense is not enough. 
It would be silly to think that a young woman 
would have much chance of being emotionally 
stable if she continually blundered in her hu- 
man relationships, frustrating herself socially; 
or that a politician could put “democratic” 
principles into effect, no matter how good his 
intentions, without being able to recognize 
ability and honesty and enthusiasm, or without 
knowing how to lead except by giving orders. 
"There is no proof whatever that man's social 
intelligence, his social skills, must be fostered 
in childhood to reach {ull development, but 
everything we know suggests that this is so. For 
a true mental hygiene, therefore, as well as for 
a better social order, we must find out in what 
circumstances the child's social intelligence 
reaches its full development. 

As to social attitudes, again, there is no reason 
to think that all the recently emphasized work 
on the role of the early environment in lower 
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animals can be applied directly to man, but 
there is little doubt that it applies in principle, 
when we can homologize properly. Hoarding 
in the adult rat (Hunt, 1941; Hunt, Schlosberg, 
Solomon, and Stellar, 1947) , increased by hun- 
ger in infancy, may or may not be homologous 
with miserliness in man. It is even harder to 
draw a human moral directly from the very 
important experiments of Lorenz (1935, 1952) 
on "imprinting" in young ducks and geese. But 
taken more broadly, with other studies showing 
a general importance of early experience on 
social attitudes and emotionality, such as those 
of Hall and Whiteman (1951), Bernstein 
(1952), Levy (1934), Ross, Smith, and Denen- 
berg (1950), and Thompson and Heron (1953), 
the observations in question assume greater sig- 
nificance — presenting as it were undiluted 
examples of a kind of lasting fixation that may 
be fundamental in mammalian behavior but 
obscured or overlaid by the greater complexities 
of learning in higher forms. This possibility is 
brought closer to man by Nissen’s (1951) esti- 
mate of the relation between sexual deviations 
in the chimpanzee and extensive early exposure 
to man. 

Failing the possibility of radical rearing ex- 
periments with human children, it seems that 
comparative study offers a solid line of advance 
in this field. Studies of emotionality and social 
attitude in a number of species will provide 
some basis for tentative extrapolation of the 
curves of phylogenetic development through 
chimpanzee to man, where the ideas thus ar- 
rived at can be tested for their value in clarify- 
ing clinical and naturalistic observations. The 
striking parallel in the implications of von Sen- 
den’s (1932) data on human congenital blind- 
ness and Riesen’s (1947, 1950) on chimpanzees 
reared in darkness, and the equally significant 
study by Nissen, Chow, and Semmes (1951) on 
the role of early experience in somesthesis, show 
that in one area at least, that of perceptual 
function, some of the effects of the early en- 
vironment on human behavior could have been 
predicted from animal experiment. The prob- 
lem with respect to social attitudes should not 
be intrinsically more difficult, and the more 
the points that are determined in the phylo- 
genetic curve the more certain the extrapola- 
tions can be. 


SociAL. PHENOMENA AMONG INSECTS 


President Lowell of Harvard is said to have 
introduced W. M. Wheeler for an honorary 
degree as a distinguished scientist who had 
shown that ants, too, can build a complex so- 
ciety without recourse to reason. We would 
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"not argue against Lowell that human society 


is the product of dispassionate deliberation. 
We do urge that even its primitive forms wou 
be impossible if its members did not have their 
capacity for thought and speech. Whether he 
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uses his reason in a reasonable way or not, it is 
quite clear from phyletic comparison that the 
first key to man’s social structure lies in his 
distinctive intellectual qualities. 

The point can be put briefly: the ant's social 
mechanisms are sensory, man's ideational. This 
is a distinction that is fundamental to our whole 
treatment. Accordingly, we shall first clarify 
this use of terms, and go on in the present 
section to describe some of the social behavior 
of insects. It contains striking parallels with 
human society. But these parallels are super- 
ficial, as most of the rest of this chapter will 
try to show. 

On the use of terms: first, though our defini- 
tion may not be rigorous, we shall consider a 
social act to be one that is determined by past 
or present stimulation, or expectancy of stimu- 
lation, from another member of the species; 
or done with the expectancy (intention) of 
evoking a response from him. This does not 
wholly exclude interaction between members of 
different species, if the behavior of the two is 
at all similar and produces relationships similar 
to those within a species; a pet dog is sometimes 
considered, and considers himself to be, a mem- 
ber of the family. But we do exclude pet gold- 
fish; together with cattle-herding, fly-swatting, 
lion-taming, salmon-fishing, and other forms of 
predation, It is important also to exclude 
pseudosocial behavior, in which grouping oc- 
curs only because of some stimulus external to 
the group: examples are animals running from 
a forest fire, moths attracted to a light, and (a 
slightly closer approach to social phenomena) 
situations in which a by-product of one animal's 
existence attracts another, as in the association 
of a worm-seeking robin with a ploughman, or 
the huddling of cattle for warmth. 

Next, for an adequate description of what 
happens in phylogenesis, there is no escape from 
distinguishing between behavior that is more 
and less stimulus-bound (Goldstein, 1940) or, 
more broadly, situation-bound. This is Schneir- 
la's (1946) “biosocial” versus “psychosocial” 
distinction. The biosocial act is one which is 
dominated by the present environment (includ- 
ing humoral factors which are part of the 
environment of the nervous system) as distinct 
from the psychosocial act, in which “ideation” 
and “purpose” appear to be crucial. 

The empirical criteria of sense-dominated or 
biosocial behavior are immediacy of response 
and the possibility of eliciting the same re- 
sponse repeatedly by presentation of the same 
stimulating situation. The predictability may 
be for the individual animal only, but the pic- 
ture is clearest when the same stimulus-response 
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relationship holds for the whole species (which 
does not necessarily mean that it is entirely “un- 
learned”). Examples: the response of bird or 
fish to one of the "releasers" of Lorenz and 
Tinbergen (Tinbergen, 1951), or the dog's 
aggressive response to the growl of another dog. 

The psychosocial act, under ideational con- 
trol, is harder to define — there may in fact be 
several categories here if we know how to dis- 
tinguish them — but in general it falls under 
the heading of behavior which has led in the 
past to use of the term expectancy, or some 
other reference to a nonsensory determinant 
of response. The behavior is well adapted to 
the environment and hence partly under sen- 
sory control, but is not predictable from the 
present stimulation alone and makes it neces- 
sary to postulate expectancy, set, attitude, or 
selective attention as codeterminants of the re- 
sponse. 

The reader is likely to have difficulty with 
this distinction if he regards it as absolute, or 
primarily due to some theory or other. It is 
primarily empirical. It is not a distinction 
between learned and unlearned, and our em- 
phasis is not on supposed differences in the 
complexity of the response mechanism. One 
can hardly be certain that all "ideational" re- 
sponses, about which we know little, are more 
complex or "higher" than all reflex responses, 
some of which are known to be complex indeed. 
Our emphasis instead is on differences in the 
directness of control of response by sensory and 
humoral influences. 

The most elaborate social structures occur 
far apart in the phyletic scale: man's at the top, 
the social insects' near the bottom. In between, 
organization is loose and sporadic. Apart from 
the division of labor involved in sex behavior 
and in parental care, there are few cases in 


‘which subhuman vertebrates are essentially de- 


pendent on other members of the species for 
specialized functions that they do not have 
themselves. Individual animals may act in a 
way that serves the whole group, but the service 
could be performed about as well by other mem- 
bers and there seems to be nothing in the 
behavior of the gregarious birds and mammals 
that parallels the specialization of function in 
the castes of social insects, or the specialization 
of the human officeworker who might die if he 
were suddenly left without food and clothing 
produced by others, before he learned how to 
produce the tools or weapons he would need 
to support life. 

The figures of speech, caste, queen, soldier, 
and worker, point up this analogy between so- 
cial insects and man. But as Schneirla (1946) 
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in particular has insisted, it is an analogy, not 
to be pushed too far. Despite having only a 
rudimentary social structure, the chimpanzee 
has more to tell us concerning the nature of 
human society than all the social insects taken 
together. However, the phenomena of the in- 
sects social organization are fascinating in 
themselves, and (as we have said) they do clar- 
ify the human problem by setting it in perspec- 
tive. 


THE PHENOMENA OF INSECT SOCIETIES 


The founding of a colony varies only slightly 
with the family. In most wasps, ants, and 
termites, the fertilized female leaves with or 
without a small company of swarming workers 
to found a new home. In the bumblebee, a 
queen, alone, selects some cavity in the ground 
or in a log and at first behaves like a solitary 
bee. Later, the social behavior characteristic of 
a worker appears in the care of her young. Still 
later, when there are workers, she takes the 
role of the queen only. The progression of the 
colony from a rather primitive to a more com- 
plex social organization is also characteristic of 
ants. In a new colony the first brood is always 
composed of workers only. These take care of 
the second brood, in which soldiers appear. 

The division of labor in human societies is 
approximate and flexible. In the social insects 
it is much more exact and rigid, based on the 
needs of reproduction, nutrition, and protec- 
tion which underlie colony life. Each insect 
caste is dedicated to a certain function, the 
queens to reproduction, the workers to nutri- 
tion, and, in wasps, to protection also, although 
in ants and termites this latter function is 
served by soldier castes. Each caste is highly 
specialized structurally to perform its appropri- 
ate task. In a manner, though there seems 
little to be gained by it (cf. Schneirla, 1946) , 
they may be considered as analogous to parts of 
a "supraorganism" (Allee, Emerson, Park, Park, 
and Schmidt, 1949) . 

The determination of castes is probably genet- 
ic as well as nutritional. In the cases of the two 
sexes (and thus of the two reproductive castes) 
it appears to derive from genetic differences in 
the egg (Light, 1942). But in bees and wasps, 
determination of queen and worker castes de- 
pends largely on nutritive factors. The larvae 
that are to become queens are fed entirely on 
“royal jelly,” the worker larvae on “royal jelly” 
only for the first three days and then on “bee 
nectar.” (The workers are a sterile caste which 
may be said to base been “nutritionally cas- 
trated,” to use Wheeler's phrase) The queen 
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reaches maturity in about sixteen days, as 
pared with twenty-one for workers and twenty- 
four for drones. Similarly, in ants, caste 
differentiations between queen, workers, and 
soldiers of the same sex appears to be deter- 
mined more by environmental than by heredi- 
tary factors, It is known, for example, that ant 
colonies with a small supply of food will pro- 
duce only workers (Light, 1942), and again 
that soldier ants in a colony over a given thresh- 
old number will inhibit the development of 
more soldiers (Gregg, 1942), this possibly being 
accomplished by the soldiers’ exuding an exo- 
hormone that passes on to developing individu- 
als or sometimes to the unlaid eggs (Allee et al, 
1949). In general, caste determination appears. 
to be fairly flexible in the first stages of growth 
in the social insects but, once determined, an 
individual’s caste seems to remain rigidly fixed. 

The structural differences of the castes, un- 
derlying the functional ones, are often striking, 
involving gross differences in size as well as in 
specific characters. In one genus of ant (Core: 
bara) the queen is several thousand times as 
large as the worker (Wheeler, 1922) . The wasp 
workers which double as soldiers are equipped 
with a sting, unlike the drones; but in some 
ants and termites, the soldier may guard the 
colony merely by blocking the entrance with 
his head. In other species, the soldiers are 
equipped with large, powerful jaws. Many 
mandibulate termite soldiers and all the nasute 
have a large frontal gland which excretes a 
sticky liquid or “latex” capable of disabling 
enemies, 

In addition to the different castes, there are 
often “guests” and “slaves,” more often in ant 
colonies, but also in those of bees, wasps, and - 
termites. Wasmann (1905), Wheeler (1928), 
Plath (1935), and others have discussed these 
phenomena fully. Treatment of guests ranges 
from bare tolerance to definite encouragement. 
An interesting example of an ant guest (Lepto- 
thorax emersoni) is described by Wheeler. It 
is associated with a larger host (Myrmica cana- 
densis) and lives in small chambers near the 
surface of the latter's nest. The smaller ants 
have the habit of climbing on the backs of 
passing host workers and licking their bodies, 
especially the heads and mouthparts. The hosts 
permit this and occasionally regurgitate droplets 
of food which the guests eat. The guests, OR 
the other hand, do not allow Myrmica into 
their own colony, except under special condi- 
tions. Many species are able to induce Phytoph- 
thora (plant lice, scale insects, mealy bugs 
etc.) , by stroking them with their antennae, t 
regurgitate honeydew the latter have su 
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up from plants. These guests are often housed 
in specially constructed caverns (“barns”) in 
the nest, and thus were called by Linnaeus “the 
dairy cattle of the ants.” 

The extraordinary social phenomena of the 
insect world include slaves also, Ant slave- 
makers have been classified into two types, 
“obligatory” and “facultative,” depending upon 
whether slaves are necessary for existence or 
not. Four known genera of ants are slavemakers 
(Formica, Polyergus, Strongylognathus, and 
Harpagoxenus). One commonly has slaves but 
can get along without them, three are of the 
obligatory class and must have slaves to sur- 
vive. The slaves are generally taken as larvae 
in raids on other colonies, hatched and raised 
in the slavemakers' nest, and put to work exca- 
vating the nest, feeding and caring for the 
young, and doing other chores normally carried 
out by the workers of that colony. Polyergus 
ants are completely dependent on their slaves 
and often deplete a whole nest of its larvae and 
pupae, although the attendant workers are not 
harmed unless they resist (Plath, 1935). A 
similar phenomenon, but this time analogous 
to child labor instead of slavemaking, is found 
in another species (Oecophylla smaragdina) : 
the worker ants use silk produced by their 
larvae to bind leaves together for the nest. The 
head of a larva is applied to each leaf in turn 
by the worker’s holding it in the appropriate 
position. During the process the larva ''co- 
operates" by bending its head whenever it is 
lowered to a leaf (Plath, 1935). Many other 
similar examples could be cited. 


COMMUNICATION IN INSECT SOCIETIES 


One essential key to social organization is the 
means of communication, in the broad sense 
of the way in which the behavior of one animal 
induces cooperative behavior in another. As a 
prelude to examining the development of com- 
munication, let us see what the cues are upon 
which the complex social organization of in- 
sects depends. 

For the ant, the "signals" are mainly tactual, 
chemical, and auditory (Allee et al, 1949). 
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Of these, the chemical may be dominant, An 
ant with a strange odor is attacked, even if a 
member of the colony; a foreigner, experimen- 
tally provided with the colony odor, can be 
introduced safely (Dropkin, 1941). But this 
is by no means representative of all insect inter- 
action. Wheeler (1928) reported that the 
hungry larvae of a wasp colony protrude their 
heads from their cells and may actually scratch 
on the walls of the cell, thereby attracting the 
attention of wasp nurses which, in feeding the 
larvae, are rewarded by drops of sweet saliva 
excreted from the mouths of their charges. A 
most interesting type of communication occurs 
in those species of ants and termites whose sol- 
diers block the exits to the nest with their heads. 
When a worker wishes to leave or enter the 
nest, she strokes the soldier's head or abdomen 
with her antennae, whereupon the living "door" 
moves back to let her pass. After she has gone, 
he moves back into position (Wheeler, 1928) . 
The ability of the soldier to discriminate a 
friend from a stranger is, in itself, a remarkable 
feat. How it is done is not known, nor whether 
it is learned (that is, whether the discrimina- 
tion requires practice or improves with repe- 
tition) . 

The most remarkable communication occurs 
in bees, brilliantly demonstrated in systematic 
experiments by von Frisch (1950). As he re- 
marks (p. 85), the results at first glance seem 
fantastic, but they have been attested by Thorpe 
(1950) and others. The worker who has found 
a food source is able without leaving the hive 
to give her fellows not only the direction but 
also the distance of the food, by the "wagging 
dance." The farther the food, the fewer the 
number of turns in the dance. The angle of 
the dance on the vertical comb, relative to grav- 
ity, gives the direction relative to that of the 
sun. If the sun is hidden, polarized light from 
an area of blue sky still provides orientation, 
and it appears that the polarization is the essen- 
tial cue whether the sun is out or not; no com- 
munication is possible if the sky is completely 
overcast. Through this extraordinary behavior 
other workers are enabled to find the food 


source. 
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LEVELS OF COMMUNICATION AND COOPERATION 


Such phenomena raise at once the question of 
their similarity to human communication. 
Does von Frisch’s evidence mean that the bee 
has the equivalent of a human sign language, 
though with a small vocabulary? The question 
might be phrased, Is the bee's behavior sym- 
bolic?, but the term symbolic has been used 


in so many ways that we find it confusing. A 
main purpose of this section is to find out how 
to ask the question better, to find better ways 
of defining the essential differences of communi- 
cation at various phyletic levels. 

Most insect communication is clearly in the 
class of the sense-dominated social act. The 
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typical stimulus is a scent, touch, or sound that 
evokes a typical response, not varying much at 
different times. The invariability of insect be- 
havior has often been exaggerated, but certainly 
the relation between stimulus and response is 
much closer than with mammals. 

The behavior described by von Frisch does 
not appear to be essentially different, although 
it does appear unusually complex. The wagging 
dance or “message” involves a delayed response 
but is directly controlled otherwise by the spe- 
cial effects of polarized light on the complex 
eye of the bee. The number of turns in the 
dance may be determined by fatigue or in some 
other equally elementary way, since it actually 
corresponds to amount of effort or elapsed time 
rather than distance: it decreases with distance, 
and also with a head wind, but increases with 
a tail wind. No one, of course, would suggest 
that the bee performs some kind of trigonomet- 
ric computation which is then expressed in the 
dance, and remarkable as the whole behavior 
is, it may still lie in the reflexive class. 

Similarly, the problem of the way in which 
other bees receive the message is unsolved. It 
has not been shown that the behavior has the 
characteristic of intentionality (Bierens de 
Haan, 1929) or purposiveness (Maier and 
Schneirla, 1935) , which is essential before one 
can conclude that the bee has a language. This 
point is spelled out in the following section, 
but we can anticipate here to the extent of 
suggesting the sort of evidence that might be 
looked for. Evidence of intentionality might 
be provided, for example, if only the first of 
several returning bees made the wagging dance 
— if the message is already being conveyed to 
the colony by ten other bees there is little sign 
of purpose in the behavior that conveys it once 
more. Or again, if one could remove the return- 
ing bee's audience experimentally, would she 
still make the wagging dance, as if the audience 
were there? We have no intention of making 
light of this extraordinary behavior, but under- 
standing it requires that we see its limits as 
well as its extent. The evolutionary origins of 
so-called instinctive behavior in general are as 
difficult to understand, as miraculous, as those 
of the so-called intelligent behavior of higher 
mammals; but let us avoid mixing up our 
miracles. 


PURPOSIVE COMMUNICATION AND LANGUAGE 


Bierans de Haan and Maier and Schneirla 
Have reserved the term communication for pur- 
posive communication, the act that is done 
“with regard to the effect it has on other in- 
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dividuals.” We propose here to use the term 
more broadly but not to lose their essential dis- 
tinction. By including a class of “reflexive” 
(nonpurposive) communication, we are able to 
recognize yon Frisch's phenomenon as commu- 
nication and still not confuse it with phenomena 
of a psychologically higher level. 

The conception of purposive behavior is not, 
of course, a vitalistic one. The difficulty in iden- 
tifying behavior as such is that it may have to 
be seen repeatedly in order to be sure that its 
objective is achieved neither accidentally nor 
mechanically; that it shows “means-end-readi- 
ness” (Tolman, 1932), or a variation with cir- 
cumstances that tends toward the same end 
effect. A bird’s “warning” cry would be pur- 
posive, for example, if made only when the 
young were exposed and stopped when they got 
under cover, or if the warner approached the 
warned and called quietly, without drawing 
the marauder’s attention also, The essence of 
the purposive communication is that the sender 
remains sensitive to the receiver's responses, 
during sending, and by modification of his send- 
ing shows that his behavior is, in fact, guided by 
the intention (expectancy) of achieving a par- 
ticular behavioral effect in the receiver. The 
“broken wing” behavior of the grouse, for ex- 
ample, is purposive, since it is modified accord- 
ing to the marauder’s behavior in such a way 
as to draw him away from the hiding chicks; 
but the warning cry of the gull, although it 
tends to produce a protective immobility in the 
chicks, seems quite reflexive. Carpenter (1942) 
points out that the alarm calls of a group of 
primates, startled by an observer, continue long 
after they have got away out of danger. The 
calls are therefore more emotional expression 
than purposive, 

Birds and animals do show purposive com- 
munication. It seems likely that the crow, for 
example, or the parrot, may on occasion call 
or “speak” with intent to influence another. A 
dog's bark, or a chimpanzee's begging gestures, 
may be unmistakably purposeful. This does 
not mean, however, that the bird or subhuman 
mammal has language. It is important to recog 
nize that purposeful communication occurs in 
animals, and just as important to recognize that 
language is something more. p 
_ The criterion of language is not only that it 
is representational, for representational acts OC 
cur in animals. Man has what neither chimpan- 
zee nor parrot has: the capacity to combine 
and readily recombine representative or sym- 
bolic noises (words), movements (gestures) , OF 
modifications of inert things (writing, carving) - 
We propose therefore that the minimal ctr 


Eg a 


THE SOCIAL SIGNIFICANCE OF ANIMAL STUDIES 539 


terion of language, as distinct from other pur- 
posive communication, is twofold. First, lan- 
guage combines two or more representative 
gestures or noises purposefully, for a single 
effect; and secondly, it uses the same gestures 
in different combinations for different effects, 
changing readily with circumstances. As Mor- 
ris (1946) and Schneirla (1946) have noted, 
language has other distinctive characteristics, 
but we believe that these criteria are enough 
to set it off fully from animal communication. 
Too much attention, we believe, has been given 
to man’s special vocal behavior. In sign lan- 
guage and gesture man's distinctiveness is 
equally clear (Révész, 1944), and we avoid 
the confusing issue of whether the subhuman 
primate has the vocal equipment for producing 
human sounds. ; 

, From this point of view, the outstanding fact 
is man's capacity for a varied combination of 
symbolic acts, and we therefore propose to call 
language syntactic behavior. We thus have 
three classes of communication: (a) reflexive 
(not true communication); (b) purposive but 
nonsyntactic; and (c) syntactic (and usually 
purposive), or true language. 

There are evidently differences of level with- 
in each of these classes, and though it seems 
clear that true language is unique to man, we 
can trace an increasing complexity of pur- 
posive behavior in subhuman animals which 
suggests that the discontinuity between ape and 
man, in communication, is less than it appears. 
The chimpanzee may well be close to a liminal 
level that would make language possible (Hayes 
and Hayes, 19512) . 

Underlying syntactic behavior there must be 
a certain independence or autonomy of con- 
ceptual processes. T'wo or more concepts must 
be capable of being aroused simultaneously, 
instead of one crowding the other out; and, 
as the term concept itself implies, they must be 
arousable by associative mechanisms in the ab- 
sence of the sensory events that originally gave 
rise to them, That is, they are independent of 
any particular external environment, and of 
each other. Conceptual process 4 may be linked 
momentarily with conceptual process B at one 
time, with C or D at another. We are not talk- 
ing now of the processes underlying the elabo- 
rate sentences of adults, but simply of what is 
needed to account for the performance of the 
child who can use a vocabulary of four words to 
form the propositions Daddy go,” "Mommy 
go,” "Mommy come,” and “Daddy come.” This 
is beyond the level of any animal except man; 
but putting the problem in these terms allows 
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us to relate it to other phenomena that do oc- 
cur in the animal world. 

Let us propose às hypothesis the more general 
proposition, that the intellectual development 
in phylogenesis that eventually makes speech 
possible is an increasing independence of the 
conceptual activity from the present sensory 
environment, and an increasing capacity for 
entertaining diverse conceptual processes at the 
same time. This is the capacity to respond to 
the present environment in one way and think 
of responding in another; for perceiving the 
present situation as it now is, and conceptually 
adding something else to it; for planning a se- 
ries of actions of which only the first is immedi- 
ately feasible. At the lowest level, it is the 
capacity for delayed response or a simple ex- 
pectancy; at the highest level, for "holding" 
not only a series of words but also of sentences, 
whose meaning only becomes clear with later 
words or sentences (Hebb and Bindra, 1952). 

There is clear evidence of phylogenetic de- 
velopment in such capacities. The chimpanzee 
moving a box to stand on, to get fruit that is 
otherwise out of reach, manifests a “first this, 
then that" planning which is beyond the dog, 
and Kóhler's (1927) evidence leaves little doubt 
that the sequence of acts is, in fact, planned in 
advance. Entirely unmistakable evidence of 
such ability is found in the deceitful attack that 
almost any adult chimpanzee is capable of. We 
shall return to this behavior in a later section, 
but here we may cite the chimpanzees of the 
Yerkes colony who, seeing a visitor being es- 
corted through the Laboratories, (a) slip over 
quietly to the faucet and get loaded, (b) return 
to the front of the cage nearest to where the 
procession will pass and sit quietly, with no 
sign of hostility, until (c) the visitor is within 
range, at which point he is suddenly drenched 
with a pint or so of water. This makes two 
points. It is evident, first, that a series of actions 
is planned in advance; and secondly, that the 
chimpanzee has the ability to act in one way 
while thinking about acting in the opposite 
way. 

GË related is the empathy described by 
Köhler (1927, p. 168 and Plate VII). One chim- 

epanzee watches another precariously reaching 
for food. The watcher stays at a distance, does 
not try for the fruit himself, and still identifies 
with the other to the extent of making identical 
reaching movements. That such spontaneous 
imitation can occur in chimpanzees is confirmed 
by the socially significant paper of Hayes and 
Hayes (1951b), and Kóhler's empathy should 
be collated with Mead’s idea (Morris, 1946, p. 
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45) that communication involves “taking the 
role of the other” and “the ability to be the 
other at the same time that he is himself" (ital- 
ics ours). Such a proposal becomes intelligible 
if two or more phase sequences (Hebb, 1949b), 
or trains of thought, can exist side by side in 
the brain of the higher mammal. 

But this, of course, is not the place to elab- 
orate a whole theory of communication. We 
can sum up the present discussion by saying 
first that the criterion of syntactic behavior 
clearly separates man from ape, in communica- 
tion; but secondly, that the appearance of some 
other aspects of behavior in the chimpanzee 
shows that the ape may be closer to man in this 
respect than he seems. 


` ANIMAL COOPERATION 


Much the same conclusions must be drawn 
concerning cooperation. Phyletically, coopera- 
tion parallels communication to such a degree 
that there is a prima facie case for concluding 
that both depend on the same intellectual char- 
acteristics. The cooperation of insects, as we 
have seen, is complex indeed but shows little 
sign of purpose. Short of man, the highest level 
that has been demonstrated experimentally is 
in the chimpanzee (Crawford, 1937, 1941) but 
it seems true of his cooperation, as of his com- 
munication, that the ape is not decisively 
superior to other mammals and definitely lacks 
a kind of teamwork that is common in man 
(Nissen, 1951). Crawford's experiments have 
been summarized by Nissen (1951, p. 444) as 
follows: 


In a laboratory experiment, Crawford first 
trained young chimpanzees individually to pull 
in a weighted box, by means of an attached rope, 
in order to obtain the food reward on the box. 
"Then the weight of the box was increased (so that 
one animal could not move it); next, two ropes 
were attached, the box was baited with two por- 
tions of reward, and two animals were put in the 
cage. At first each chimpanzee pulled without any 
reference to his partner's actions; only by chance 
did the two pull at the same time and so move 
the box. It was necessary for the experimenter to 
give a signal, which the partners had individually 
associated with initiating the pull, to get them ro, 
pull together. Once the temporal coordination of 
effort had been established in this way, the ex- 
traneous signal could be omitted. One of the 
partners, 4, usually watched the other one, B, 
and was ready to add her pull as soon as B 
started heaving on the rope. Sometimes it was 
arranged that B was satiated when the experiment 
began; A would then urge and direct B to pull. 
Eventually B would pull, although she had no 
desire for the food reward. In these circumstances 
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A often reaped the entire material benefits of 
the teamwork. 

In a later experiment by Crawford one animal 
was trained to push against four colored plaques, 
arranged around the sides of a cage, in a certain 
order. The cage was then divided into two 
sections by a partition of bars; the colors to be 
pushed first and third were on one side, the 
colors to be pushed second and fourth were in the 
other part of the cage. The trained animal was 
then put on one side, and her partner, who 
knew the mechanics of pushing the plaques but 
not the correct order, was put on the other side 
of the partition. The trained animal X watched 
her partner's actions and pushed the color avail- 
able to her after Y had pushed the color preceding 
it in the sequence; X sometimes solicited her 
partner, using begging gestures . . . that were 
spatially directed toward the particular color on 
her partner's side that was the next one in order. 


We know of no reliable report of cooperation 
by other animals that quite comes up to this, 
with its evidence of purposiveness. Yet it must 
be pointed out that there was hardly more than 
the barest beginning of teamwork as we know 
it in man, and that even this was achieved only 
in an artificial experimental setting which 
needed the planning of the human experi- 
menter. The experiment may be thought of as 
parallel to that of Hayes and Hayes (1951a) in 
communication, who managed by careful train- 
ing to get a chimpanzee to produce recognizable 
English words, but still without achieving real 
language. 

What may be described as a “one-sided coop- 
eration" certainly occurs in the chimpanzee. An 
adult animal rescues an infant heading for 
trouble (Nissen, 1931, p. 87): A helps B, but B 
does not simultaneously help 4. Teamwork at 
the human level is another matter. It involves 
a kind of prior understanding between coopera- 
tors that is impossible, perhaps, without a 
human capacity for communication, When X 
and Y cooperate fully, very complex mental 
processes are required. At each stage of action, 
X must anticipate not only his own next act and 
its effect, but also those of Y. Further, 3 may 
have to keep a number of possible eventualities 
in mind, and in each case have the expectancy 
of Y's acting in an appropriate way. This is 
the relationship that Sears (1951) has defined 
as dyadic. It is clear that the expectancies 
involved become very complex when the situa- 
tion is not under the complete control of the 
ccoperating partners. "Teamwork thus makes 
intellectual demands of the same order as those 
made by language. Psychologically, it may, in 
fact, be hard to distinguish the two. 

Let us then define three levels of cooperation, 
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corresponding very well on the whole to the 
three levels of communication already proposed: 
(a) reflexive, or nonpurposive; (b) purposive, 
but “one-sided”; and (c) “two-sided,” or team- 
work, The second of these is seen in higher 
vertebrates; the last seems to occur only in man, 
as far as one can tell from the available evi- 
dence. 

The difficulty in reviewing the literature on 
cooperation in its various forms is that those 
reporting have usually not known or have not 
understood the significance of Tolman’s (1932) 
clarification of purpose as a behavioral and 
nonteleological conception. They have tended 
therefore to see purpose in all behavior, since 
almost any act is bound to have some biological 
significance; or at the other extreme have tended 
to make all infrahuman social behavior reflex- 
iye and without any insight whatever. In either 
case the details of the behavior are often not 
reported in a way that allows one to judge 
whether its end effect is achieved with insight 
and intention or not. There is a great deal of 
anecdotal material to suggest the existence of 
altruism and cooperation in primate societies, 
for example, but the evidence in general has 
been so poor that Zuckerman (1932) had to 
conclude that no altruism exists, that the be- 
havior is always “immediate responses to spe- 
cific stimuli,” without comprehension. We 
shall try to show that this is not so, and that 
one-sided purposive cooperation and even the 
beginnings of altruism can be found in lower 
animals, but it certainly seems that much which 
at first sight is a planned cooperation is no more 
so than a litter of kittens is planned parent- 
hood on the part of a tomcat. 

Lorenz (1934) has reported a good example 
of behavior that only looks altruistic. He found 
that jackdaws would always come to the aid of 
another daw that had fallen into the hands of 
a common enemy, like Lorenz himself. When 
he had a captive bird in his hand, the others 
would attack him vigorously. But they would 
also attack him, with equal enthusiasm, when 
carrying a wet black bathing suit. Assuming 
that the jackdaws’ visual acuity is equal to the 
discrimination involved, as it must be, there is 
no reason to suppose that either attack was a 
purposeful attempt to help a fellow. 

A further difficulty, in reviewing the litera- 
ture on cooperation, is that some of the most 
interesting and significant observations are 
made in the field, and are naturalistic instead 
of experimental. The behavior cannot be re- 
peated at will, varying the circumstances for 
analysis. The phenomenon of leadership is an 
outstanding example: leadership can be quite 
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independent of dominance and has much 
greater social significance, as an integrating 
instead of disintegrating influence (Darling, 
1937; Carpenter, 1942). But leadership cannot 
easily be studied in a cage, and dominance can. 
As a result, dominance has been greatly over- 
rated as a social phenomenon (Schneirla, 
1952), and despite such excellent reports as 
Darling's or Carpenter's we know little about 
the psychological conditions of leadership or 
the closely related role of the sentinel. 

Dominance often determines leadership, but 
often not. Greenberg (1947), for example, 
found that dominant sunfish also tend to be the 
leaders in group maze-running. Fischel (1927), 
on the other hand, did not find such a relation- 
ship in hens. The leaders in foraging expedi- 
tions in an orchard were not always the 
dominant birds. Katz (1949) with Barbary 
sheep as subjects, Scott (1945) with domestic 
sheep, and Stewart and Scott (1947) with goats, 
report that dominance does not play a major 
role in group organization. Leadership, how- 
ever, is of great importance and usually rests 
with a female. Such “matriarchal” societies are 
very different from the “patriarchal” societies 
of roe deer (Darling, 1937) or baboons (Car- 
penter, 1942), in which the most aggressive and 
dominant male occupies the central position of 
control, and Darling has stressed the value of 
matriarchy as making for true group cohesion. 
The leadership of the hind is more “selfless,” 
whereas in a patriarchal society the dominance 
relation, between leader and followers, is di- 
rectly at odds with the leadership relation. 
Thus the hind group among the red deer is 
much more closely knit than the stag group. 

As we have said, not much is known about 
the way in which leadership is acquired, or the 
level of insight that is involved on the part of 
leader and follower. That is, we do not know 
whether the follower is less alert than the leader 
because the leader is already acting as sentinel, 
or whether the follower is simply a less alert 
animal. Is the leader more alert, as leader, than 
he would be as a solitary animal? Even in 
Darling’s (1937) report of the greater selfless- 
ness of the leader, who did not go to food until 
satisfied that the coast was clear and who as a 
result got less food, it is not clear that concern 
for others was in question, or anything more 
than a special wariness. 


FRIENDSHIP AND ALTRUISM 


But we do not regard Darling's opinion as 
fantastic, or even as intrinsically improbable, 
although it must be hartl to establish in the 
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conditions of his study. There is definite evi- 
dence in other animals of a phylogenetic devel- 
opment of something we shall call altruism, 
defined as an intrinsically motivated concern for 
others. Without attempting to make the defini- 
tion exact, we use “altruistic” as a convenient 
designation for certain behavior that is not 
reflexive or immediately rewarding biologically: 
not necessarily "generosity," nor acting against 
inclination, but due to a capacity — or weak- 
ness — for developing a nondestructive interest 
in others. The weakness is very marked in man, 
and it does not matter for our present purposes 
whether it is a “learned drive" (Miller, 1951) 
or not. If the concern for others must be 
learned, the comparative evidence still indicates 
that we are dealing with something fundamental 
to man's nature, for we can trace phyletically 
some signs of an increasing readiness for such 
learning and an increasing frequency of the 
behavior in question without special conditions 
of reinforcement. 

The biologically primitive form of a concern 
for others is, first, in parental care, at a pur- 
posive level. Secondly, it is clearly seen when 
an attachment to a sexual partner extends be- 
yond the actual period of copulation, a phe- 
nomenon which is said to occur in some birds 
and mammals. 

A further step in development appears in 
selective friendships which do not depend di- 
rectly on primary reinforcement in Hull's 
(1943) sense. Such friendships are occasionally 
seen between dogs, and we may cite also the 
pet dog's protective behavior toward younger 
members of the family, when they are not the 
ones who feed or pet him. The clearest evidence 
that this behavior can be independent of pri- 
mary reinforcement is from the dog that makes 
a nuisance of himself by persistent "rescue" of 
children in swimming, despite repeated punish- 
ment. 

With the chimpanzee one cannot fail to see a 
step closer to man. Most chimpanzees, most of 
the time, seem entirely selfish; but so often are 
people. Friendships are marked (Kóhler, 1927) , 
and as Nissen (1951) has showh, the chimpanzee 
may be upset and unhappy when the long-time 
friend is taken away, even though other animals, 
are still present. 

Nissen's description of the effect of begging 
is particularly interesting here. It seems to be 
"both unpleasant and a very compelling stimu- 
lus. . . . Often it appears that the [chimpanzee 
with food] cannot tolerate this; reluctantly or 
even with some show of irritation or anger, he 
hands over some of the food, or throws it vio- 
lently at the beggar^ (Nissen, 1951, p. 445). 
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From one point of view this is not generosity 
at all, but from another the very reluctance to 
give makes the gift all the more significant, by 
showing the strength of the compulsion behind 
it. 

Here we can also give an example of what 
seems a purely altruistic concern for another 
chimpanzee, in an attempt to get the other out 
of danger. In the "bold and timid man" experi- 
ment described in the following section, Lia 
was afraid of the experimenter (outside the 
cage) playing the part of the bold man, and 
usually preferred to stay away from him. But 
Mimi, with whom she was caged, had other 
ideas. She stayed close to the cagewire and 
attacked repeatedly. The role of the bold man 
demanded that aggression be met with aggres- 
sion, and the following note was made on the 
data sheet. 


E 


Mimi tried to get my hand into reach, obvi- 
ously to repeat her earlier attacks. I caught one 
of her fingers with my gloved hand and bent it 
back slightly, enough to hurt, whereupon she 
screamed with rage, beat on the wire, and then 
returned to putting her hands out through the 
wire, trying to catch hold of me. Lia, in the 
meantime, was also angry at first, when I hurt 
Mimi, and attacked me until I struck at her hand 
with my fist. Thereafter she was alternately ag- 
gressive in my direction (but not getting within 
reach) and protective of Mimi, five or six times 
pulling Mimi's hands back from the wire, first 
one, then the other, to get her out of danger 
(Feb. 14, 1944), 


A week later when the next bold-man test 
was made, Lia showed no anger or aggression 
but, when the foolhardy Mimi persisted in try- 
ing to get the bold man into position for an 
attack, strenuously tried to pull her away for 
the whole observation period of one or two 
minutes. 

Finally, we may conclude with a remarkable 
observation of friendship between porpoises 
which is reported by McBride (1940, p. 26) — 
remarkable especially because the female re- 
ferred to was sexually receptive, and befofe and 
after the incident described the two males were 
bids bent only on preventing the other's access 
to her: 


The smaller of the two large males was re- 
moved from the tank and transported by means 
of our tank truck to Tampa. It was exhibited 

, there under the auspices of the State Conserva- 
tion Department in a concrete tank for three 
weeks, When the animal was released into the 
tank, the greatest amount of excitement on the 
part of the larger male was exhibited. No doubt 
could exist that the two recognized each other, 
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and for several hours they swam side by side rush- 
ing frenziedly through the water, and on several 
occasions they leaped completely out of water. 
For several days, the two males were inseparable 
and neither paid any attention to the female. 


Socially, the porpoise is an interesting animal 
indeed, and it is most unfortunate that natural- 
istic observation outside captivity is impossible. 
On the one hand, dominance is clearly marked, 
and vicious attacks with no apparent reason 
have been repeatedly seen. On the other, very 
elaborate and apparently unmalicious teasing 
and social play are prominent in the animal's 
behavior, as well as clearly marked cooperation. 
At sea in their own habitat the female presum- 
ably gives birth in isolation; at any rate, the 
female giving birth in captivity in the Marine 
Studios was very exciting to the males, one of 
which became aggressive. The other females 
gathered round the one in labor and helped 
ward off the attacking male. When the newborn 
infant began its first gradual ascent to the sur- 
face to breathe, another female accompanied the 
mother in swimming just below the infant in 
readiness to support it if it had failed to make 
the grade (McBride and Hebb, 1948). From 
other behavior in these animals it is reasonably 
certain that the behavior described was pur- 
posive, to assist the infant if necessary to get 
to the surface (as it must, of course, to breathe) , 
for another mother whose infant died during 
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the night was found in the morning steadily 
supporting the dead body at the surface. If 
such a level of behavior at first seems surprising, 
it may be remembered that the porpoise is a 
mammal with a brain fifty percent bigger, and 
with more cortex and a more convoluted cortex, 
than man’s. The brain lacks a frontal lobe, but 
in other respects it has striking similarity to 
that of the higher primate (Grünthal, 1948) . 
It is not anomalous, accordingly, to class the 
porpoise (Tursiops truncatus) as a higher ani- 
mal (McBride and Hebb. 1948) . 

The evidence indicates, therefore, that a dis- 
interested concern for others can be found in 
mammalian development. Although it reaches 
its greatest potential only in man, it is not 
foreign to any higher mammal, and is not 
something that is imposed only by reward and 
punishment on the growing human child. It 
can, of course, be stultified or fostered during 
growth, but by his intellectual and emotional 
characteristics man has a greater aptitude for 
altruistic attitudes than any other animal, and 
it is of great importance to gain a more precise 
knowledge of the conditions of its development. 
It is not likely that such knowledge will be 
obtained by human study alone (who would 
attempt such an experiment?) , but here again 
there is hope that real advances will be made 
in pooling the methods of animal experiment 
and naturalistic human observation. 


Tue ANALYSIS OF SOCIAL ATTITUDES 


We have argued that there is a definite gap 
between man and his nearest relatives in their 
effective levels of cooperation and communica- 
tion. No chimpanzee has language, or takes 
part in teamwork as we know it in man. But it 
was suggested also that the chimpanzee may not 
really be as far from man as he seems in these 
matters, and we turn now, in this and the follow- 
ing section, to behavior in which he seems 
human indeed. The evidence with which we 
deal here concerns, in general, the chimpanzee's 
social motivations; and the social situations 
considered will be those particularly which 
comprise two individuals, the dyadic unit of 
Sears (1951) : chimpanzee-chimpanzee or chim- 
panzee-caretaker, 

We also come now to things that have more 
obvious meaning for human social psychology. 
In this chapter, one of our objects, of course, is 
to persuade the reader that there is such meafi- 
ing in animal work. Since it is most evident 
and direct with chimpanzees, it is here that most 
of our emphasis must lie; and this in turn 
implies giving most of our attention to observa- 


tions made at the Yerkes Laboratories of Pri- 
mate Biology, the chimpanzee breeding and 
research station in Orange Park, Florida. The 
work at the Yerkes colony is of unique impor- 
tance for psychology, especially in the general 
area of personality or temperament. Not only 
does it constitute the only really extensive body 
of work on the great apes (Kóhler's was a 
single-handed attack, now nearly forty years 
ago), it also constitutes the only thorough 
longitudinal study, from birth onward, of a 
higher mammal, the same animals being studied 
by a number of persons from a variety of points 
of view (Yerkes, 1943) . 

© It has been noted elsewhere (Hebb, 19462) 
that exposure to a group of adult chimpanzees 
gives one the overwhelming conviction that one 
is dealing with an essentially human set of atti- 
tudes and motivations. The feeling is not less 
strong in the psychologically trained than in 
the untrained (and its strength can be embar- 
rassing to the purist who thinks that naming an 
animal's attitude or emotion is "subjective" and 
commits the deadly sin of anthropomorphism) . 
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It is probably a common experience to all who 
have worked at the Yerkes Laboratories to feel 
that the bare bones of human personality, the 
raw essentials, are being laid open before his 
eyes. At the same time, it is hard to convey this 
to others, and to support it with behavioral 
evidence. 

Merely describing what the chimpanzee did 
that gave one this feeling is not enough. One 
would also have to take the space to put the 
act into its setting: a life history of the animal, 
plus a detailed description of his usual habits. 
As the reader will see, interpretation of the act 
depends on such a background. The interpreter 
usually does not realize how he arrives at the 
interpretation, and so is embarrassed to find 
himself holding what seem to be unscientific 
convictions; but it is possible to show that the 
answers, though lHrighly inferential, are based on 
objective evidence of the same kind that is used 
in identifying human attitudes and emotions 
(Hebb, 1946a). Even though such studies have 
not gone far, therefore, they have significance 
for understanding human social interaction. 

Some of the chimpanzee behavior with which 
we are concerned takes the form of rather dra- 
matic episodes that are infrequent and cannot 
be repeated at will for study. These verge on 
the anecdotal, perhaps, but cannot be left 
entirely without mention when one wishes to 
show at what social level the chimpanzee oper- 
ates, at least on occasion. For example, May 
more than once convinced competent observers 
that she was skillfully and intentionally trans- 
forming the hostile pursuit of a male, bigger 
than she, into a friendly game of chase. 
Another example is Lia's attempt to keep Mimi 
from a foolhardy attack on the “bold man,” 
cited in the preceding section. Perhaps the most 
striking of these isolated observations is Vera's 
deliberately picking a fight with Jack in the 
next cage in order to distract Shorty's attention 
from the food being brought for them both — 
Shorty and she being caged together, Shorty 
dominant and sometimes taking her food. The 
interpretation in this last case rests partly on 
the fact that Vera's hair was not erect (as an 
excited chimpanzee's would be). Also, she left 
the fight as soon as Shorty entered it on he? 
side and came without evident excitement to 
get and eat her food. Finally, she screamed 
loudly again, without leaving her food, when 
the fight between Jack and Shorty began to 
peter out without her help. The scream re- 
newed it for the better part of a minute. 

Such reports, being reports of interpretation 
as well as of fact, do not, of course, make very 
satisfactory evidence. The objection may apply 
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less strongly to some of the repeatedly recorded 
aspects of the chimpanzee’s sexual behavior. Ex- 
amples are Pan's preferential interest in a young 
female with whom he had never copulated, 
so that he masturbated while she was led by out 
of his reach and disregarded the fully receptive 
female within his own cage; and Fifi's repeated 
refusal to copulate with Bokar over a three- 
month period (three periods of oestrus) though 
she had mated satisfactorily with him on earlier 
occasions and though the two were on friendly 
terms in other respects. Placed with Frank, 
Fifi then mated at once, quite normally. Such a 
selectivity has been observed repeatedly, with a 
number of animals. A refusal to copulate may 
or may not show social intelligence, but it does 
show how far the chimpanzee’s sex motivation 
is from the more reflexive (hormone- and 
sense-dominated) behavior of dog or cat. It 
shows, in short, the dominant role that attitude 
may have in the chimpanzee's social behavior, 
and as such a closeness to man. 

Yerkes and Elder (1936) have given a detailed 
description of the chimpanzee's mating behav- 
ior, and the reader will also find accounts of a 
fascinating interplay between sex and domi- 
nance in the chimpanzee in papers by Yerkes 
(1939, 1940, 1941) , Crawford (1940) , and Clark 
and Birch (1945; Birch and Clark, 1946, 1950) . 
In general, there is a clear relationship between 
phylogenetic level and a developing complexity 
in sexual behavior which Beach (1947, 1948) 
has worked out in detail, and which is applied 
by Ford and Beach (1951) to a clarification of 
human sex behavior. Their last chapter in 
particular, "Human Sexual Behavior in Perspec- 
tive," is an excellent example of the contribu- 
tion that a comparative study can make to an 
understanding of man. 


RECOGNITION OF SOCIAL ATTITUDES 


The conditions in which social attitudes are 
identified in the chimpanzee allow one to un- 
dertake a kind of analysis that is rather rare in 
comparative psychology. That is, while pretend- 
ing to study the chimpanzee, one can really be 
analyzing the reactions of the caretaking staff 
(including one's own reactions) and ask, “Just 
what cues in the chimpanzee's behavior deter- 
mine the human judgment that he is interested, 
or friendly, or malicious?" With the animal 
safely behind cagewire, unable to bring suit for 
damage to his self-esteem, one can experiment 
with emotion better than if the subject were 
man, but one can also study better the human 
recognition of emotions, with fewer confusing 
variables and more complete access to the data 


affecting the observer's judgment. 
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If the judgments were made at random, the 
analysis would have little significance. But it 
will be found that the experienced man refuses 
more often than not to say what the motivation 
is behind a given act. The behavior is often 
very puzzling to the caretaker, but when he 
names an attitude or motivation with confi- 
dence it is generally a dependable guide to the 
animals future behavior. The naming is not 
made from superficial similarities with man, 
which are often very misleading, and the worker 
may want to know an animal for years before 
venturing an opinion as to what lies behind 
some particular act (Hebb, 1946a) . 

This provides an exceptional opportunity for 
study of the complex social stimulus. What are 
the actual cues that determine a social response? 
What parts of the response determine the fur- 
ther responses of others? When a child acquires 
some attitude from his parents we neither know 
what stimuli entered into the learning, nor how 
the learning occurs. We know little about the 
effective cues by which an emotion or attitude 
spreads infectiously through a crowd. As All- 
port (1940) and Jones (1944) have pointed out, 
it is urgent for psychology to find out what 
happens in the common-sense intuitive social 
judgment. Until that is done, there remains a 
fundamental defect in the data of social psy- 
chology. One could almost wish that the various 
rating and ranking methods had not been 
devised, for their success in quantifying social 
judgments has allowed us to forget that we do 
not know what is being quantified and has led 
us to neglect the essential task of analyzing these 
implicit predictions of human behavior so that 
we can learn to improve on them. 

The first key fact is that recognition of a 
"mental state" in the chimpanzee usually does 
not depend on the present behavior alone. It 
is a perception of the relation of the present 
behavior to what the animal does in other cir- 
cumstances. "Interest" describes a heightened 
level of responsiveness, not a particular level: 
the behavior that is a sign of interest in one 
animal is apathy in another. "Anger" implies 
a sharp deviation of behavior from an accus- 
tomed base line. 

Accordingly, such identifications usually re- 
quire a thorough, long-time knowledge of the 
individual, in addition to background knowl- 
edge of the species, Experienced caretakers 
refused, for example, to make any statement 
about the attitudes of two chimpanzees, Shorty 
and Vera, with whom they had worked for some 
months, on the ground that they did not know 
them well enough yet. The animal's history 
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seems crucial always. A single observation of 
aggressive behavior on the part of Frank, in 
rather unusual circumstances, caused a serious 
revision of the generally held view that he was 
friendly to man, and these doubts persisted 
despite Frank's exemplary behavior afterward. 
At one time Dita had not attacked a member of 
the staff of the Laboratories for several years at 
least, but a knowledge of some of her earlier 
behavior still made her distrusted. 

On the other hand, animals like Bimba and 
Fifi were frequently aggressive, but their aggres- 
sions were put down to brief bouts of anger, 
again to a great extent on the basis of their 
histories. The judgment that an animal is 
basically friendly, therefore, is an especially 
complex inference; this view is supported by 
the fact that there may be some disagreement 
among the staff as to the friendliness of a par- 
ticular animal, and that the judgment is often 
made very tentatively. Although it is definitely 
based on behavior, and thus “objective,” it is 
hard to find the crucial factors that determine it. 

But other judgments are more amenable to 
analysis. Let us present next a method of re- 
cording used in a study of individual differences, 
or of temperament, in the chimpanzee (Hebb, 
19492) . 


A METHOD OF RECORDING SOCIAL ACTIONS 


Systematic samples of the behavior of 30 adult 
chimpanzees were taken during the course of 
the ordinary routine of the Yerkes Laboratories, 
and records taken also of behavior in the "bold 
and timid man" experiment, to be described 
shortly. 

Describing the behavior was the essential 
problem. It would have begged the question 
completely to record that the chimpanzee was 
friendly or frightened or bluffing: such judg- 
ments were the ones to be analyzed. It was also 
very difficult to avoid contamination of the 
data by such judgments made implicitly. 
Eventually some 36 specific acts (“first-order 
categories" of behavior) were defined in which 
interpretation was at a minimum, such that all 
observers could agree in their use. All records 

ewere made in terms of these acts, and thus 
seemed fully "objective." 

At this level, however, the raw data were an 
indigestible mass in which no order or meaning 
could be found. The next step was to group 
together acts that had roughly the same behav- 
ioral significance. "These "second-order cate- 
gories" were "friendly behavior,” “aggression,” 
"quasi-aggression," and “avoidance” (together 
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with a class of "no response”). Interpretation 
is definitely involved in making the classifica- 
tion, but it remains under control, and the data 
are recorded in first-order categories only. 
"Friendly behavior" does not mean "friendli- 
ness" but is defined as the occurrence of one of 
the following actions: putting the head, shoul- 
der, back, belly, or arm against the cagewire 
at the observer's approach, or (if it was there 
already) keeping it there while the observer 
scratches or rubs the skin; thrusting fingers or 
mobile lips through the two-inch wire mesh (a 
common invitation to contact); grooming the 
observer's skin; and making a sputtering noise 
with the lips (Bronx cheer) while approaching 
the observer. "Aggression" and "avoidance" 
are self-explanatory; "'quastaggression" is a 
combined category of intimidating and nuisance 
behavior: walking erect with the threatening 
swagger of the chimpanzee about to attack. (or 
trying to bluff), beating the ground or some 
noisemaking surface, throwing sand or feces 
and spitting. 

Order could now be found in the data, but it 
was still clear that something was missing: two 
animals that were known to be quite different 
might show quite similar totals in these higher 
categories. With the next step, however, light 
began to break through. The “third-order cate- 
gory" of behavior was defined as (a) the occur- 
rence of behavior from two second-order 
categories in definite sequence (e.g., avoidance 
followed by aggression is one category, aggres- 
sion followed by avoidance another) or (b) the 
occurrence of an act in one second-order cate- 
gory in the absence of any act from another 
specified second-order category (e.g., aggression 
not preceded by quasi-aggression). When the 
data were arranged in this way new meaning 
began to appear. 

The change in analysis made by the third- 
order category is most clearly shown in the data 
for males versus females, in their relationships 
to the staff. First consider the second-order 
categories of friendly behavior, quasi-aggression, 
and aggression, as in Table I. These data say 
that male and female show friendly behavior 
about equally often, but that the male is sig- 
nificantly more aggressive. The implied con 
clusion, Look out for the male, is markedly out 
of line with the caretakers' experience; nineteen 
out of twenty cuts or scratches they receive come 
from the female. Now consider the third-order 
categories of aggression preceded by quasi- 
aggression (the openly prepared attack), and 
aggression preceded by friendly behavior (de: 
ceitful attack), as in Table II. 
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` Since Table II may seem to have an antifemi- 
nist bias, we hasten to add several things. The 
male does not appear from the data to be less 
often desirous of doing injury than the female, 
but may be less astute about it. The deceptive 
attack on the staff by males does occur, although 
rarely enough that it was not found in the be- 
havior sample; and 7 deceptive attacks were 
made by males in the timid-man experiment 
(mean, 0.9; for females, 1.4). Some females, 
also, tend to show a male directness of attack, 
and individual differences in this respect seem 
to be greater than among males. 

The method of analysis also clarifies certain 
judgments about individual animals which 
would puzzle an outside observer, or perhaps 
confirm his suspicion that the one making the 
judgment is going far beyond the facts, when a 
caretaker says that exactly the same behavior in 
one animal is the expression of anger and in 
another attention-getting. The judgments are 
complex, but one can now show that they are 
based on observable behavior. 

Take the simple case in which the same act, 
spitting, is called evidence of desire for attention 
in one animal, malice in another. At the lowest 
level, we are dealing with two sequences of 
behavior, spitting followed by friendly behavior 
and spitting followed by aggression, From past 
observation of each animal the observer can 


Taste I 


INCIDENCE OF THREE FORMS OF BEHAVIOR BY 
MALES AND FEMALES: SECOND-ORDER CATEGORIES 


8 males 22 females 


Friendly behavior, total . 205 531 
mean 25.6 24.1 
Quasi-aggression, total. . . — 73 25 
mean . . 9.1 1.1 
Aggression, cac Se e 41 38 
mean .. 5.1 1.7 
Taste II 


INCIDENCE OF TWO FORMS OF BEHAVIOR BY MALES 
AND FEMALES; THIRD-ORDER CATEGORIES 
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8males 22 females 

pies ee C 
Quasi-aggression, then aggression 37 0 
Friendly behavior, then aggression 0 15 
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predict the second part of the sequence and as 
soon as the spitting is seen, without further 
observation, can say, “That is (or is not) mali- 
cious." The diagnosis is a prediction, and the 
prediction does not arise from some esoteric 
penetration into the consciousness of the chim- 
panzee but from a knowledge (a) of what 
chimpanzees in general are capable of and (b) 
of what this chimpanzee in particular has done 
in the past. Again, bluff is the sequence of 
intimidating behavior followed by retreat if the 
opponent does not retreat but moves forward. 
As soon as he sees the intimidating behavior in 
certain circumstances the observer may call it 
"bluff" without having to see what follows. But 
he can do this only from prior observation, so 
his diagnosis once more is a prediction. 

The reader may think all this too simple to 
need to be spelled out. In practice it is not. 
One's present perception of behavior is deeply 
colored by past experience. Experimentally, one 
may waste months in a futile effort to identify 
subtle distinguishing marks that are not there. 

For example, it usually seems to the observer 
that there is something in the manner of the 
bluffing chimpanzee that is different from the 
manner of one that is really going to attack. 
Perhaps there is, but the experienced handler 
of chimpanzees still declines to make the judg- 
ment with confidence until he has seen a good 
deal of the animal, which indicates that he does 
not really identify the bluff by present behavior 
alone. One of us (Hebb) had worked on this 
whole problem of analysis for three years before 
he discovered that he was still recording intimi- 
dating behavior in two ways, according to what 
he thought it meant: he recorded “bluff” when 
aggression did not follow immediately, but 
"aggression" when it did. In the latter case the 
prior intimidating behavior did not seem some- 
thing separate—it was a natural part of the 
whole picture of one aggressive act. The record 
of behavior became clear only when the two 
patterns were put down as (1) quasi-aggression 
followed by aggression, and (2) quasi-aggres- 
sion nôt followed by aggression. Bluffing is a 
third-order category, that is, not first order. 
Bluffing, as such, should not be entered in the 
record of behavior. 

The same keen observer took two years to 
discover that his observations of Bimba's attacks 
were colored by his belief that the behavior was 
caused by anger, not cunning malevolence, so 
that he failed to see that the attacks were some 
times preceded by friendly behavior to get the 
victim within reach, Only when a satisfactory 
method of recording was used did the truth 
appear. 


547 
THE “BOLD AND TIMID MAN" EXPERIMENT 


Any sensible person working with adult chim- 
panzees is constantly on guard, even with ani- 
mals that in general are well disposed. But this 
very caution may be an incitement to attack, and 
it seemed possible that further light on individ- 
ual differences might be had if one could test 
the animals’ reactions to a person with no 
fear, on the one hand, and to an exceptionally 
fearful one on the other. 

Accordingly, the two roles were set up, played 
by the same person in two disguises. In the 
one, with hands protected by heavy gloves, the 
experimenter strode confidently up to the out- 
side of the cage, making rough but friendly 
contact when the animal stayed within reach 
without aggression; but any attempt to scratch 
or bite was answered by striking back. The 
timid man acted the part of an excessively ner- 
vous person, very hesitant in approach, starting 
back at any sudden move of the animal, and 
making no near approach to the animal as long 
as he was aggressive in manner. 


TABLE III 


PERCENT OF TRIALS (2700 WITH STAFF, 450 WITH 
TIMID, 450 WITH BOLD MAN) SHOWING CERTAIN 
RESPONSES: Fr, FRIENDLY BEHAVIOR; Ag, AGGRES- 
SION; QAg, QUASI-AGGRESSION; Av, AVOIDANCE 
(DATA FROM HEBB, 1949A) 


Staff Timid Bold 


Apparently friendly (Fr alone) 26 29 23 


Fear (Av alone) 02 741.30 
Aggression (Ag in any 
combination: 
2nd order category) 50 15 13 
Annoying or teasing 
(QAg without Av or Ag) Si, 24 188 
Bluffing (QAg-Av) 00 22 56 
Deceit (Fr-Ag or Fr-QAg) 09 17 2.7 
Deceitful attack (Fr-Ag) 06 87 22 
Deceitfully startling or 
e annoying (Fr-QAg) 03 84 04 
Afraid but friendly (Av-Fr, Fr-Av) 00 2.7 27 
Afraid, antagonistic 
00. 2.7 10 


(Av plus Ag or QAg) 


Some of the results are given in Table III. 
The outstanding one was fear of the bold man, 
although he did nothing to start trouble but 
only answered in kind’ Avoidance alone 
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occurred on 37 percent of trials with the bold 
man, as in the table, but if the trials in which 
avoidance was combined with some other be- 
havior were included the total would be 49 
percent. Some of the fear of the bold man pre- 
sumably was due to the strange costume worn, 
since the rather similar appearance of the timid 
man also aroused some fear (bold and timid 
man each wore false faces); but the two cos- 
tumes were similar, so it may be concluded that 
the difference between 37 percent avoidance 
of the bold man and 7 percent of the timid 
man was due to the difference in behavior. It 
will be recalled that both roles were played by 
the same person. 

A more significant result for our present pur- 
poses is the high frequency of aggression and 
quasi-aggression to the timid man, and especially 
the frequency of behavior which, to state the 
matter as clearly as we can, enticed an innocent 
close just to scare the hell out of him. The role, 
of course, demanded that the timid man fall 
into this trap as often as it was laid, and that 
he should show a maximum of startle every 
time. Many animals found the temptation 
irresistible; only 9 out of 30 failed to show such 
behavior on one or more trials. 

The individual differences revealed by the 
test method are at least as interesting as the 
group results, although they cannot be gone 


THE CONDITIONS OF 


The results of the experiment just described 
serve very well to introduce our next main 
point, which is that the social determinants of 
fear and hostility are very complex. If the 
reader does not find this conclusion new and 
startling we can say it another way. The prob- 
lem of social motivation is apt to be taken too 
lightly, with too much of the reversed-panacea 
approach—the idea that all our social troubles 
stem from economic pressure, or Oedipus diffi- 
culties, or frustration, or some other single 
source, Fix whichever one it is that makes the 
trouble, and all will be well. A main object 
here is to show that the infrahuman animal has 
a more complicated set of motivations than is 
allowed to man by these one-shot prescriptiors. 

Some of our conclusions in this chapter will 
be quite obvious ones: frustration is not a suffi- 
cient statement of the causes of aggression; the 
unknown, as such, may cause fear; man, who so 
often avoids pain and effort, also goes to some 
trouble to expose himself to threat of pain and 
to situations demanding effort. These things 
are common knowledge. But they are not com- 
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into thoroughly here. Pan, often aggressive to 
the staff but considered a gentleman because he 
always gave clear warning, began by attacking 
the bold man; being outbluffed (thanks to a 
strong cagewire) , he appeared to lose all inter- 
est and moved off to a far corner of the cage 
whenever the bold man appeared. Two much 
more vicious animals, Pati and Beta, were 
equally impressed by their failure to cow the 
bold man, but in a different way. Like Pan, 
they first attacked; when the bold man did not 
retreat but struck back, they became markedly 
friendly. To the observer this transition was 
little short of astonishing. Instead of their 
usual lack of interest in man, alternating with 
vicious attack, they now vigorously solicited 
friendly contact. The behavior was not fawning 
by any means (both tried occasional further 
attacks, for example) , but they appeared to be 
strongly impressed by the bold man, and on the 
whole very favorably. 

Another pattern of response was shown by 
Mimi, She was markedly friendly to the timid 
man, and was tempted into only one attack on 
him, midway through the experiment; but she 
attacked the bold man with persistence and 
determination. Still other responses were shown 
by Jack and Frank, who were highly interested 
but unaggressive, and unafraid, with both bold 
and timid man. 


Fear AND HosriLITY 


mon knowledge in psychology, and we must 
remind the reader that everyone knows a lot of 
things that are not true. One necessary task 
for scientific psychology is to sort out, among 
the obvious facts of human nature, the true 
from the untrue, the useful ideas from the use- 
less. The reason for preferring simpler theories 
of human action is not that anyone thinks that 
the facts are really simple, but that it is too 
easy to postulate explanatory entities whenever 
the going gets hard. Which are the fertile pos 
tulates? Which are really needed to account for 
the facts? It is especially necessary to bercareful 
when a discrepancy between fact and theory is 
found only at the human level, in the adult 
subject, because of the multifarious and uncon- 
trolled experiences that occur during the long 
period of human growth, together with the 
special effects of verbal learning, So it will not 
be new if we conclude from the animal data 
shat man is hostile to strangers; this is "known" 
already, but the need for taking theoretical 
account of it is clearer if its phylogenetic devel- 
opment can be traced, and when related behav- 
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ior can be demonstrated in lower animals, in 
more controlled conditions and without the 
contaminating influence of language. 


CHIMPANZEE FEARS AND AGGRESSIONS 


Fear of innocuous objects is very common in 
the chimpanzee, even though these objects are 
not associated with some more primitive cause 
of avoidance, The diary records of the Yerkes 
Laboratories contain a great many references 
to unexplained fears of a particular person, 
some piece of apparatus or part of it, rope of a 
particular size, color, and texture but not of 
other ropes, a biscuit in which a worm is found, 
a vegetable of a peculiar shape, and so on. 
There are great individual differences, of 
course, but as a group the adult chimpanzees 
are markedly subject to such fears. Kóhler's 
(1927) observation of panic at sight of small 
animal models such as a toy donkey is well 
known. A chance observation, made with a clay 
model of a chimpanzee head (Hebb, 1946b), 
extended the range of objects known to have 
such effects. The clay model, reasonably lifelike 
in proportions but about half life size, produced 
extreme panic in nearly half the thirty-two 
adults to whom it was shown, and very marked 
signs of fear in most of the rest. An actual head 
preserved in formalin had the same effects, and 
so had a very lifelike colored model of a human 
head from a display dummy. 

An attempt to deal with such facts theoreti- 
cally (Hebb, 1949b) is not too satisfactory in 
certain respects, but the line of reasoning sug- 
gested a relationship to human emotional 
disturbance at sight of a mutilated body and 
fear of the dead, and so the following observa- 
tions were also made. These are cited in full 
because of the socially significant aggressions 
that were elicited along with the fear. A dead 
chimpanzee was not available for experiment, 
but anesthetized ones were: 


Aug. 15, 1943, Mars, a young animal who had 
been anesthetized by nembutal, in order to make 
physital measurements, was carried out and shown 
to four of the adults, Jack, Dick, Don, and Dita. 
The first three of these were markedly excited 
(Dita less so) and Don attempted to attack. Aug. 
17. Mars, anesthetized again, was laid on the 
ground in the infants’ enclosure, with his fellow 
youngsters. One of them, Art, was persistently 
aggressive, biting and hitting the inert animal, 
who had to be rescued. Mars was then carried to, 
the cages of some of the adults. Nana and Helene 
showed interest but no emotional excitement; 
Don was excited, with hair erect, and somewhat 
aggressive in manner; Dina and Bula manifested 
à nonspecific excitation, with hair erect; Wendy's 
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hair was erect, she did not seem as excited, but 
she came close and remained; and Josie (as it 
happened Mars’ mother, from whom he had been 
separated early) manifested an apparent mixture 
of aggression and fear — she was very excited, 
ran inside when Mars was first brought near, 
then attacked him through the cagewire. Her 
hair was erect throughout. 

Since the sight of an anesthetized infant being 
carried by one of the staff was common, the 
observations above were later repeated with an 
anesthetized adult. Feb. 11, 1945. Don, under 
nembutal, was wheeled on a handcart up to the 
cages of nine other adults. Ami, Nira, and Vera 
showed fear, and Dina and Bokar did also but 
then followed this by a show of aggression at a 
distance; Kambi showed generalized excitation 
and screaming only; Frank, with hair erect, spat 
at the anesthetized Don; Pan first avoided, then 
attacked through the cagewire; and Lia, with 
general excitation but not avoidance, also at- 
tacked. (The youngsters in the infants' enclosure 
were afraid, one very much so, and all showed 
signs of marked excitation.) 


Now let us look at the chimpanzee's aggres- 
sions as they occur in more ordinary circum- 
stances, in the form of teasing, jealousy, anger, 
or chronic malice. 

The motivation for teasing is often strong. 
It seems certain that the chimpanzee gets satis- 
faction simply from the discomforture of the 
human being who is startled and frightened, or 
angered, by a sudden screaming attack on the 
cagewire nearest him or on being squirted with 
water. Certainly the animal goes to consider- 
able trouble to achieve such results, in spite of 
being often punished for it and getting no 
other reward than seeing the effects of the act. 
Other chimpanzees are teased as well as human 
beings, as Kóhler (1927, pp. 83ff) noted. De- 
liberate planning is involved, with an evident 
expectancy of the end effects of the behavior. 
“Teasing” is too mild a word for some attacks 
that seemed to be made only for the sake of 
others' discomforture (as when Alpha would 
occasionally yank a handful of hair out of a 
thoroughly dominated cagemate, with no provo- 
cation) , but it is important to note that these 
minor aggressions are not always indications 
of a desire to do more serious damage if only 
tke victim were within reach. Quite often he 
is within reach, or becomes so later without 
receiving more damage. In the timid-man 
observations, some of the animals seemed genu- 
inely malicious, but others, after once getting 
a rise out of the experimenter, became gentle 
and made no attempt to injure him when he 
came within reach. 

Look next at the causes of anger, and their 
relation to frustration. The usual stimulus to 
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anger is the behavior of a person or another 
chimpanzee acting in a way that is not desired, 
or failing to act in a desired way. "Desire" is 
one of the terms proscribed in Watson’s 
Revolution along with images and ideas, not 
because of its vagueness or trouble in defining 
it (after all, “learning,” “intelligence,” and 
"emotion" were retained) but because such 
terms were thought to be entirely subjective and 
anthropomorphic in their reference. We can 
now see that they are much better treated as 
referring to intervening variables, postulated 
processes, either with man or animal. As such 
they are more or less suitable for use in an ob- 
jective psychology. They are still vague, but 
the fault is not in the way the terms are defined 
but in our theoretical conceptions, which will 
not be made any better just by using new terms. 
If one is really serious about stating the condi- 
tions in which anger occurs, a reference to 
desire or some equivalent is necessary. 

To speak of “frustration” instead does not 
avoid the necessity, for it is clear that it is a 
desire or an expectancy that is frustrated. By 
some writers, of course, the term “frustration” 
has been used to emphasize the complexity and 
importance of internal representative processes 
in a Freudian or near-Freudian way (see, e.g., 
Rosenzweig, 1944) . With this we have no quar- 
rel. But some writers have seemed to feel that 
the reference to frustration is a means of simpli- 
fying the problem, and that it avoids the need 
of postulating conceptual processes. Others 
have pointed out that this will not work for 
man (e.g., Sargent, 1948; McKellar, 1950) . The 
evidence seems even clearer for the chimpanzee. 

When a female chimpanzee in heat is intro- 
duced into the next cage, the male seeing her 
gets up and comes as close as he can, until he is 
stopped by intervening cagewire, At this time 
he shows no anger, although a goal-directed re- 
sponse, strongly motivated, has been inter- 
rupted. But anger becomes highly predictable 
if the male is first led to expect, by the care- 
taker's actions, that he will be admitted to the 
female's cage, and then is not. It is the expect- 
ancy that gives the physical barrier its effect. 
In other situations, one can only say that the 
chimpanzee is angry because he does not lite 
what another is doing. No goal-directed 
response is interrupted, and a physical barrier 
has nothing to do with the cause of anger (al- 
though it may prevent the effects of the anger 
from being visited on its object) . 

Thus Pan had a temper tantrum at the sight 
of two experimenters taking a vaginal smear 
from a female in heat in another cage. Mimi 
was driven to paroxysms of rage by the two 
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inmates of the next cage, who would wait till 
her back was turned and then spring screaming 
at the intervening cagewire, repeatedly startling 
her. Bokar was almost as much infuriated by 
Dick's spitting water at him, even when he was 
not hit. Kóhler (1927, p. 293) has noted that 
noisemaking, and social disturbance as such, 
may be a cause of anger for a chimpanzee just 
as for man. The diaries of the Yerkes Labora- 
tories likewise record a number of instances in 
which a noisy temper tantrum by one animal 
has led another to attack him. We have already 
commented (in discussing "altruism") on the 
anger of the chimpanzee at another's begging, 
even as he yields and hands over food. Defining 
the frustration in such cases of anger will pre- 
sent difficulty, unless it is frankly incorporated 
in a theory of thought so that frustration can 
be treated as some sort of conflict with the 
thought process. With anger, as with fear, the 
phenomena of emotion point to the intimate 
relationship of motivation to intellectual proc- 
esses, and the impossibility of treating these 
things in separate compartments of theory. 


THE PRACTICAL PROBLEMS OF SOCIAL MOTIVATION 


It is reasonable, even desirable, to oversim- 
plify theory in its preliminary stages, as long as 
one is concerned only with theoretical problems. 
The idea is that later stages of theory will do 
justice to the facts if early ones cannot. But 
it is quite another thing to forget that theory 
is still in these early stages when one is advising 
on a practical problem. What is virtue in one 
endeavor is vice in the other, and the more 
difficult the problem the truer this becomes. No 
other scientist deals with issues of such practical 
urgency as the social psychologist, and even 
though he has recognized the complexity of his 
problem the social psychologist may still find 
it worth while to consider some of the practical 
implications of the emotional behavior of 
higher animals, and perhaps also to be re- 
minded, by the animal data, of the practical 
dangers of tacitly accepting too simple a view 
of motivation. d 

This last remark, be it noted, is not a covert 
attack on the theory of biological rewards (plus 
conditioning), Although we have probably 
betrayed our preference for another sort of 
approach, such as that of Harlow (1951) oF 
Maslow (1953) , we must point out that biologi- 
calreward theory is being greatly elaborated 
by such recent treatments of secondary rewa 
as those of Mech) and MacCorquodale (1948, 
1951), Mowrer (1950, 1952) , and Miller (1951). 
These discussions imply no simple approach to 
the problem of social motivation. 
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In the year 1950 the main explanations of 
social hostility, as outlined by Katz (1951) ina 
review of the literature, make the hostility 
learned, or in some way a product of frustration. 
Although the difficulties in the way of one or 
the other of such simplifications have been 
recognized (Katz, eg., p. 155, concludes in 
favor of multiple causation) the comparative 
psychologist may point out that the literature 
reviewed by Katz predicates a view of motiva- 
tion that on our present knowledge is doubt- 
fully adequate for the laboratory rat and quite 
inadequate for the dog or chimpanzee, let alone 
man. 

From the practical point of view in this mat- 
ter, the animal evidence offers good news as well 
as bad. It complicates the issues theoretically, 
but some of the complications mean that there 
may be hope for man as a social animal. On the 
plus side there is, as we have seen, evidence of 
a fundamental aptitude in higher animals for 
friendship and helping others. We do not know 
just what conditions of rearing foster this altru- 
ism, but it should be possible to find out 
experimentally. 

A further plus item is that frustration does not 
always increase hostility, and may even decrease 
it without evidence of undesirable side effects 
("displacement"). Punishing an adult chim- 
panzee for a misdemeanor like spitting or 
scratching the caretaker does not usually pro- 
duce signs of resentment. On one occasion at 
least, punishment of a young animal for 
repeated biting led to a strong friendship (diary 
of Gamma, Yerkes Laboratories, 1935). The 
reader may be reminded also of the sudden 
friendliness of Beta and Pati to the bold man, 
when he promptly answered attack with attack. 
In animal, as in man, the effect of aggression in 
some circumstances is respect and liking. Nissen 
(personal communication) points out that forc- 
ing a chimpanzee to do something, or forcibly 
depriving him of something, is as often followed 
by an increase of friendliness as by a decrease. 
The friendliness cannot be an effect of the 
frustration alone; it must depend also on the 
fact that the frustrater is a rewarder at other 
umes. But with a background of beneficence, 
the important thing is that deprivation need 
not decrease goodwill. Social living and educa- 
tion inevitably mean an unending series of 
frustrations but these, it seems, should not be 
thought of as always making for hostility. 

But the bad news must be reckoned with, too* 
On the minus side is the evidence that timidity, 
as such, may incite aggression. Fortunately, 
this is not always so, and the consistently 
friendly behavior of some chimpanzees to the 
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timid man justifies some hope concerning pri- 
mate behavior in this respect. Also in the debit 
column are the mammal’s fear and hostility 
with strangers, even when no injury has ever 
been received from a stranger (Hebb and Rie- 
sen, 1943). This has an evident meaning for 
social problems, and so has the combined fear 
and hostility toward a fellow chimpanzee under 
anesthesia (if, as it seems, this means that the 
cause of aggression was the failure of the anes- 
thetized animal to behave in the normal way). 

There is also another kind of evidence, which 
has both good and bad implications. It shows, 
in brief, that mammals seek excitement, and 
this search has desirable possibilities in society 
as well as extremely undesirable ones. The 
animal that is so vulnerable to emotional dis- 
turbance nevertheless seeks the situations that 
produce emotional disturbance in mild degree. 
There may be difficulty about dealing with this 
theoretically (Hebb, 1949b, pp. 232ff), but it 
appears empirically that for any one animal 
there is an optimal level of fear or frustration. 
First, the evidence concerning fear: strange sur- 
roundings tend to produce emotional disturb- 
ance in the rat, but Montgomery (1951, 1952) 
and Thompson (1954) have shown experimen- 
tally that the rat which has the choice of 
familiar and unfamiliar territory will tend to 
move toward the unfamiliar — the well-known 
exploratory drive. Whiting and Mowrer (1943) 
and Berlyne (1950) have suggested a connec- 
tion between fear and investigative tendencies, 
and the dog that is frightened by a strange 
object is nevertheless apt to return to look at 
it again, balanced between closer approach and 
flight (Melzack, manuscript in preparation). 
The same thing can be observed in chimpan- 
zees, and Woodworth (1921) and Valentine 
(1930) have described the behavior of young 
children who ask to be shown again — at a safe 
distance — the object that has frightened them. 
Secondly, concerning frustration: Harlow, 
Harlow, and Meyer (1950) have demonstrated 
the monkey's willingness to expose itself repeat- 
edly to the frustrations inherent in problem 
solving, without extrinsic reward. Mahut, 
working in the McGill laboratory, has shown 
theat the rat which is offered two routes to food, 
one short and direct, the other via a maze prob- 
lem, will choose the problem on twenty to forty 
percent of the runs. Some of the time even a 
rat prefers to work for his living. 

Such phenomena are, of course, well known 
in man: in the liking for dangerous sports or 
roller coasters, where fear is deliberately 
courted, and in the addigtion to bridge or golf 
or solitaire, vices whose very existence depends 
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on the difficulty of the problems presented and 
an optimal level of frustration. Once more, 
when we find such attitudes toward fear and 
frustration in animals, we have a better basis for 
supposing that we are dealing with something 
fundamental if a man prefers skis to the less 
dangerous snowshoes, or when we observe an 
unashamed love of work (problem solving and 
frustration included) in the scientist, or in the 
businessman who cannot retire. Such behavior 
in man is usually accounted for as a search for 
prestige, but the animal data make this un- 
tenable. It seems much more likely that solving 
problems and running mild risks are inherently 
rewarding or, in more general terms, that the 
animal will always act so as to produce an op- 
timal level of excitation. One explanation has 
been attempted by Hebb (1949b) ; another, into 
which the facts on the whole would fit very well, 
is the treatment by Mowrer (1950, 1952) of 
anxiety reduction as a reward (though here 
one needs to assume that problem solving is 
also a cause of anxiety). 

The potential social benefits of a liking for 
work and (in some essential occupations) for 
taking risks are sufficiently obvious. This is 
the plus side of the mamma's liking for excite- 
ment. But, as the chimpanzee's behavior shows, 
excitement may be sought in other less de- 
sirable ways. The plain fact is that the primate 
is only too ready to become a troublemaker 
when things are dull, and we had better stop 
comforting ourselves with the accurate but in- 
sufficient statement that man has no instinct 
to make war. He also needs no special coaching 
to discover a taste for “adventure,” and some 
of his adventures may be socially disastrous, 
Fear or dislike of the strange is not innate, since 
it depends on certain prior experiences, yet 


it still does not have to be taught (Hebb, - 


1946b). If, therefore, man is not born with a 
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dislike for those who differ from him in habits 
or appearance, he can still pick up the dislike 
with no help or encouragement. Making war 
is not instinctive; neither, unfortunately, is an 
aversion to certain forms of excitement that may 
lead to war. 

It appears, therefore, that we stand on very 
slippery ground when we suppose that social 
harmony is to be had on a negative basis, by 
minimizing frustration for the growing child 
and not teaching him bad ideas about others. 
Suppose we do give the child as much freedom 
as we can; suppose the adult does see that he 
is not being injured or threatened by other 
groups — is there not still a strong presumption 
that more will be necessary? That the child 
must be taught somehow to like others different 
from himself, to get pleasure from knowing 
more about them and learning about their un- 
usual ways? In addition to correcting economic 
injustice, may it not be necessary also to de- 
velop, systematically, socially harmless sources 
of excitation? Poker and bridge and pennant 
races, and bowling alleys and detective stories, 
may have more to do with our social stability 
than we think. Even today, an economically 
successful society may require circuses, in some 
form, as well as bread. 

Economic factors obviously play a large part 
in social conflict; it is only too evident that 
children can learn hostile attitudes from their 
elders. We must recognize and control these 
factors for a sound society. But it is equally 
essential to recognize that there is no scientific 
basis for asserting that these are the only factors 
underlying war and race prejudice. We may 
even doubt that they are ever effective by 
themselves, without the deepseated tendency 
to be disturbed emotionally by what is different 
from ourselves. 


A THEORETICAL APPROACH TO HUMAN NATURE AND SOCIETY 


We have made some effort in the preceding 
pages to be economical of theory, using no more 
than was needed to make a connected story. If 
we have nevertheless been speculative, we must 
now become more so. Our purpose in this final 
section is to offer a theoretical view concerning 
human emotions which, we believe, casts light 
on the evolution and structure of society. 

Doing so is not outside the scope of the pres- 
ent chapter. The views we set forth derive from 
the study of animal emotions; and we may 
remind the reader, if need be, that comparative 
psychology includes man, regarding him simply 
as the highest in "he animal series. Human 
culture is unique, but this only makes it the 


more interesting as a comparative phenomenon. 
From the comparative point of view it consti- 
tutes a special problem. 

Man is the most intelligent animal; and the 
most striking feature of his behavior, compara- 
tively, is an increasing control of the physical 
environment and the accumulation of “wealth” 
and wealth-producing skills. But almost equally 
striking is the progressive modification that has 
gone on in other aspects of human behavior: 
broadly speaking, a moral and political develop- 
ment. The economic growth can be treated as 
the result of a search for comfort by an excep- 
tionally intelligent animal; a unique power 
communication, particularly, allows for pro 
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gressive problem solving. But the cumulative 
change in other respects: can this be ascribed 
to the same source? 

Can we derive man's social evolution from 
his exceptional intellectual characteristics? 
Perhaps we would also allow him some altru- 
ism, on occasion, and we must certainly 
recognize that his powers of learning and com- 
munication will perpetuate some mistakes, so 
that if we see man as intellectually distinctive 
only we need not imply that all his social ar- 
rangements are intelligent ones. Even after 
making these allowances, however, we may ask 
whether such a simple model of the origins of 
society would not be significantly improved by 
recognizing that man also has exceptional emo- 
tional characteristics. 

We have already proposed that the mammal 
seeks excitement when things are dull. This 
by itself makes an important change in the 
theory of “economic man." That theory can 
hold only in an impoverished society (in which 
the level of frustration is already above the 
optimum) and not in an economically success- 
ful one. With a general increase in wealth and 
security, the risky venture may be preferred to 
the sure thing, the interesting occupation to one 
that pays well. 

But let us now go further. Evidence from 
species comparison suggests that emotional sus- 
ceptibility increases with intellectual capacity. 
Man is the most emotional as well as the most 
rational animal, But this susceptibility is partly 
self-concealing: its possessor tends to seek the 
environment in which a too-strong emotion is 
least likely to occur, the one in which disturb- 
ance is nearer the optimal level. In man, this 
makes for the establishment of “civilized” so- 
cieties, the chief characteristic of which (at 
least until recently) is not that they improve 
the economic lot of the average member, but 
that they provide an environment in which the 
frequency of acute fear, disgust, anger, and 
jealousy is decreased. The further a society has 
advanced along this path, the less subject to 
Strong emotion its members must appear. 
Violent emotional outbursts vanish, the well- 
bred man becomes the norm, and psychologists 
Who seldom see the strong, acute disturbance 
(except in young children) begin to think of 
adult man as typically unemotional But the 
susceptibility to the disturbance may still be 
there and, although concealed, have a major 
influence on social organization. T 


EMOTIONALITY AND INTELLECT 


Such views and their comparative origin will 
be more intelligible if we see how they were 
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arrived at, in the course of constructing a theory 
presented elsewhere (Hebb, 1949b). 


The first aim was relatively modest: to find 
some way of bringing together the facts of learn- 
ing, perception, and set or attitude. This, it 
seemed, might be feasible if one could treat the 
thought process as a “phase sequence,” as a 
series of events in the brain each of which might 
be aroused sensorily or centrally, or both. 

The principal question was as to the nature of 
these events in the brain. Existing anatomical 
evidence, and some of the psychological evidence, 
led to the conception that each of the series of 
activities would be the activity of an “assembly” 
of neural cells. These assemblies would have to 
be established before overt learning could occur, 
which seemed to deny the truth of the theory, 
until von Senden's (1932) evidence was recalled. 
The second main difficulty entailed by such ideas, 
however, is the one that concerns us more. The 
assembly as postulated would be rather fragile in 
its function, easily disturbed or disrupted by un- 
usual sensory conditions. The theory thus arrived 
at the improbable conclusion that a strange en- 
vironment would interfere with thinking, or 
might disorganize it completely. 

Then came a freeing hypothesis: suppose the 
interference with thought to be what we already 
know as emotion, and that it is usually mild. The 
neurophysiological speculations would permit the 
disturbance to occur in varying degrees, so let us 
assume that in mild degree it would not disrupt 
the phase sequence but only slow it. Slowing the 
phase sequence would mean that thought would 
continue, with the same general content, for a 
longer time. The thing thought about (or per- 
ceived) would be by definition more “interest- 
ing,” more capable of arresting and holding 
attention. A mild disturbance might thus be posi- 
tively motivating; and a strong disturbance, even 
though of the same kind, have opposite effects. 
Since (as we have seen) the behavioral evidence 
shows that there are such paradoxical effects of 
emotional stimulation, these neurophysiological 
notions seemed to have theoretical possibilities 
even though they still left much to be accounted 
for. 

If such ideas were to be taken seriously, how- 
ever, they had a further implication which led, 
rather deviously, to the present problem. If emo- 
tion is in some sense a breakdown of the thought 
process, it seemed that emotionality would be- 
come more marked when thought is more com- 
plex. The more elaborate the machinery, the more 
ways its operation can be disturbed. It seemed 
therefore that development of a higher level of 
intelligence would mean an increased vulnerabil- 
ity to emotional disturbance. 


When the evidence was examined, some sup- 
port for this idea appeargd. The chimpanzee 
seems to be more subject to emotional disturb- 
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ance than the dog, and the dog more so than 
the rat. There were also hints of an ontogenetic 
correlation, of increased emotionality with in- 
tellectual development from infancy to ma- 
turity. Now we come to our problem: adult 
man should be still more emotional than the 
very emotional chimpanzee and more emotional 
than the three-year-old child, and he does not 
seem to be. 

But certain aspects of man’s social behavior 
were very suggestive when they were examined 
from this point of view, and led us to the con- 
clusion that the predicted susceptibility exists, 
that it is a major influence on social evolution, 
and that it tends to produce societies such as 
ours in which it can hardly be detected. 

This view depends in part on the phyloge- 
netic correlation. Let us therefore consider it 
first. There are certain primitive causes of 
avoidance and aggression that seem to work 
equally well at all levels of mammalian de- 
velopment: pain stimuli, sudden loud noise, 
sudden loss of support, restriction of movement 
by the animal that is used to freedom — all 
have immediate effects that are comparable in 
laboratory rat and man. But as we go from 
rat to man there is a progressive increase in 
the range of effective stimuli to avoidance and 
aggression, and in the duration and complexity 
of the response. 

In the rat, for example, there is little need 
of such a term as “anger” for describing the 
animal's behavior. A rat is aggressive or he is 
not, and the aggression has about the same pat- 
tern in different circumstances. The same seems 
true of the dog, although occasionally he shows 
something that may be homologous with the 
primate's sulking. But with the chimpanzee, 
it is essential to distinguish anger from chronic 
malice if the animal is to be handled safely. 
The peculiarly human patterns of temper tan- 
trum and sulking occur frequently. The causes 
of aggression are more varied in the dog than 
in the rat, and far more varied in the chimpan- 
zee than in the dog. Finally, the period of 
emotional disturbance following a brief stimu- 
lation also increases from rat to dog to chimpan- 
zee (the chimpanzee Fifi, for example, sulked 
for three weeks over not getting a cup of milk; 
first showing outright anger, then refusing to 
accept milk from anyone for a day or so, and 
continuing for three weeks to refuse it from 
the one who had denied it to her). 

The phylogenetic development of fear sus- 
ceptibilities is even clearer. In higher mam- 
mals' irrational fears, the motionless object may 
be nearly as much avoided as the moving one. 
For a lower form like the rat, avoidance of a 
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strange motionless object may occur (Hudson, 
1950) but it certainly is not a prominent part 
of the behavior. The dog falls part way between 
rat and chimpanzee. Melzack (1952) has de- 
scribed fears in the dog which are obviously 
like those of the chimpanzee, although the dis- 
turbance seems less intense. As for man's near- 
est relatives, the apes, it would take a page or 
more to list the specific objects and situations 
that are known to have caused persistent fear 
in the chimpanzee, as reported in the diaries 
of the Yerkes Laboratories and in systematic 
experiments (Kóhler, 1927; Yerkes and Yerkes, 
1936; McCulloch and Haslerud, 1939; Hebb, 
1946b) . 

When one considers also his manifold angers 
and hostilities, the chimpanzee at first sight is 
a bundle of emotional disturbances. This is 
not really so, of course. Most of the time the 
chimpanzee is no more angry or afraid than 
his human caretaker, and frequently less. The 
thing that one fears another does not. Some 
animals are "nervous" or short-tempered, but 
others are placid. Our point here, however, is 
that as one goes from rat to dog to chimpanzee 
one finds an increasing variety in the causes 
of emotional disturbance, an increasing variety 
of manifestation, and an increasing duration 
following brief stimulation, all of which is con- 
sistent with the idea that susceptibility to 
emotional disturbance increases with intellec- 
tual development. 

To this phylogenetic correlation may be 
added an ontogenetic one. In the observations 
with anesthetized animals as test objects (de- 
scribed in the preceding section) the frequency 
and degree of excitation aroused in the half- 
grown chimpanzees was markedly less than in 
the full-grown. The clay model of a head 
hardly got a second glance from the one- and 
two-year-olds; it absorbed the attention of the 
five- and six-year-olds but produced no avoid- 
ance; in most of the adults it produced strong 
fear, with some avoidance in all of them. Jacob- 
sen, Jacobsen, and Yoshioka (1932) and Mc 
Culloch and Haslerud (1939) have described 
the increasing emotional responses of the chim- 
panzee in the first years of life. For man, 
Jersild and Holmes (1935) have shown a simi- 
lar increase up to the age of five; they conclude 
that the trend is thereafter reversed, but we 
shall try to show that this may indicate, not à 
reduction of emotional susceptibility after five 
years, but the success of our society in protect 
ing us from most of the situations that would 
produce fear in older children and adults. 

Jones and Jones (1928) demonstrated that 
fear of snakes, at least, is much stronger and 
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more frequent in young adults than in children, 
none of their subjects having had prior ex- 
posure to snakes. This was a cross-sectional 
study, and a longitudinal observation may be 
added. Two young children, two years apart 
in age, were encouraged by a parent to handle 
snakes and liked to do so whenever they saw 
and could capture one. But the older at the 
age of eight, the younger at seven, began to 
show not a fear but a dislike of touching a 
snake, and this repugnance subsequently be- 
came strong in both, still definitely without fear 
and definitely without any associated injury, 
in a part of the country where poisonous snakes 
are never found. 

Even without the special theoretical views 
from which the discussion took off, such phylo- 
genetic and ontogenetic correlations between 
increasing intellect and increasing emotionality 
would suggest that thought and emotion are 
intimately, essentially related. There must be 
doubt concerning any treatment of emotion as 
a state or process independent of intellectual 
processes, and having a separate seat in the 
nervous system, Brainstem theories of emo- 
tion, as developed originally by Head and by 
Cannon and Bard or more recently by Lindsley 
(1951), do not seem to give enough weight to 
the evident cortical elements in the fears and 
angers of the higher animal. These appear in 
the kind of perception that can cause emotion 
in the higher animal (including the role of 
fantasy reported by the human subject) , as well 
as in the correlation with intellectual level that 
has just been discussed. 


EMOTION AND SOCIETY 


Such theoretical considerations lead one to 
expect that adult man will be more subject to 
emotional disturbance than the young child 
or subhuman animal. But such an expectation, 
surely, is nonsensical. At the age of three to 
four years the child is apt to be plagued by 
imaginative fears (Jersild and Holmes, 1935), 
and tb plague his parents with his strong dis- 
likes and outbursts of temper. The adult 
chimpanzee must be treated, in effect, as a wild 
animal, and as yet no one has been able to 
domesticate him, Surely it is adult man who is 
least subject to emotional disturbance. The 
chimpanzee, it seems, is explosive, unreliable, 
fearful, dangerous; man (normal man, normally 
reared) is ordinarily quiet, not given to sudden 
violent attack on others without warning, not 
forever in fear of this or that harmless object. 

Perhaps so, but let us look closer, —— 

First, in most primitive societies it is clear 
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that man has generally found himself ringed 
around by malignant ghosts and devils, with 
beneficent spirits in the minority, and has found 
it constantly necessary to spend time, effort, 
and wealth to propitiate even the friendly ones. 
Fellow members of the tribe commonly possess 
evil powers; and the one who has the evil eye 
is feared and hated, or killed, according to cir- 
cumstances. For any one of the members to be 
greatly different from the others in appearance, 
habits, skills, or tastes would be more than apt 
to cost him his life. As for an outsider, even 
of the same race, language, and culture — any 
foreigner is distrusted and feared. He may be 
a convenient source of trophies (a scalp or an 
embalmed head to show that one is a self-re- 
specting human adult) , or alternatively may be 
captured and brought back whole for torture 
or ceremonial sacrifice for the edification of 
the whole tribe. 

These are well-established facts of human 
behavior, and not rare exceptions either. How- 
ever, if they are cited, and if one cites also the 
concentration and slave-labor camps of our own 
generation, to show that man is to be thought of 
as a wild animal, emotionally unstable and 
often vicious, the answer one will receive is that 
these things may happen if people are taught 
as children to behave in these ways, but that 
fear, hatred, and cruelty are not inherent in 
man's nature. 

If so, one might wonder at the coincidences 
that allowed the same teaching to originate in 
so many parts of the globe. 

However, we should make it clear at once 
that we do not argue that these attitudes are in- 
born. We do argue that man by birth and 
maturation has a tremendous emotional suscep- 
tibility on the one hand and some need for 
excitement on the other; that if these are 
handled carefully in the growing child, with a 
highly specialized environment, the results may 
be most admirable; but that these qualities by 
themselves, with no teaching from others or in 
spite of ineffective teaching, may by reaction 
to other environments produce socially disas- 
trous results. 

As far as we have gone, therefore, and think- 

eing of our own suburban milieux as providing 
exceptional rather than typical examples of 
human social behavior, we might be justified 
in considering man not a tame animal but in- 
herently dangerous, to be domesticated only 
with great pains. Picking a human society at 
random from all those that have existed, one 
might say that the risk taken by a stranger 
entering it — a member of another society, also 
chosen at random —“would not be less than 
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that taken by a chimpanzee or a wolf encoun- 
tering a group of his fellows. It is not wholly 
fantastic, consequently, to suggest that man is 
more emotionally excited by what is strange 
than lower animals are, except after special 
training. 

Perhaps we can go further. Even in ourselves, 
the civilized, amiable, and admirable part of 
mankind, well brought up and not constantly 
in a state of fear, there are signs that this urban- 
ity depends as much on our successfully avoid 
ing disturbing stimulation as on a lowered 
sensitivity. As already suggested, the capacity 
for emotional breakdown may be self-conceal- 
ing, leading the animal to find or create an 
environment in which the stimuli to excessive 
emotional response are at a minimum. So effec- 
tive is our society in this respect that its mem- 
bers — especially the well-to-do and educated 
ones — may not even guess at some of their own 
potentialities. One usually thinks of education, 
in the broad sense, as producing a resourceful, 
emotionally stable adult, without respect to 
the environment in which these traits are to 
appear. To some extent this may be true. But 
education can be seen as being also the means 
of establishing a protective social environment 
in which emotional stability is possible. Per- 
haps it strengthens the individual against un- 
reasonable fears and rages, but it certainly 
produces a uniformity of appearance and be- 
havior which reduces the frequency with which 
the individual member of the society encounters 
the causes of such emotion. On this view, the 
susceptibility to emotional disturbance may not 
be decreased. It may in fact be increased. The 
protective cocoon of uniformity, in personal 
appearance, manners, and social activity gen- 
erally, will make small deviations from custom 
appear increasingly strange and thus (if the 
general thesis is sound) increasingly intolerable. 
The inevitable small deviations from custom 
will bulk increasingly large, and the members 
of the society, finding themselves tolerating 
trivial deviations well, will continue to think of 
themselves as socially adaptable. 

The educated man usually prides himself on 
his broad-mindedness, his judgment of persons 
by their essential qualities, and his disregard, 
of the superficial. The college student is, if 
anything, even more confident of his own con- 
tempt for appearances. Now suppose that 
someone whose appearance, only, departed sig- 
nificantly from the usual pattern applied for a 
job as a university instructor, Suppose it is a 
male who takes the highly reasonable view that 
women's clothing is more comfortable in the 
North American summe?, or a female who de- 
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cides that long hair is a nuisance and shaves her 
head bare. Or suppose it is a war casualty whose 
face is really disfigured by scar tissue, without 
ears, nose, eyebrows, or orbital ridges, but whose 
senses, intellect, and speech are unimpaired: 
what chance would any of these have to get a 
job? If he got the job, what chance would there 
be that students would sit in a classroom and 
listen to him, no matter how wise he were? 
None of these are major deviations from the 
usual human appearance. Merely asking these 
questions of an undergraduate class allows one 
to observe a curious mixture of uneasiness and 
incredulity. The degree of uniformity in the 
social environment is such that one has diff- 
culty even thinking of a really different one. 
The trivial changes of fashion from one year 
to the next look big to us, and we flatter our- 
selves that a man's appearance does not funda- 
mentally affect our behavior toward him. — ' 

Similarly, the adult may think that he is not 
disturbed by strange places, darkness, or soli- 
tude, although young children are. As our lives 
are usually arranged, this belief is easy to main- 
tain. It is more difficult if one has tried being 
separated from one's companions in the deep 
woods at night; reportedly, it is also difficult 
for those who have suffered solitary confine- 
ment. 

All this, it seems, casts grave doubt on the 
idea that man has intrinsically a greater emo- 
tional stability than the chimpanzee or other 
animals. The real difference may be in the 
protective environment man has created for 
himself, which he so takes for granted that he 
becomes unable to see how his stability is 
achieved, and how his behavior would look to 
someone not used to it. 

More clearly than anywhere else, this is seen 
in taboo phenomena. The travelers of the 
eighteenth and nineteenth centuries were com- 
pletely astonished by taboo — something new, 
quite foreign to their own lives. Scholars now 
recognize that taboo has always existed in 
civilized cultures, but even today it seems not 
to be recognized how complete and typical its 
manifestations are, nor how far-reaching in 
social effect. 

Taboo is relevant to the present discussion in 
several respects. It was drawn to our attention, 
in the first place, by observing the chimpanzee's 
response to a chimpanzee or human head (de: 
scribed earlier). "This reminded us of the 
equally puzzling human response to the sight 
of a major operation or autopsy, so the further 
observations were made with an apparently 
dead (ie anesthetized) chimpanzee as test 
object. The ambivalence of the behavior this 
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aroused then recalled Freud’s (1919) emphasis 
on the puzzling ambivalence of some primitive 
taboos: the object may be simultaneously re- 
garded with reverence, and as dangerous or 
contaminating. lt seems to be thought that in 
more enlightened societies such anomalies have 
disappeared. Any remnants of taboo in our 
own society are thought of simply as ‘moral 
prohibitions.” Our taboos are thus regarded as 
purely negative, and more or less based on 
rational attitudes. But with the chimpanzee's 
behavior as a starting point, and with the ques- 
tion in mind whether man is really as free of 
susceptibility to irrational emotion as he seems, 
we found something different. 

Consider first a relatively mild taboo, on the 
word "God" Although the taboo has become 
much less strong in recent years, its ambivalence 
is still beautifully clear. The word (a) has a 
reverent use, and (b) is also on occasion de- 
scribed literally as a “dirty word,” for using 
which one of the writers some time ago (about 
1910) had his mouth washed out with soap. 
The lexicographer need not be puzzled to find 
modern examples to illustrate the double mean- 
ing of the Latin sacer. 

A much more powerful taboo on the dead 
human body is equally ambivalent. Be reverent 
to the dead, speak in whispers and make no 
jokes at the funeral service, and, if you are male, 
remove your hat. There are still remnants of 
the Middle-Ages and Renaissance prohibition 
against cutting up the human body, shown in 
the difficulty of getting bodies for medical- 
school teaching or permission for autopsy. Dis- 
section violates the sanctity of the dead. But 
in opposition to such reverence, the corpse is 
also contaminating, unpleasant, disgusting; dis- 
gust may be expressed, for example, at a report 
that relatives have kissed the dead man after 
preparation for burial. In general, members of 
this society are rarely exposed to dead bodies, 
except in war or disaster, and they tend to be 
ashamed of their emotional sensitivity thereto, 
so it is hard to produce formal evidence of the 
repugnance. An undergraduate class of 198 
persons, including some nurses and some vet- 
erans, was polled to find out how many had 
encountered a dead body. Thirty-seven had 
never seen a corpse in any circumstances; 91 
had seen one only after it was prepared for 
burial — a total of 65 percent who had either 
not seen a corpse or who had seen one only in 
the ritualized procedure that rigidly limits ex- 
posure to it. These statistics show clearly that 
society, for some reason, protects its members 
from something about the dead body, and if 
there should still be somé doubt that this is 
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the emotional disturbance that the body arouses, 
the reader can get the evidence for himself if 
he will obtain a human head from the dissect- 
ing room and try to carry it home openly in 
the subway. 

Sex taboos are multifarious, but some of them 
clearly show the ambivalence we are discussing. 
On the one hand, marriage is holy; on the other, 
unsanctified sexual activity is — once more — 
“dirty,” contaminating, filthy. We read with 
amusement and incredulity of the behavior of 
the primitive people who put to death those 
who violate a taboo; but the death sentence for 
rape and the prison sentence for the bigamist 
or seller of pictures of sexual congress is exactly 
in the same class, for it does not have to be 
shown that physical or psychological harm has 
been done to anyone; only that the taboo is 
violated. 

That there is a double standard of morals is 
the old way of saying that as a sex object the 
human female has a stronger taboo than the 
male. For the ambivalence of taboos, it is sig- 
nificant that the swing from one extreme to the 
other is greater with the female than the male: 
her virginity is more sacred than the male’s 
(which is apt to be a source of derision, except 
when the male in question is under another 
taboo, of the clergy) , and she is more degraded 
and dangerous or contaminating to others when 
the taboo has been flagrantly broken. Observe 
also that young men have been known to say, 
and even on credible report to feel, that they 
are “not worthy” of some young woman or 
other; the converse point of view is notably rare. 

The only apparent way of accounting for 
these ambivalences is to suppose that the taboo 
object is, in the first place, simply a very excit- 
ing one; and that this emotional arousal in 
some circumstances tends to be channeled into 
one motor outlet (eg, aggression), in other 
circumstances another outlet (e.g., fawning or 
abasement). This implies that there might be 
individual differences in response to the same 
object, just as there must be differences in what 
is exciting to different individuals, It is, of 
course, evident socially that the growing child 
left to himself would develop his own set of 
jaboos. "Moral education" has as one of its 
main objectives to establish uniformity: uni- 
formity of taboo, but more broadly a uniformity 
of emotional sensitivity. 

If all persons could be given the same emo- 
tional attitudes, including a fear of acting in a 
way that arouses emotion too strongly in others, 
internal social conflict would disappear, if at 
the same time the society is economically suc- 
cessful and provides itf neembers with comfort 
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and safety. This is the direction, it seems, in 
which society tends to move, up to the point at 
which the moral prohibition reduces provoca- 
tive behavior below the optimal level. This 
level itself varies, of course, with individuals 
according to past experience, which means that 
society will never reach complete uniformity. 
There will always be a tendency to "test the 
limits” of the social code; the result in some 
cases will anger or horrify too many members 
of society, and the testing will be suppressed. In 
other cases, however, the tester will elicit a de- 
gree of response that is acceptable, and the 
innovation will be socially perpetuated. 

We may see human society, therefore, as con- 
ditioned primarily by the avoidance of the too- 
strong emotional disturbance. In the first place, 
this may be the action of “economic man”: the 
search for wealth in the form of shelter and 
safety. Primitive man is likely to have more 
than enough emotional disturbance imposed 
on him by insufficient control of his physical 
environment. But an equally important de- 
terminant of early society is a means of con- 
trolling man’s own behavior, as a source of 
emotional disturbance. This influence may 
have definitely uneconomic results, even at an 
early stage in the development. 

If thus we consider man as distinctive not 
only intellectually but also emotionally, a more 
adequate picture of human society emerges, 
Although we have oversimplified, even our ac- 
count does not make it a simple picture of 
enforced conformity and avoidance of fear- and 
rage-provoking stimulation. There are, also, 
the search for the optimal level of arousal, and 
individual differences of emotional susceptibil- 
ity. Our society, as well, has a long way to go 
before it can be supposed to have reached the 
maximal degree of conformity, But it does not 
seem to have been recognized how consistently 
our society has moved in the direction of ar- 
ranging that its members act so as not to excite 
their fellows beyond a certain point. When 
the strong excitation is not avoided, it is aroused 
under carefully specified conditions only (mar- 
riage, law court, and prizefighting are examples) 
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in which the sexual arousal, fear, or anger 
aroused in one does not spread uncontrolled to 
others. Producing a low-level chronic excitation 
in others, such as fear of losing a job or not 
passing an examination, is not frowned on; but 
the hypothetical civilized man, the goal at which 
approved social education aims, does not, ex- 
cept in the special circumstances referred to, 
act so as to produce acute, strong sexual arousal, 
jealousy, anger, or fear. He does not display 
disgusting objects openly; he is equally careful 
not to get into situations in which his own emo- 
tions make a display of him. 

Our social arrangements fall short of this 
admirable goal, but they have, on the whole, 
achieved in some segments of this society a 
remarkable success in avoiding the causes of 
strong emotional arousal in ordinary life. This 
result seems quite unpredictable from the as- 
sumption that man is in himself a relatively 
unemotional animal, distinctive mainly by his 
intellect. It does become intelligible if man 
has a manifold susceptibility to emotional dis- 
turbance; if he tends to avoid actions that lead 
to strong emotion, and if one of the causes of 
emotion is the sight of it in others; and if the 
gradual establishment of conformity makes 
things become exciting which would otherwise 
not be so, because of their strangeness. 

Man is a rational, unemotional animal as 
long as there is nothing to disturb his emotions. 
Also, the causes of strong emotion are few, as 
we usually consider them; for all we take ac 
count of are the causes that do operate in a 
society the main function of which may be to 
control and limit strong emotion, and we are 
almost incapable of thinking of an environment 
that differs from the social and physical one in 
which we live and which we have so carefully 
tailored to our needs. The animal evidence 
shows that the altruism and friendship so promi- 
nent in Homo sapiens are fundamental to his 
nature: if we are to learn how to foster such at- 
titudes, we must recognize and provide the kind 
of protective social environment in which it is 
possible for them to develop. 


THE SOCIAL SIGNIFICANCE OF ANIMAL STUDIES 


559 


REFERENCES 


ALLEE, W. C., Emerson, A. E., Park, O., Park, T., & 
Scumipt, K. P. Principles of animal ecology. 
Philadelphia: W. B. Saunders, 1949. 

ALLPORT, G. W. The psychologist's frame of refer- 
ence. Psychol. Bull., 1940, 37, 1-28. 


Beacu, F. A. Evolutionary changes in the physiologi- 
cal control of mating behavior in mammals. 
Psychol. Rev., 1947, 54, 297-315. 

BracH, F. A. Hormones and behavior. New York: 
Hoeber, 1948. 

Brach, F. A. The snark was a boojum. Amer. Psy- 
chologist, 1950, 5, 115-124. 

BrgLvNE, D. E. Novelty and curiosity as determi- 
nants of exploratory behavior. Brit. J. Psychol., 
1950, 41, 68-80. 

BERNSTEIN, L. A note on Christie's "Experimental 
naiveté and experiential naïveté.” Psychol. Bull., 
1952, 49, 38-40. 

Bierens DE Haan, J. A. Animal language in rela- 
tion to that of man. Biol. Rev., 1929, 4, 249-268. 

Bincuam, W. E., & Grirritus, W. J. The effect of 
different environments during infancy on adult 
behavior in the rat. J. comp. physiol. Psychol., 
1952, 45, 307-312. 

Bircu, H. G., & Crank, G. Hormonal modification 
of social behavior. II. The effects of sex-hormone 
administration on the social dominance status of 
the female-castrate chimpanzee. Psychosom. Med., 
1946, 8, 320-331. 

BincH, H. G., & Crank, G. Hormonal modification 
of social behavior. IV. The mechanism of estro- 
gen-induced dominance in chimpanzees. J. comp. 
physiol. Psychol., 1950, 43, 181-193. 


CARPENTER, C. R. Societies of monkeys and apes. 
Biol. Sympos., 1942, 8, 177-204. 

CLARK, G., & Bircu, H. G. Hormonal modifications 
of social behavior. I. The effect of sex-hormone 
administration on the social status of a male- 
castrate chimpanzee. Psychosom. Med., 1945, 7, 
321-331. 

Conant, J. B. On understanding science: an histori- 
cal approach. New Haven: Yale, 1947. 

Crawrorp, M. P. The cooperative solving of prob- 
lems by young chimpanzees. Comp. Psychol. 
Monde: 1937, 14, No. 68. Pp. 1-88. 

CnAwronp, M. P. The relation between social dom- 
inance and the menstrual cycle in female chim- 
panzees. J. comp. Psychol., 1940, 30, 483-513. , 

Crawrorn, M. P. The cooperative solving by chim- 
panzees of problems requiring serial responses to 
color cues. J. soc, Psychol., 1941, 13, 259-280. 


Daruine, F. F. A herd of red deer. Oxford: Oxford 
Univ. Press, 1937. 

Davis, K. Extreme social isolation of a child. Amer. 
J. Sociol., 1940, 45, 554-565. ` 
Davis, K. Final note on a case of extreme isolation. 

Amer. J. Sociol., 1947, 52, 432487. 


Drorkin, V. H. Host specificity relations of termite 
protozoa. Ecology, 1941, 22, 200-202. 


FiscHEL, W. Beiträge zur Soziologie des Haushuhns. 
Biol. Zbl., 1927, 47, 678-696. 

Fon», C. S, & BzacH, F. A. Patterns of sexual be- 
havior. New York: Harper (Hoeber), 1951. 

Forcays, D. G., & Forcays, J. W. The nature of the 
effect of free-environmental experience in the rat. 
J. comp. physiol. Psychol., 1952, 45, 322-328. 

FnEUD, S. Totem and taboo. (Trans. by A. A. Brill.) 
London: Routledge, 1919. 


GorpsrEmN, K. Human nature in the light of psycho- 
pathology. Cambridge, Mass: Harvard Univ. 
Press, 1940. 

GoopENoucH, Frorence L, The measurement of 
mental growth in childhood. In L. Carmichael 
(Ed.), Manual of child psychology. New York: 
Wiley, 1946. Pp. 450-475. 

GREENBERG, B. Some relations between territory, 
social hierarchy, and leadership in the green sun- 
fish (Lepomis cyanellus). Physiol. Zoól., 1947, 20, 
267-299. 

Grece, R. E. The origin of castes in ants with special 
reference to Pheidole morrisi Forel. Ecology, 
1942, 23, 295-308. 

GRÜNTHAL, E. Zur Frage der Entstehung des Men- 
schenhirns. Mschr. Psychiat. Neurol., 1948, 115, 
129-160. 


Hatt, C. S., & WutrEMAN, P. H. The effects of in- 
fantile stimulation upon later emotional stability 
in the mouse. J. comp. physiol. Psychol., 1951, 
14, 61-66. 

Harrow, H. F. Levels of integration along the phy- 
logenetic scale: learning aspect. In J. H. Rohrer 
& M. Sherif (Eds) , Social psychology at the cross- 
roads. New York: Harper, 1951. Pp. 121-144. 

Hartow, H. F., HARLOW, MARGARET K., & MEYER, 
D. R. Learning motivated by a manipulation 
drive. J. exp. Psychol., 1950, 40, 228-234. 

Hayes, K. J, & Hayes, CATHERINE. The intellectual 
development of a home-raised chimpanzee. Proc. 
Amer. phil. Soc., 1951a, 95, 105-109. 

Hayes, K. J., & Haves, CATHERINE. Imitation in a 
home-raised chimpanzee. J. comp. physiol. Psy- 
chol., 1951b, 45, 450-459. 

Hess, D. O. Emotion in man and animal: an analy- 
sis of the intuitive processes of recognition. 

o Psychol. Rev., 1946a, 53, 88-106. 

Hess, D. O. On the nature of fear. Psychol. Rev., 
1946b, 53, 259-276. 

Hess, D. O. Temperament in chimpanzees: I. 
Method of analysis. J. comp. physiol. Psychol., 
1949a, 42, 192-206. ! 

Hess, D. O. The organization of behavior. New 
York: Wiley, 1949b. 

Hess, D. O., & Brora, D. Scientific writing and the 

neral problem of communication. Amer. Psy- 
chologist, 1952, 7, 569-573. 


560 


Hess, D. O., & Riesen, A. H. The genesis of irra- 
tional fears. Bull. Canad. psychol. Ass., 1943, 3, 
49-50. 

Heron, W. T. The inheritance of maze learning 
ability in rats. J. comp. Psychol., 1935, 19, 77-89. 
Hit, J. C, & Rosinson, B. A case of retarded 
mental development associated with restricted 
movement in infancy. Brit. J. med. Psychol., 1929, 

9, 268-277. 

Hupsow, B. B. One-trial learning in the domestic 
rat. Genet. Psychol. Monogr., 1950, 41, 99-145. 
Huu, C. L. Principles of behavior. New York: 

Appleton-Century, 1943. 

Hunt, J. McV. The effects of infant feeding-frustra- 
tion upon adult hoarding in the albino rat. J. 
abnorm. soc. Psychol., 1941, 36, 338-360. 

Hunt, J. McV., ScuLossrRc, H., SoLomon, R. L., & 
SrELLAR, E. Studies of the effects of infantile ex- 
perience on adult behavior in rats: I. Effects of 
infantile feeding-frustration on adult hoarding. 
J. comp. physiol. Psychol., 1947, 40, 291-304. 

Hvwovrrcu, B. The effects of experiential varia- 
tions on problem-solving in the rat. J. comp. 
physiol. Psychol., 1952, 45, 313-321. 


Jacossen, C. F., JAcoBsEN, M. M., & Yosuoxa, J. G. 
Development of an infant chimpanzee during her 
first year. Comp. Psychol. Monogr., 1932, 9, 1-94. 

Jean, A. T., & Hormes, F. B. Children’s fears. 
New York: Teach. Coll. Bur. Publ., 1935. 

Jones, E. S. Subjective evaluations of personality. In 
J. McV. Hunt (Ed.) , Personality and the behavior 
disorders. Vol. I. New York: Ronald, 1944. Pp. 
139-169. 

Jones, H. E. Environmental influences on mental 
development. In L. Carmichael (Ed.), Manual of 
child psychology. New York: Wiley, 1946. Pp. 
582-632. 

Jones, H. E., & Jones, M. C. A study of fear. Child- 
hood Educ., 1928, 5, 136-143. 


Karz, D. Social psychology and group processes. 
Annu. Rev. Psychol., 1951, 2, 197-172. 

Katz, I. Behavioral interaction in a herd of Barbary 
sheep (Ammotragus Lervia). Zoologica, N. Y., 
1949, 34, 9-18. 

Kur, W. The mentality of apes. (2nd Ed.) New 
York: Harcourt, Brace, 1927. 


Levy, D. M. Experiments on the sucking reflex and 
social behavior in dogs. Amer. J. Orthopsychiat., 
1934, 4, 203-224. 

Licur, S. F. The determination of the castes of 
social insects. Quart, Rev. Biol., 1942, 17, 312-326 

Licur, $. F. The determination of the castes of social 
insects, Quart. Rev. Biol., 1943, 18, 46-63. 

Liwpstev, D. B. Emotion. In S. S. Stevens (Ed.), 
Handbook of experimental psychology. New 
York: Wiley, 1951. Pp. 473-516. 

Lorenz, K. A contribution to the comparative soci- 
ology of colonial-nesting birds. Proc. int. orn. 
Congr., Vill, Amsterdam, 1954, 207-218. 

Lorenz, K. Der Kumpenén der Umwelt des Vogels. 
J. Om., Lpz, 1935a, 83, 137-213. 


HANDBOOK OF SOCIAL PSYCHOLOGY 


Lorenz, K. Der Kumpan in der Umwelt des Vogels, 
J. Orn., Lpz., 1935b, 83, 289-413. 

Lorenz, K. King Solomon's ring. London: Methuen, 
1952. 


McBrivg, A. F. Meet Mister Porpoise. Nat. Hist. 
Mag., 1940, 45, 16-29. 

MCBRIDE, A. F., & Hess, D. O. Behavior of the cap- 
tive bottle-nose dolphin, Tursiops truncatus. J. 
comp. physiol. Psychol., 1948, 41, 111-123. 

McCuttocu, T. L., & Hasterup, G. M. Affective 
responses of an infant chimpanzee reared in isola- 
tion from its kind. J. comp. Psychol., 1939, 28, 
437445. 

McKxrLam, P. Provocation to anger and the devel- 
opment of attitudes of hostility. Brit. J. Psychol, 
1950, 40, 104-114. 

Mater, N. R. F., & ScuNEmrA, T. C. Principles of 
animal psychology. New York: McGraw-Hill, 
1935. 

Mastow, A. H. The instinctoid nature of basic 
needs. J. Pers., 1953, in press. 

Mason, Marie K. Learning to speak after six and 
one-half years of silence. J. Speech Disorders, 
1942, 7, 295-304. 

Meret, P. E, & MacCorquopate, K. A further 
study of latent learning in the T-maze. J. comp. 
physiol. Psychol., 1948, 41, 372-396. 

Meeut, P. E., k MAcConQUopALE, K. Some methodo- 
logical comments concerning expectancy theory. 
Psychol. Rev., 1951, 58, 230-233. 

MzxtzACK, R. Irrational fears in the dog. Canad. J. 
Psychol., 1952, 6, 141-147. 

Mur, N. E. Learnable drives and rewards. In 
S. S. Stevens (Ed.), Handbook of experimental 
psychology. New York: Wiley, 1951. Pp. 435-472. 

Montcomery, K. C. The relation between explora- 
tory behavior and spontaneous alternation in the 
white rat. J. comp. physiol. Psychol, 1951, 44, 
582-589. 

Montcomery, K. C. Exploratory behavior and its 
relation to spontaneous alternation in a series of 
maze exposures. J. comp. physiol. Psychol, 1952, 
15, 50-57. 

Morris, C. Signs, language and behavior. New 
York: Prentice-Hall, 1946. 

Mowrer, O. H. Learning theory and personality 
dynamics. New York: Ronald, 1950. 

Mowrer, O. H. Motivation. Annu. Rev. Psychol, 
1952, 3, 419-438. 


Nissen, H. W. A field study of the chimpanzee 
Comp. Psychol. Monogr., 1931, 8, No. 36. Pp- 
1-122. 

Nissen, H. W. Social behavior in primates. In C. P. 
Stone (Ed.), Comparative Psychology (3rd Ed.)- 

„New York: Prentice-Hall, 1951. Pp. 423-457. 

Nissen, H. W., Cnow, K. L., & Semmes, JOSEPHINE 
Effects of restricted opportunity for tactual, kines- 
thetic, and manipulative experience on the be- 
havior of a chimpanzee. Amer. J. Psychol. 1951, 
64, 485-507. 


THE SOCIAL SIGNIFICANCE OF ANIMAL STUDIES 


Pratu, O. E. Insect societies. In C. Murchison 
(Ed.), A handbook of social psychology. Worces- 
ter, Mass.: Clark Univ. Press, 1935. Pp. 83-141. 


Révész, G. The language of animals. J. gen. Psy- 
chol., 1944, 30, 117-147. 

Ruesen, A. H. The development of visual perception 
in man and chimpanzee. Science, 1947, 106, 
107-108. 

Riesen, A. H. Arrested vision. Scient. Amer., 1950, 
183, 16-19. 

RosENzwEIG, S. An outline of frustration theory. 
In J. McV. Hunt (Ed.), Personality and the be- 
havior disorders. Vol. 1. New York: Ronald, 1944. 
Pp. 379-388. 

Ross, S., Sam, W. I., & DENENBERG, V. A prelimin- 
ary study of individual and group hoarding in 
the white rat. J. genet. Psychol., 1950, 77, 123-127. 


SARGENT, S. $. Reaction to frustration — a critique 

, and hypothesis. Psychol. Rev., 1948, 55, 108-114. 

ScuwEmLA, T. C. Problems in the biopsychology of 
social organization. J. abnorm. soc. Psychol., 
1946, 41, 385-402. 

ScuNEIRLA, T. C. A consideration of some concept- 
ual trends in comparative psychology. Psychol. 
Bull., 1952, 49, 559-597. 

Scorr, J. P. Social behavior, organization and lead- 
ership in a small flock of domestic sheep. Comp. 
Psychol. Monogr., 1945, 18, 1-29. 

Sears, R. R. A theoretical framework for person- 
ality and social behavior. Amer. Psychologist, 
1951, 6, 476-483. 

SrEWART, JEANNIE C., & Scorr, J. P. Lack of correla- 
tion between leadership and dominance relation- 
ships in a herd of goats. J. comp. physiol. Psychol., 
1947, 40, 255-264. 


Tuompson, W. R. Exploratory behavior as a func- 
tion of hunger in "bright" and "dull" rats. In 
press. 

Tuomrson, W. R., & Binpra, D. Motivational and 
emotional characteristics of "bright" and "dull" 
rats. Canad. J. Psychol., 1952, 6, 116-122. 

TuoMrsoN, W. R., & Heron, W. Exploratory be- 
havior in normal and restricted dogs. In press. 

Tuorre, W. H. The concepts of learning and their 
relation to those of instinct. Sympos. Soc. exp. 
Biol., 1950, 4, 387-408. 


c 


561 


"TiNBERGEN, N. The study of instinct. Oxford: Ox- 
ford Univ. Press, 1951. 

Torman, E. C. Purposive behavior in animals and 
men. New York: Century, 1932. 

Tryon, R. C. Genetic differences in maze learning 
ability in rats. Yearb. nat. Soc. Stud. Educ., 1940, 
39 (I), 111-119. 


VALENTINE, C. W. The innate bases of fear. J. genet. 
Psychol., 1930, 37, 394-419. 

Von FniscH, K. Bees: their vision, chemical senses, 
and language. Ithaca: Cornel] Univ. Press, 1950. 

Von SrNpEN, M. Raum- und Gestaltauffassung bei 
operierten Blindgeborenen vor und .nach der 
Operation. Leipzig: Barth, 1932. 


Wasmann, E. Psychology of ants and of higher ani- 
mals. St. Louis: B. Herder, 1905. 

Wuee ter, W. M. Social life among insects; IV. Sci. 
Mon., 1922, 25, 385-404. 

Wher er, W. M. The social insects, their origin and 
evolution. New York: Harcourt, Brace, 1928. 

Wnurrisc, J. W. M., k Mowrer, O. H. Habit pro- 
gression and regression — a laboratory study of 
some factors relevant to human socialization. J. 
comp. Psychol., 1943, 36, 229-253. 

WoopwonTH, R. S. Psychology. New York: Holt, 
1921. 


Yerkes, R. M. Dominance and sexual status in the 
chimpanzee. Quart. Rev. Biol., 1939, 14, 115-136. 

Yerkes, R. M. Social behavior of chimpanzees: dom- 
inance between mates, in relation to sexual status. 
J. comp. Psychol., 1940, 30, 147-185. 

Yerkes, R. M. Conjugal contrasts among chimpan: 
zees. J. abnorm. soc. Psychol., 1941, 36, 175-199. 
Yerkes, R. M. Chimpanzees: a laboratory colony. 

New Haven: Yale Univ. Press, 1943. 

Yerkes, R. M., & Exper, J. H. Oestrus, receptivity, 
and mating in chimpanzee. Comp. Psychol. Mon- 
ogr., 1936, 13, No. 65. Pp. 1-39. 

Yerkes, R. M., and Yerkes, A. W. Nature and con- 
ditions of avoidance (fear) response in chimpan- 
zee. J. comp. Psychol., 1936, 21, 53-66. 


ZUCKERMAN, S. The social life of monkeys and apes. 
London: Kegan Paul, 1932. 


AUTHOR INDEX 


Abelson, R. P., 351, 367 

Aberle, D. F., 31,51, 531 

Abt, L. E., 499, 519, 521 

Ach, N., 44, 51, 199-200, 219 

Ackerman, N. W., 174, 176 

Adams, D. K., 119, 123, 138, 257 

Adler, A., 14, 51, 175 

Adler, D. L., 202, 219 

Adler, F., 333 

Adorno, T. W., 174, 176, 247, 255, 456, 485 

Aichorn, A., 174, 176 

à Kempis, T., 498 

Albert, E., 131, 138 

Albig, W., 490, 511, 519 

Albrecht, Ruth, 246, 255 

Alexander, F., 179 

Alexander, G., 235, 255 

Allee, W. C., 536-537, 559 

Allport, F. H., 23-24, 26, 39-40, 42, 47, 49-51, 58, 69, 
79, 85, 87, 126, 138, 341, 366 

Allport, G. W., 3, 13-14, 18, 23, 45, 51-52, 76, 87, 93- 
94, 115-116, 119-120, 122, 129, 133, 138, 145, 178, 
224, 237-238, 244, 252, 255, 286-287, 335, 366, 376, 
403, 432, 491, 499, 510, 515, 519, 545, 559 

Alper, Thelma G., 201, 219 

Alpert, A., 108, 138 

Ames, A., Jr., 97-98, 118, 138 

Ammons, Carol H., 473, 485 

Ammons, R. B., 473, 485 

Anderson, Gladys L., 499, 519 

Anderson, H. H., 208, 219, 392-393, 399, 403, 499, 519 

Anderson, T. W., 361, 366 

Andrews, 'T. G., 500, 519 

Andreyev, A. A., 499 

Angell, R., 178, 485 

Angyal, A., 93, 95, 122, 138, 194, 219 

Anrep, G. V., 89 

Archer, W., 233, 236, 255 

Aristippus, 10 

Aristotle, 3, 6, 17, 51, 144, 498, 521 

Armitt, Fannie M., 287 

Arnheim, R., 506, 509, 511, 515, 519 

Arnold, Magda B., 237, 255 

Arnold, M., 497 

Arnold, T., 17,51 

Arrington, R. E., 371, 404 

Arsenian, Jean M., 209, 219 

Asch, S. E., 18, 23-24, 28-29, 40, 49-51, 92, 106, 113, 
119, 122-123, 138, 141, 178, 176, 182, 195, 211, 219, 
238, 255 

Ash, P., 277, 288, 494, 519 

Asheim, L., 492, 517, 519 

Ashley-Montagu, M. F., 18, 20, 51, 480, 444 


Attneave, F., 260, 264, 287, 348, 366 
Auhagen, F. F., 502 


Back, K., 181, 215-216, 219-220, 418, 430, 445, 462, 485 

Backer, R., 89 

Bacon, F., 17, 51 

Bagehot, W., 21-22, 24,51 

Baier, D. E., 439, 444 

Bailyn, Lotte, 289 

Bain, R., 405, 444 

Baker, B., 246, 255 

Baker, H., 6,51 

Baker, K., 511, 515, 519 

Baker, W. B., 429, 446 

Baldwin, A. L., 500, 519 

Baldwin, J. M., 22, 24, 51, 223, 238, 241-242, 255 

Bales, R. F., 172, 174, 176, 179, 374, 376-381, 390, 394, 
408, 495, 501, 511, 519 

Bard, P., 555 

Barker, R. G., 87, 107, 142, 182-183, 193-194, 198, 203, 
207, 219, 221-222, 388-389, 403-404, 408, 444, 484- 
485 

Barnard, C. I., 182 

Barnes, H. E., 5, 52 

Baron, D., 434, 444 

Barr, F. E., 429 

Barron, M. L., 517, 519 

Barry, H., III, 523, 531 

Bartlett, D., 289 

Bartlett, F. C., 35, 38, 44, 52, 110, 138, 468, 485 

Bartlett, K. G., 511, 519 

Bartlett, M. S., 275, 287 

Bartlett, R. J., 336, 366 

Basedow, H., 235, 255 

Bassett, R. E., 408, 444 

Bastian, A., 35, 52 

Bateson, G., 99, 111, 138, 255, 396, 403 

Battin, I. L., 308, 333 

Bauer, Alice H., 463, 467, 485 

Bauer, R. A., 34, 48, 52 

Baune, Helen B., 434, 438, 445 

Bavelas, A., 197, 199, 202, 213-214, 218-219 

Bayes, T., 361 

Bayley, Nancy, 241, 255 

Bayne, Martha C., 511, 515, 519 

Beach, F. A., 528, 530-531, 534, 544, 559 

Beaglchole, E., 171, 176 

Beard, C. A., 493 

Beazley, Virginia (See Hertzler, Virginia Beazley) 

Bechterew, V. M., 26, 47, 52 

Becker, C. L., 493 

Becker, H., 5, 20,52 e 

Beegle, J. A., 412, 428, 430, 438, 445-446 


563 e M 


564 


Beeley, A. L., 49 

Bell, G. B., 422, 424, 444 

Bell, H. M., 433 

Bellak, L., 499, 519, 521 

Bender, 1. E., 237, 247, 256 

Benedict, Ruth, 228, 255-256 

Benne, K. D., 232, 256 

Bennett, Edith B., 217, 219 

Bennett, J. F., 337, 354, 366 

Bentham, J., 5, 10-13, 16, 52, 53,55 
Berelson, B., 488-489, 494, 496, 498-499, 501-502, 506, 

508-512, 515, 517, 519, 522 

Bergman, D. V., 385, 408 

Bergmann, G., 58, 78, 80, 87, 336, 366 
Beria, L., 499 

Berlyne, D. E., 551, 559 

Bernard, L. L., 5, 22, 41, 49-50, 52, 172, 176 
Bernberg, R. E., 342, 366, 369, 461, 487 
Bernfeld, S., 144 

Bernheim, H., 25-27, 52 

Bernreuter, R. G., 433 

Bernstein, L., 534, 559 

Bettelheim, B., 495, 519 

Beum, C. O., 417, 419, 444 

Bierens de Haan, J. A., 538, 559 
Bindra, D., 532-533, 539, 559, 561 
Binet, A., 26-27, 46, 52 

Bingham, W. E., 533, 559 

Bingham, W. V., 465, 485 

Bion, W. R., 174, 176 

Birch, H. G., 108, 124, 138, 544, 559 
Bird, C., 49 

Bismark, O. von, 179 

Bitterman, M. E., 124, 138 

Bjerstedt, A., 415, 444 

Blackwood, Beatrice, 524, 531 

Blake, R. R., 138-139 

Blanton, Margaret, 23-24, 52 

Blanton, S., 23-24, 52 

Bliss, C. I., 326 

Blum, G. S., 170, 176 

Blumenstock, Dorothy, 507, 520 
Blythe, Irene T., 498, 519 

Boas, F., 170, 172, 528, 581 

Bock, R. D., 418-419, 444 

Bodine, A. J., 355, 367 

Bogardus, E. S., 49, 52, 353, 366 
Boguslavsky, G. W., 76, 87 

Boldyreff, J. W., 47, 52 

Bonney, M. E., 422, 428, 429-433, 497, 444 
Borgatta, E. F., 355-356, 366, 369, 405, 412, 421, 444 
Boring, E. G., 6, 25, 36, 48, 52, 92-93, 99, 102, 108, 

124, 138, 143-144, 176 

Bosanquet, B., 34, 52 
Bousquet, G. H., 17, 52 
Bowlby, J., 174, 176 
Bowman, C. C., 236, 256 
Braid, J., 25,27,52 — ^ 


AUTHOR INDEX 


Brandt, H. F., 208, 219 

Bray, D. W., 342, 366 

Bretsch, H. S., 434, 446 

Brett, G. S., 6, 34, 52 

Bridgman, P. W., 91-92, 102, 108, 138 

Brierley, Marjorie, 175-176 

Brill, A. A., 559 

Britt, S. H., 30, 49, 52 

Brogden, W. J., 59, 87 

Bronfenbrenner, U., 57, 66, 87, 412, 415-416, 420, 
428, 444 

Bronté, Charlotte, 500, 521 

Bronté, Emily, 500, 521 

Brown, C. W., 376, 404 

Brown, J. C., 228, 256 

Brown, J. F., 49-50, 109, 138, 173, 176, 182 

Brown, J. S., 65-68, 70, 80-81, 83, 87 

Brown, L. G., 49-50 

Brown, T., 26, 52 

Brown, W., 339, 392 

Browne, C. G., 438, 444 

Brownfain, J. J., 256 

Brücke, E., 144 

Brundage, E. G., 417, 444 

Bruner, J. S., 59-60, 87, 110, 113, 117, 138, 141, 491, 
496, 511, 516, 519 

Brunswik, E, 100, 103-104, 132-133, 138, 142, 184, 
219, 228, 230, 256 

Brush, F. R., 83, 87 

Buchanan, R. E., 515, 519 

Buchman, M., 141 

Bugelski, B. R., 174, 179, 285, 287 

Bugental, J. F. T., 244, 256 

Bühler, Charlotte H., 132, 138, 371, 403 

Bühler, K., 126, 138 

Bulganin, N. A., 499 

Bunker, H. A., 178 

Burchard, W., 252, 256 

Burgess, E. W., 55, 171-172, 176, 224, 238, 257 

Burt, H. E., 505, 520 

Bush, R. R., 65, 73, 77, 83, 87, 89, 289 

Buytendijk, F. J. J., 123, 138 

Byrd, E., 422-423, 444 

Byron, H. J., 20 


Cahnman, W., 515, 519 2 

Cameron, N., 224-227, 233, 235-236, 252, 256 

Campbell, A. A., 458, 485 

Campbell, D. T., 335-336, 342, 366, 460-462, 478, 485, 
487 

Cannell, C. F., 468, 478, 485 

Cannon, W. B., 114-115, 138, 235, 256, 555 

Cantril, H., 27, 30, 52, 110, 141, 259, 287, 357, 366 
367, 452, 456, 476-477, 485 

Caplan, G., 451 

Caritat, M. J. A. N. (See Condorcet) 

Carlson, A. J., 234, 256 

Carmichael, L., 221, 486, 559-560 


AUTHOR INDEX 


Carpenter, C. R., 538, 541, 559 

Carter, H. D., 228, 256 

Carter, L. F., $73, 382-384, 394, 403 

Cartwright, D., 207, 209, 214, 219-220 

Casey, R. D., 520 

Cassirer, E., 99, 125-127, 180, 138, 187, 220 

Catlin, G., 6, 52 

Cattell, J., 140 

Cattell, J. McK., 144 

Cervinka, V., 419, 444 

Chabot, J., 419, 444 

Chall, Jeanne, 497, 508, 519 

Chapin, F. S., 281, 287, 405, 410-412, 444-445 

Chapple, E. D., 384-385, 394-395, 403 

Charcot, J. M., 22, 25-27, 30, 46, 52 

Chasdi, Eleanor Hollenberg, 82-83, 88, 475, 485, 531 

Chase, S., 21, 52 

Chave, E. J., 260, 288, 341, 344, 347, 350, 365, 369 

Chen, I., 97, 138, 187, 193, 218, 220 

Child, C. M., 132, 188 

Child, I. L., 58, 71, 83-84, 86-87, 90, 507-508, 519, 
523-526, 528-531 

Childs, H. L., 520 

Chow, K. L., 534, 560 

Christiansen, H. F., 281-282, 287 

Christie, R., 559 

Cisin, I. H., 477, 485 

Claparède, E., 112, 138 

Clark, G., 544, 559 

Clark, K. E., 357, 365-367 

Clark, R. A., 82, 87 

Clarke, H. M., 44, 52 

Cline, V., 246-247, 256 

Clyde, R. J., 447 

Coch, L., 209, 213, 220 

Cochran, W. G., 263, 268, 273-275, 284-285, 287, 328, 
333 

Coffey, H. S., 389, 403-404 

Cohen, A. K., 31,51 

Cologne, Rose, 441, 445 

Combs, A. W., 101-102, 122, 141 

Comte, A., 3-4, 6-9, 15, 48, 52, 54 

Conant, J. B., 532, 559 

Condorcet, M. de, 16, 52 

Conn, J. H., 473, 485 

Cook, 3. W., 53, 192, 213, 220, 447, 456, 485-486 

Cooley, C. H., 23-24, 52, 228, 238, 242 

Coombs, C. H., 335, 37, 343, 358, 366 


Crawley, E., 235, 256 

Crespi, L. P., 471, 485 

Cressey, D. F., 248, 258 

Criswell, Joan H., 405, 411, 415416, 419, 422, 425- 
426, 428-430, 443-445 

Cronbach, L. J., 259, 264, 287, 358.939, 366, 397, 
405 

Crossley, Helen M., 480, 486 

Crown, S., 365-366 

Crutchfield, R. S., 18, 27, 49-50, 54, 91, 94, 107, 140, 
173, 179, 182, 211, 221, 238, 257, 335, 362, 367, 
452, 456, 485 

Cureton, E. E., 333 

Curran, F. J., 440, 446 

Curti, M. E., 6, 52 


Dale, E., 497, 506, 508, 519 

Danielsson, B., 423, 430, 445 

Darling, F. F., 541, 559 

Darwin, C., 16, 41, 52, 143 

Dashiell, J. F., 47, 52 

Davidson, D. M., Jr., 409, 441, 446 

daVinci, L., 175, 177 

Davis, A., 86, 88 

Davis, A. K., 31, 51 

Davis, F. B., 352, 366 

Davis, K., 172, 176, 533, 559 

DeGrange, McQ., 7, 52 

deGrazia, S., 498-499, 502, 508, 511, 517,519 

Delaisi, F., 17, 52 

deLange, M., 47, 52 

DeLott, J., 469, 487 

Dembo, Tamara, 87, 202-203, 208, 219-221 

Denenberg, V., 534, 561 

Dennis, W. D., 135, 138-139 

De Saussure, R., 175-176 

Descartes, R., 17, 41 

Deutsch, F., 500, 519 

Deutsch, K., 188, 220 

Deutsch, M., 53, 181, 192, 202, 214-216, 220, 399, 403, 
447, 456, 485-486 

Deutschberger, P., 431, 441, 445 

Devereux, G., 171-172, 176, 235, 256 

Dewey, J., 11, 17, 4142, 48, 51-52, 223 

Dewey, R., 73, 88 

Dickens, C., 11 

Dickson, Lenore, 409, 430, 432, 446 

Dickson, W. J., 438, 451, 486 


Cottrell, L. S, Jr., 5, 52, 86-88, 182, 224-225, 227, * Dietrich, D. H., 468, 487 


243, 247, 256 

Coughlin, C. E., 496, 521 

Coules, J., 216, 222, 398, 404 

Cousins, A. N., 252, 956 

Coutu, W., 182, 232, 256 : 
` Covner, B. J., 468, 485 

Cox, Gertrude M., 268, 268, 273-275, 284-285, 287 

Crane, S., 497 

Crawford, M. P., 540, 544, 559 


Dinsmoor, J. A., 79, 88 

Dittman, A. T., 385, 403 

Dixon, W. J., 285, 287, 289, 333 

Dollard, J., 12, 23-24, 42, 52, 55, 58, 62, 66, 70, 73, 
76, 81-86, 88-89, 97, 119, 141, 145, 174, 176, 227, 
257, 286, 288, 501, 519 
nne, J., 497 

du. z W., 21, 30, 49,42, 54, 69, 82-83, 88, 97, 119, 
138, 174, 176, 335, 366 


566 


Dostoevsky, F., 500, 520 

Droba, D. D., 45, 52, 344, 366 
Dropkin, V. H., 537, 559 

Dryden, J., 497 

DuBois, Cora A., 171, 176, 226, 241, 256 
Dudman, J., 361 

Duncan, D. B., 304, 333 

Duncker, K., 103, 107-109, 138-139 
Dunlap, K., 41, 49, 52-53 
Durkheim, E., 22, 37-38, 48, 53 
Dymond, Rosalind F., 237, 247, 256 


Ebel, R. L., 363, 366 
Eberhart, J. C., 477-478, 486 
Edgerton, H. A., 353, 366 
Edwards, A. L., 259-260, 263-264, 268-269, 273-274, 
278, 282, 284-287, 289, 291, 303-304, 333, 348, 351, 
357, 365-366 
Edwards, Daisy S., 415, 445 
Ehrenfreund, D., 131, 138 
Eichler, R. M., 326, 333 
Einstein, A., 182, 220 
Eisenhart, C., 289, 322, 334 
Eissler, K. R., 174, 176 
Elder, J. H., 544, 561 
Elinson, J., 477, 485 
Ellwood, C. A., 49, 51,53 
Emerson, A. E., 536, 559 
Emerson, R. W., 497 
Emery, D., 181 
Eng, E., 408, 445 
Engelmann, G., 5, 53 
Engels, F., 11, 17, 54 
England, L. R., 463, 485 
Entwisle, Doris, 289 
Epicurus, 10 
Erikson, E. H., 169, 171, 176 
Escalona, Sybille K., 207, 220 
Espinas, A., 35, 53 
Estensen, E. V., 490, 519 
Estes, W. K., 73, 77, 88 
Ewer, B. C., 49 
Eysenck, H. J., 365-366 


Faden, Janet M., 510, 515, 519 
Faigin, H., 531 

Fajans, Sara, 206, 220 
Faraday, M., 25, 53, 181 
Farber, I. E., 65-68, 70, 80-81, 83, 87 
Farberow, N. L., 249, 258 
Faris, E., 45, 53, 172, 176, 238 
Faris, R. E. L., 49 

Farnsworth, P. R., 49 

Faunce, D., 430, 438, 445 
Fechner, G., 144 

Feigl, H., 58, 88, 102, 107, 139 
Feldman, J. J., 477-478, 485 
Feldstein, M. J., 409, 422, 447 


AUTHOR INDEX 


Fenichel, O., 164, 176, 252, 256 

Féré, C., 43, 46, 53 

Ferguson, A., 336, 366 

Ferguson, G. A., 352, 366 

Ferguson, L. W., 351, 366 

Ferris, C. D., 289, 291, 302, 333 r 

Festinger, L., 182, 207-208, 214-217, 220-221, "3 
366, 418-419, 430, 445, 485 

Fichte, J. G., 35 

Fischel, W., 541, 559 

Fisher, R. A., 48, 261, 268, 270-271, 273-274, 285, 28 
312, 329-331, 333 

Fiske, D. W., 342, 367 

Fitch, F. B., 88 

Fitzpatrick, G. A., 439, 448 

Flemming, G., 433, 445 

Flesch, R., 497, 508, 519 

Flowerman, S. H., 471, 476-478, 487 

Foa, U. G., 414, 445 

Folsom, J. K., 49, 172, 176 

Foote, N. N., 86, 88 

Ford, C. S., 232, 256, 523, 525, 528, 530-531, 544, 559 

Ford, R. N., 355, 366 

Forgays, D. G., 533, 559 

Forgays, J. W., 533, 559 

Forlano, G., 351, 368, 433, 447 

Forsyth, Elaine, 416-420, 432, 445 

Foster, C. R., 505, 515, 519 

Foster, H. S., 506, 519 

Fouillée, A., 38, 53 

Fouriezos, N., 381, 404 

Frank, J., 175-176 

Frank, J. D., 209, 220 

Franklin, Muriel, 140, 202, 221 

Fredericson, E., 240, 256 

Freedman, H., 433, 447 

Freedman, M. B., 389-390, 403-404 

Freedman, R., 281, 287 

Freeman, E., 49 

Freeman, M. F., 326-327, 833 

Freeman, Willa (See Grunes, Willa Freeman) 

French, J. R. P., Jr., 66, 88, 185, 209, 213-215, 220 

French, R. L., 408, 422, 424, 432, 435, 444-445 

French, T. M., 168-169, 176 

Frenkel-Brunswik, Else, 176, 485 

Freud, Anna, 163-164, 168, 176, 178-179, 251-252, 256 

Freud, S., 5, 11-12, 14, 16, 18, 24-26, 28, 31, 35, 41, 44 
53, 58, 77, 82, 115, 143-180, 188, 223, 239, 256, 455, 
506, 520, 557, 559 

Friedlander, Kate, 174, 178 

Friedman, G. A., 174, 179, 523, 528, 531 

Friedman, M., 820, 338 ` 

Friedman, O., 49 

Fries, C. C., 497, 519 

Fritzsche, H., 518 

Fromm, E., 115, 189, 170, 173-175, 178 

Fromme, A., 342, 366 

Frosch, J., 176 


AUTHOR INDEX 


Fryer, D., 425, 439, 448 
Fuller, Elizabeth M., 434, 438, 445 
Fuson, W. M., $35, 367 


Gabel, Hortense, 517, 519 

Gabel, M., 517, 519 

Galileo, 109 

Gall, F. J.,8 

Gallagher, Ruth, 52, 243, 256 

Gallup, G., 48, 457 

Galton, F., 48, 53, 523 

Gardiner, B. B., 461, 485 

Garner, W. R., 348, 367 

Gault, R. H., 49 

Gauss, C. F., 109 

Gaylord, R. H., 352, 369 

Gehlke, C. E., 38, 53 

Gelb, A., 128, 139 

George, A., 503 

George, H., 9 

George III, 492 

Gerard, H. B., 181, 216-218, 220 

Gerbrands, H., 394, 403 

Gerson, J., 498 

Getzels, J. W., 460-461, 485 

Ghiselli, E. E., 376, 404 

Gibb, C. A., 425, 445 

Gibson, Eleanor, J., 64, 80; 88 

Gibson, J. J., 85, 88, 100, 112-113, 117, 132, 139 ` 

Gibson, W. A., 361 

Giddings, F. H., 20, 53 

Gillin, J. P., 86, 88, 525, 531 

Ginsberg, M., 523, 531 

Glaser, N., 462, 486 

Glixman, A. F., 201, 220 

Godwin, W., 16, 53 

Goebbels, J., 503, 518 

Goethe, J. W., 136 

Goldenweiser, A., 35, 38, 53, 531 

Goldman, Frieda (See Goldman-Eisler, Frieda) 

Goldman-Eisler, Frieda, 239, 256 

Goldsen, J., 488-489, 510, 514-515, 520 

Goldstein, K., 93-95, 111, 114-115, 122, 125-128, 132- 
184, 139, 141, 535, 559 

Goldwyn, R., 435, 446 

Gonick, Mollie R., 198, 219 

Goodacre, D. M., III, 428, 440, 445 

Goodenough, Florence L., 264, 287, 371, 480, 485, 
533, 559 

Goodenough, W. H., 356, 367 

Goodman, C. C., 60, 87 

Goodnow, R. E., 426, 429, 445 

Gordon, D. A., 452, 456, 485 

Gordon, L. V., 362, 367 

Gordon, M. H., 329, 331, 333 

Gottheil, E., 421-422, 445 

Gottlieb, Lillian, 517, 520 

Gottschalk, L, 178, 485 


o 


567 


Gough, H. G., 227, 252-253, 244-246, 248, 251, 256 

Goulden, C. H., 285, 287 

Grant, D. A., 284, 287 

Gray, W. S., 497, 508, 520 

Green, A. W., 172, 178 

Green, B. F., 355, 356, 360-361, 367 

Green, Helen D., 441, 445 

Green, T. H., $4, 53 

Greenberg, B., 541, 559 

Greenwood, E., 53 

Gregg, R. E., 536, 559 

Grey, A., 514, 516, 520 

Griffiths, W. J., 533, 559 

Gronlund, N. E., 437, 445 

Gross, N., 459, 463, 485 

Grosser, D., 210, 220 

Grossmann, Beverly, 429, 431, 433, 445 

Grubbs, F. E., 289, 291, 333 

Grunes, Willa Freeman, 222 

Grünthal, E., 543, 559 

Guest, L., 471, 476, 478, 485 

Guetzkow, H., 179, 381, 398, 404, 450, 485 

Guilford, J. P., 264, 287, 305, 319-320, 333, 345, 347, 
367 

Gulliksen, H., 264, 287, 336, 339-340, 348, 352-553, 
363, 367 

Gumplowicz, L., 4 

Gurnee, H., 49 

Guterman, N., 30, 54, 174, 179 

Guthrie, E. R., 58-59, 63, 65, 68, 70, 72-76, 84, 88, 
286-287 

Guttinger, F., 517, 520 

Guttman, L., 260, 287, 335-336, 339, 344, 353-358, 
361-362, 364-367, 397-398, 404, 477, 481 


Hacker, Helen M., 228, 256 
Haddon, A. C., 5,53 
Hake, H. W., 348, 367 
Hale, W. B., 498, 520 
Hall, C. S., 143, 534, 559 
Hall, G. S., 48, 53, 144, 223 
Hall, J. K., 178 

Hall, M., 88 

Hall, W. B., 223 
Hallowell, A. I., 171, 178 
Hamilton, A., 16 
Hamilton, C. H., 309, 334 
Hamilton, T., 508, 520 


* Hamilton, W., 26, 53 


Hammond, K. R., 342, 367, 461, 485 
Hamsun, K., 500 

Hanchett, Gertrude A., 341, 366 
Hanfmann, Eugenia, 126, 128, 132, 139 
Harary, F., 199, 220, 418, 447 

Harding, J., 181, 213, 220 

Hardyck, C., 230, 248, 257, 258 

Harlow, H. F., 118, 124, €39. 550-551, 559 
Harlow, Margaret K., 551, 559 


568 


Harriman, P. L., 138 

Harris, H., 510, 520 

Harris, M., 267, 287 

Harsh, C. M., 246, 256 

Hart, A., 498, 520 

Hart, C. W., 477478, 485 

Hart, H., 4, 48, 58, 494, 508, 505, 520 

Hartley, E. L., 49, 174, 179-180, 221, 229, 238, 256, 531 

Hartley, F. A., 494, 519 

Hartley, Ruth E., 49, 238, 256 

Hartmann, G. W., 240, 258 

Hartmann, H., 115, 189, 168-169, 178 

Hartshorne, H., 353, 368, 410, 445 

Haslerud, G. M., 554, 560 

Hastorf, A. H., 287, 247, 256 

Hatch, D. L., 515, 520 

Hatch, Mary A., 515, 520 

Havighurst, R. J., 86, 88 

Hawley, A. H., 281, 287 

Hayakawa, S. I., 17, 53 

Hayes, Catherine, 539-540, 559 

Hayes, K. J., 539-540, 559 

Hays, D. G., 355-356, 361, 366-367, 369 

Haythorn, W., 382, 403 

Hayworth, D., 498, 520 

Hayworth, Rita, 125 

Head, H., 110-112, 126-127, 139, 555 

Hebb, D. O., 66, 69, 88, 100, 139, 142, 532, 539-540, 
543-545, 547, 549, 551-554, 559-560 

Hegel, G. W. F., 3, 34-36, 53 

Heidbreder, Edna, 127-128, 139 

Heider, F., 96-97, 105-106, 119, 139, 182-183, 194, 199, 
210-212, 216, 220 

Helmholtz, H., 144 

Helson, H., 110, 139, 178 

Hendrick, I., 168, 178 

Heneman, H. G., Jr., 465, 485 

Henle, Mary, 202, 220 

Henmon, V. A. C., 246 

Henry, A. F., 355-856, 367, 369 

Henry, J., 171, 178 

Henry, Zunia, 171, 178 

Herbart, J. F., 25, 36, 53 

Herma, H., 506, 520 

Heron, W. T., 533-534, 560-561 

Herrick, C. J., 125, 139 

Hershkowitz, A., 91 

Herskovits, M. J., 256 

Hertz, P., 181 

Hertzler, Virginia Beazley, 428-432, 446 

Herzog, Herta, 509, 522 

Hevner, Kate, 260, 287, 347, 364, 367 

Heyns, R. W., 370, 378, 404 

Hilgard, E. R., 59, 65, 76, 88, 92, 95, 100, 116, 118, 
120, 124, 139, 170, 178, 208, 220, 244, 257, 465, 485 

Hill, H. W., 138, 142 

Hill, J. C., 533, 560 ib 

Hinckley, E. D., 351, 367 


AUTHOR INDEX 


Hitler, A., 4, 30, 34, 492, 520-522 

Hobbes, T., 3, 10, 13-16, 32-33, 53, 79 

Hobhouse, L. T., 523, 531 

Hocking, W. E., 39, 53 

Hoel, P. G., 263, 287 

Hollenberg, Eleanor (See Chasdi, Eleanor Hollen- 
berg) 

Hollingworth, H. Leta, 26, 58 

Holmes, F. B., 554-555, 560 

Holt, E. B., 23-24, 42, 53 

Holzberg, J. D., 435, 445 

Holzinger, K. J., 418, 444 

Homans, G. C., 70, 88, 182 

Hoppe, F., 13-14, 53 

Horney, Karen, 14, 53, 169-170, 173-174, 178 

Horowitz, E. L., 341-342, 367-368 

Horowitz, M., 218, 220, 222 

Horowitz, M. W., 212, 220 

Horowitz, Ruth E., 341-342, 367-368 

Horst, P., 353, 367 

Horton, C. P., 76, 88 

Horton, D., 493, 521, 523, 530, 531 

Horvitz, D. G., 267, 287 

House, F. N., 5,53 

Hovland, C. I., 59, 61-62, 84, 88, 248, 277, 287, 351, 
367, 450, 485 

Howes, D. H., 85, 90 

Hudson, B. B., 554, 560 

Hughes, E. C., 224, 257 

Hughes, J. W., 224, 257 

Hull, C. L., 26, 53, 58-59, 63-76, 80-81, 83, 85, 88, 90, 
92-93, 95-96, 100, 120, 124, 129-131, 189, 145, 169, 
188, 286-287, 333, 542, 560 

Humber, W. J., 73, 88 

Hume, D., 103-104, 532 

Humphrey, G., 23-24, 53 

Hunt, J. McV., 84, 88-89, 180, 221-222, 239, 257, 422, 
425, 431, 445, 534, 560-561 

Husain, Suraya Z., 418-419, 444 

Husserl, E. G., 101 

Hutt, M., 381, 404 

Huxley, J., 37, 125, 139 

Hyde, R. W., 440, 445 

Hyman, H., 469, 471, 477-478, 480, 485-487 

Hymovitch, B., 485, 588, 560 


Immergliick, L., 141 
Infeld, L., 182, 220 


` Infield, H. F., 441, 445 


Irwin, F. W., 286-287 
Irwin, Lucietta, 222 
Isaacs, Susan, 135, 139 


Jack, Lois M., 374, 391-392, 404 
Jackson, J. H., 126-127, 189 
Jacob, P. E., 512, 520 

Jacobs, J. H., 438, 445 


AUTHOR INDEX 


Jacobsen, C. F., 554, 560 

Jacobsen, M. M., 554, 560 

Jacobson, E., 237, 257 

Jaensch, E. A., $5, 5$ 

Jahoda, Marie, 53, 174, 176, 181, 192, 220, 447, 456, 
485-486 

James, W., 22, 25, 34, 37, 41-42, 44, 46, 48, 53, 125, 
139, 144, 160, 223, 238-239, 257 

Janet, P., 25-26 

Janis, I. L., 489, 520 

Janowitz, M., 172, 180, 495, 519 

Jaspen, N., 277, 288 

Jefferson, T., 16, 18, 28-29 

Jeffress, L. A., 140 

Jenkins, J. G., 428, 440, 446 

Jenkins, R. L., 440, 446 

Jenkins, W. O., 58, 76, 83, 87-88 

Jennings, Helen H., 406, 409, 411, 413-416, 419, 422- 

424, 427, 434, 437, 489, 443, 445-447 

Jersild, A. T., 554-555, 560 

John, W. F., 83, 87 

Johnson, A. D., 437, 446 

Johnson, H. M., 336, 367 

Johnson, P. O., 263, 274, 285, 288 

Jones, Dorothy B., 506, 509-511, 520 

Jones, D. S., 223 

Jones, E., 16, 53, 172, 178, 500, 520 

Jones, E. $., 545, 560 

Jones, H. E., 538, 554, 560 

Jones, L. V., 262, 288 

Jones, Mary C., 118-119, 139, 554, 560 

Jordan, N., 212, 220 

Jowett, B., 51, 55 

Jucknat, M., 208, 220 

Judd, C. H., 35, 39, 53 

Jung, C. G., 34, 53, 146, 175 


Kaganovich, L. M., 499 
Kahane, M., 409, 446 

Kahn, L. A., 355, 367 

Kahn, R. L., 482, 485 

Kamin, L. J., 85, 90 

Kamiya, J., 253, 258 

Kant, I., 25 

Kantor, J. R., 49 

Kanzer, M., 500, 520 

Kaplan, A., 488-489, 510, 514-515, 520 
Kaplan, D., 514, 516, 520 
Kaplan, Edith, 132, 136, 142 
Kardiner, A., 166, 170-171, 178 
Karpf, Fay B., 5,53, 172, 178 
Karsten, Anitra, 202, 220 
Kasanin, J., 126, 132, 139 


Katona, G., 11, 54, 109, 139, 481, 485 E 

Katz, D., 49, 54, 357, 367, 471, 476-477, 482, 485, 551, 
560 

Katz, I., 541, 560 


Katz, L., 416-420, 425, 432, 445-446 


Kearl, B., 497, 520 

Kecskemeti, P., 174, 178, 504, 520 

Keet, C. D., 385, 404 

Keller, Helen, 127, 189 

Kelley, H. H., 216, 218, 220, $51, $67, 485 

Kelly, E. L., 342, 367 

Kempf, E. J., 16, 54 

Kempthorne, O., 285, 288 

Kendall, M. G., 264, 288, $20, DI 

Kendall, Patricia L., 451, 486 

Kendler, H. H., 107, 159 

Kenney, Kathryn C., 260, 287, 365-366 

Kerr, W., 434, 438, 448 

Kerstetter, Leona M., 437, 441, 446, 448 

Khrushchev, N. S., 499 

Kilpatrick, F. P., 260, 287, 357, 566 

Kimble, G. A., 61, 88 

Kingsbury, Susan M., 494, 520 

Kingsley, C., 11, 500, 519 

Kinsey, A. C., 334, 451-458, 457, 468, 466, 480, 485 

Klein, G. S., 69, 88, 113, 116-117, 140, 143 

Klineberg, O., 49 

Kluckhohn, C., 5, 12, 35, 87, 54, 86, 89, 170-171, 178, 
255, 451, 473-474, 485, 528, 530-531 

Kluckhohn, Florence, 475, 486 

Koch, S., 58, 88 

Koffka, K., 44, 54, 95, 97-98, 101, 108, 110-118, 122, 
133, 140, 191, 221, 238, 257, 286, 288 

Kohler, W., 23-24, 54, 92, 96-98, 103-104, 108, 118, 
122-124, 130-131, 140, 539, 542-543, 549-550, 554, 
560 

Kolbe, L. E., 353, 366 

Komarovsky, Mira, 228, 257 

Korsakow, S., 112 

Kosygin, A., 499 

Kounin, J. S., 201-203, 219, 221 

Kracauer, S., 513, 520 

Krech, D., 18, 27, 49-50, 54, 69, 88, 91, 94-95, 107, 111, 
140, 173-174, 178-179, 182, 211, 221, 238, 257, 335, 
362, 367, 532 

Kriedt, P. H., 357, 365-367 

Kriesberg, M., 516, 520 

Kris, E., 143-144, 168-169, 174, 178-179, 491, 500, 506, 
508, 512, 520 

Kroeber, A. L., 5, 37, 54, 170, 179, 486, 527, 531 

Kropotkin, P. A., 18-20, 54 

Krout, M. H., 49 

*Krueger, E. T., 49 

Krugman, Dorothy C., 256 

Kruskal, W. H., 289, 320, 384 

Kubie, L. S., 170, 179 

Kuder, G. F., 339, 432 

Kuenne, Margaret R., 131, 140 

Kuhlen, R. G., 434, 446 

Külpe, O., 144 

Kurtz, A. K., 277, 288 


fe 


570 


Lambert, Elisabeth C., 57 
Lambert, W. W., 57, 60, 62, 85, 88 
Lamoreaux, R. R., 131, 141 
Landis, M. H., 505, 520 
Lange, C., 160 
Lange, L., 43, 54 
Lange, N., 43, 54 
Langer, Susanne K., 127, 129, 140 
Lanzetta, J., 382, 403 
LaPiere, R. T., 49, 341, 367 
Lashley, K. S., 95, 98, 110-111, 127, 130-181, 140 
Lasswell, H. D., 175, 179, 219, 485-486, 490-491, 495- 
496, 499-505, 507-508, 510, 512, 514, 516, 520-522 
Lavoisier, A. L., 532 
Lawrence, David H., 500, 521 
Lawrence, Douglas H., 61, 88 
Lawsing, Margaret, 429, 446 
Lazarsfeld, P. F., 335, 343, 355, 359, 361, 367, 416, 
455, 486, 490, 495, 519, 521 
Lazarus, M., 3, 35-36, 54 
Leary, B. E., 497, 508, 520 
Leary, T. F., 389, 403-404 
Leavitt, H. J., 199, 218, 221, 409, 425, 439, 448 
LeBon, G., 22, 25-26, 29-31, 54 
Lecky, P., 14, 54, 239, 257 
LeDantec, F., 14-15, 17, 54 
Lee, A. McC., 496, 512, 521 
Lee, Elizabeth B., 496, 512, 521 
Lee, I. J., 140 
Lee, V., 17, 54 
Leeman, C. P., 442.443, 446 
Leeper, R. W., 65, 88, 93, 109, 122, 140, 190, 196-197, 
199, 221 
Leighton, A. H., 21, 54 
Leites, N., 174, 178, 488-489, 491, 499, 504-505, 508, 
511-512, 517-518, 520-522 
Lemann, T. B., 420, 427, 432, 446 
Lenin, N., 28-29, 499 
Lerner, D., 219, 485-486, 506, 521 
Levine, Estelle M., 507-508, 519 
Levinson, D. J., 176, 485 
Levy, D. M., 170, 179, 473, 486, 534, 560 
Levy, M. J. 31, 51 
Levy, N., 89 
Levy, S., 215-216, 221 
Levy-Bruhl, L., 7, 38, 54 
Lewin, H. S., 492, 521 
Lewin, K., 47-48, 54, 58, 67, 77-78, 87-88, 94, 101, 140, 
145, 169, 173-174, 181-193, 195-209, 212-214, 217% 
219, 221.223, 238, 257, 286, 288 
Lewis, D. J., 89 
Lewis, Helen B., 122, 140, 202, 221 
Lewis, J. H., 234, 257 
Lewis, M. M., 242 
Lewis, P. M., 510, 520 
Lichtenberg, P., 181 
Liebeault, A. A., 25, 54. œ 
Light, S. F., 536, 560 


AUTHOR INDEX 


Likert, R., 260, 287-288, 344, 351-353, 357, 360, 364- 
366, 368 

Lindemann, E., 385, 403 d 

Lindesmith, A. R., 49, 51, 87-88, 126, 140, 257 

Lindner, R. N., 499, 521 

Lindquist, E. F., 269, 274, 288, 366, 368 

Lindsley, D. B., 555, 560 

Lindzey, G., 50, 54, 143, 174, 179, 405, 434-435, 446 

Link, R. F., 289, 304, 320, 334 

Linnaeus, C., 537 

Linton, R., 171, 179, 224, 257 

Lippitt, R., 48, 54, 209-210, 212-215, 218, 220-222, 
370, 374, 387, 394, 396-397, 404, 423, 447 

Lippmann, W., 17, 54 

Lipps, T., 23-24, 54, 231 

Lissner, Kate, 202, 222 

Littman, R. A., 95, 140, 183, 222 

Locke, H. J., 172, 176 

Locke, J., 15, 54 e 

Loevinger, Jane, 339, 357, 368 

Loewenstein, R. M., 168-169, 174, 178-179 

Lombroso, C., 30 

Longmore, T. W., 412, 428, 446 

Loomis, A. M., 371, 404 

Loomis, C. P., 409, 411-412, 414-417, 421, 425, 428- 
430, 432, 441, 446-447 

Lord, E., 323-324, 334 

Lord, F. M., 352, 368 

Lorenz, K., 534-585, 541, 560 

Lorge, I., 337, 850, 868, 497, 508, 521 

Loveland, E. H., 333 

Lowell, A. L., 534 

Lowenfeld, H., 175, 179 

Lowenthal, L., 30, 54, 174, 179, 500, 504, 509, 511, 
517, 521 

Luce, D. R., 418, 446 

Luchins, A. S., 110, 117, 140, 532 

Lukasczyk, K., 5, 35, 54 

Lumsdaine, A. A., 88, 277, 287, 450, 485 

Lundberg, G. A., 405, 407-409, 428-482, 446 

Lynip, A. W., 126, 140 

Lyons, J., 212, 220 


McBride, A. F., 542-543, 560 

McCarthy, Dorothea, 135, 140 

McClelland, D. C., 70-71, 85, 89, 174, 179, 224, 257, 
528, 528, 531 

McClelland, F. M., 437, 446 

Maccoby, Eleanor E., 449 

Maccoby, N., 449, 485 

MacCorquodale, K., $35, 368, 550, 560 

McCulloch, T. L., 554, 560 

McCurdy, H. G., 500, 521 

McDougall, W., 5, 9, 12-14, 16-18, 20-25, 27-28, 32, 35, 
3942, 45-46, 48, 50-51, 54, 58, 144-145, 179, 188 

McGarvey, H. R., 864, 368 

McGinnies, E., 117, 141 

McGranahan, D. V., 492, 504, 511, 521 


AUTHOR INDEX 


McGregor, D., 209, 222 

Macgregor, Frances C., 396, 404 

McGuffey, W. H., 490, 519 

Mach, E., 92 

Machiavelli, N., 15-16, 54 

McKellar, P., 550, 560 

McKinney, J. C., 414-415, 446 

MacLeod, R. B., 18, 54, 58, 89, 93, 101-102, 106-107, 
119, 122-123, 140 

McNemar, Q., 219, 274, 284-285, 288, 331, 334-385, 
368, 479, 486 

McReynolds, P., 58, 89 

MacStewart, W., 314, 384 

Madison, J., 16 

Magaret, G. Ann, 208, 220, 226, 233, 256 

Mahan, A. T., 498 

Mahler, Wera, 202, 222 

Mahut, 551 

Maier, N. R. F., 21, 54, 66, 89, 109, 131-132, 140, 538, 
560 

Malenkov, G. M., 499 

Malinowski, B., 35, 54, 78, 171-172, 179, 527 

Maller, J. B., 410, 445 

Mann, F., 482, 486 

Mann, H. B., 290, 315-316, 323, 325, 334 

Mannheim, K., 11, 17, 54 

Mantel, N., 323, 334 

Marder, E., 355, 368 

Marks, M. R., 262-263, 288 

Marrow, A. J., 201, 222 

Martin, C. E., 480, 485 

Martin, E. D., 31,54 

Marx, K., 11, 17, 84, 54 

Marx, M. H., 79, 89, 93, 140 

Maslow, A. H., 114-116, 140, 550, 560 

Mason, Marie K., 533, 560 

Mason, W., 459, 463, 485 

Massarik, F., 409, 446 

Massey, F. J., Jr., 285, 287 

Mather, K., 285, 288 

Mathews, Ravenna, 248, 257 

Maucorps, P. H., 409, 439, 446 

Maurel, H., 53 

Maxwell, C., 181-182, 222 

May, M. A., 21, 52, 54, 120, 141, 853, 368, 410, 445 

Mayer, A., 46, 54 

Mazelis, Sarah, 232, 257 

Mead, G. H., 22, 24, 54, 86-87, 89, 182, 223-224, 226- 
227, 232-233, 238, 242-243, 245, 257, 539 

Mead, Margaret, 21, 54, 171-172, 179, 255, 396, 403- 
404, 430, 447, 473, 486 

Mecht, P. E., 57, 89, 335, 368, 550, 560 

Meirowitz, Beatrice, 382, 403 

Meltzer, H., 13, 54 k 

Melzack, R., 551, 554, 560 

Mensh, 1. N., 106, 141, 432, 445 

Merrill, K., 219 


571 


Merton, R. K., 224, 249, 257, 451, 454, 486, 507-508, 
521 

Mesmer, F. A., 25, 54 

Metzner, Helen, 482, 486 

Meyer, D. R., 551, 559 

Meyerson, A., 49 

Meyerson, L., 91 

Michotte, A., 104-106, 141 

Mikoyan, A. I., 499 

Miles, Josephine, 490, 497, 508, 512, 521 

Mill, J. S., 10-13, 16, 55 

Millay, Edna St. V., 497 

Miller, D. R., 174, 179, 435, 447 

Miller, G. A., 85, 89, 91, 141 

Miller, N. E., 12, 23-24, 42, 52, 55, 58-59, 62, 66, 70-71, 
73-74, 76, 81-86, 88-89, 97, 119, 124, 141, 145, 174, 
176, 179, 206, 222, 227, 257, 286, 288, 542, 550, 560 

Miller, S. M., 463, 486 

Minturn, L., 85, 89 

Mintz, A., 516, 521 

Mitrano, A. J., 85, 89 

Moede, W., 47, 55 

Molotoy, V. M., 499 

Montgomery, K. C., 551, 560 

Mood, A. M., 263, 267, 287-289, 314, 322, 333-334 

Mooney, R. L., 434 

Moore, B. V., 465, 485 

Moore, H. T., 47, 55 

Moore, O. L., 58, 89 

"Moreno, J. L., 48, 55, 227, 230, 236, 253, 257, 405, 
407-411, 413-414, 416-417, 422-423, 425-430, 436-438, 
441-443, 447 

Morgan, Christiana D., 46, 55, 461, 486 

Morgan, C. T., 240, 258 

Morgan, W., 178 

Morris, C. W., 54, 128-130, 141, 489, 539, 560 

Morse, Nancy C., 485 

Moses, 166-167, 175, 177, 179 

Moses, L. E., 289, 312, 814-315, 321, 323, 334 

Moshman, J., 304, 334 

Mosier, C. I., 343, 348, 368 

Mosteller, F., 65, 73, 77, 83, 87, 89, 289, 343, 345, 363, 
368 

Motz, Annabelle B., 227, 257 

Mowrer, O. H., 59, 64-66, 71-78, 82, 84-86, 88-89, 114, 
120, 181, 141, 145, 176, 501, 519, 550-552, 560-561 

Muensterberger, W., 171-172, 178, 18 

Muhlhan, Gertrude, 500, 519 


«Mukerjee, R., 49 


Müller, G. E., 44, 55 

Müller, J., 144 

Miinsterberg, H., 43, 47, 55 

Murchison, C., 43, 49-52, 55, 188, 178, 179, 221, 366, 
403, 561 

Murdock, G. P., 62, 86, 89, 523, 525-526, 528-531 

Murphy, G., 6, 48-50, 52, 55, 133, 141, 182, 223, 250, 
257, 276, 288, 364, 368, Sit 


572 


Murphy, Lois B., 21, 48-50, 55, 276, 288 

Murray, H. A., 12, 46, 54-55, 70, 89, 145, 178, 237, 
255, 286, 288, 461, 486 

Mussen, P. H., 342, 368 

Myers, C. S., 35, 336, 366 


Naboisek, H., 229, 257 

Nacht, S., 169, 179 

Napoleon, 129 

Neiman, L. J., 224, 257 

Nelson, C. J., 512, 522 

Nelson, M. J., 246 

Newcomb, T. M., 49-51, 55, 173-174, 179-180, 182, 
221, 223, 232, 244, 257, 276, 288, 342, 368, 409, 
422, 431, 447, 461, 486, 531 

Newstetter, W. I., 409, 422, 447 

Newton, I., 181-182, 532 

Neyman, J., 291, 334 

Nietzsche, F., 14, 55 

Nissen, H. W., 532, 534, 540, 542, 551, 560 

Niven, J. R., 353, 365, 368 

Noland, E. W., 355, 368 

Norman, R., 199, 920 

Northway, Mary L., 405, 412, 414, 422, 426-427, 434- 
435, 497, 447 

Notcutt, B., 237, 257 

Nottingham, Elizabeth K., 228, 257 

Novikoff, A. B., 125, 141 

Nowlis, Helen H., 81, 89 

Nowlis, V., 81, 89, 486 

Nunberg, H., 168, 179 

Nuttin, J., 123, 141 


Oersted, H. C., 181 
Ogburn, W. F., 4, 55, 531 
Ogden, C. K., 16, 55 
Ojemann, R. H., 491, 521 
Olds, J., 82, 85, 89 
Olesen, Barbara A., 447 
Olson, W. C., 371, 392-393, 404 
Opler, M. E., 171, 179 
Ort, R. S., 228, 257 
` Osburn, H., 385, 404 
Osgood, C. E., 75, 84, 89, 128-129, 141 
O.S.S. Assessment Staff, 342, 368, 451, 486 
Ossorio, A. G., 389, 403-404 
Ovsiankina, Maria (See Rickers-Ovsiankina, Maria) 


Pace, C. R., 341, 368, 486 

Paget, Violet (See V. Lee) 

Palmer, Georgiana P., 498, 512, 521 
Papania, N., 222 

Pareto, V., 11, 17, 52, 54-55, 78 
Park, O., 536, 559 

Park, R. E., 45, 55, 224, 238, 257 
Park, T., 536, 559 

Parry, H. J., 480, 486 

Parsons, Elsie C., 170, 179 . 


í 


AUTHOR INDEX 


Parsons, T., 5, 11, 38, 40, 55, 61, 69-70, 78, 89-90, 142, 
172, 179, 182, 190, 194, 222, 223-225, 227, 258, 257 

Parten, Mildred B., 456, 486 

Paterson, D. G., 465, 485 

Paul, B. D., 473, 486 

Pavlov, I. P., 57, 61, 64, 72, 89, 130, 169 

Payne, S. L., 341, 368, 456, 465, 468, 485.486 

Pear, T. H., 21, 55 

Pearson, K., 48, 291, 318, 334, 415-416, 437 

Pelley, W. D., 502 

Pepinsky, H. B., 409, 415-416, 435, 440, 446-447 

Pepinsky, Pauline N., 406, 421, 423, 447 

Pepitone, A. D., 212, 222 

Perkins, D. T., 88 

Perlmutter, H. V., 212, 220 

Perry, A. D., 418, 446 

Perry, R. B., 39, 55 

Perry, S. E., 175, 180 

Peterson, D. R., 233, 248, 256 c 

Phadke, N. S., 49 

Piaget, J., 38, 55, 110, 115, 132-137, 141, 242-243, 257, 
451, 471-472, 486 

Pilzecker, A., 44, 55 

Pintner, R., 351, 368, 438, 447 

Plath, O. E., 536-537, 561 

Plato, 3, 5-6, 16, 18-19, 29, 32, 58, 55-56 

Poe, E. A., 497 

Poisson, S. D., 326-328 

Polansky, N. A., 210, 220-222, 374, 387, 399-400, 404, 
493, 447 

Pomeroy, W. B., 480, 485 

Pool, I. deS., 488-489, 508, 511,517, 521 

Posner, Rita, 435, 445 

Postman, L., 57, 59, 87, 89, 110, 113-114, 117, 138, 
141, 357, 368 

Potter, Elmer H., 507-508, 519 

Powell, R. M., 412, 428, 430, 446-447 

Pratt, C. C., 92, 102, 117, 141 

Prentice, W. C. H., 201, 222 

Prince, M., 26 

Proctor, C. H., 405, 411-412, 414-415, 419, 421, 428- 
429, 446-447 

Proshansky, H. M., 342, 368, 486 

Pumpian-Mindlin, E., 178-179 

Punke, H. H., 511, 521 


Quarrington, B., 412, 426, 447 
Queener, F. L., 49, 89, 173, 179 
Quetelet, A., 48 


Radke, Marian J.» 472, 480, 486 
Rainoff, T. J., 491, 521 

Ramsey, G. V., 138-139 

Rapaport, D., 138-139, 168, 179, 184, 221 
Rapaport, Dorothy, 222 

Ratliff, J. A., 437, 446 

Reckless, W. C., 49 


AUTHOR INDEX 


Redi, F., 210, 222, $74, 387, 404, 423, 447 

Reik, T., 172, 180 

Reinhardt, J. M., 49 

Reisman, D., 172, 180 

Reiss, A., 514, 521 

Révész, G., 539, 561 

Reymert, M. L., 138, 141, 220 

Ribot, T., 20-21, 55 

Rice, S. A., 5, 52, 55, 477, 486 

Richardson, Helen M., 432, 447 

Richardson, M. W., 339 

Richmond, Winifred, 434, 447 

Rickers-Ovsiankina, Maria, 128, 139, 201, 222 

Rickert, Edith, 490, 521 

Riesen, A. H., 534, 551, 560-561 

Riesman, D., 462, 486 

Riker, B. L., 362, 368 

Ritchie, B. F., 107, 141 

Rivers, W. H. R., 35, 55, 170, 180, 235, 257 

Roberts, J. M., 486, 530 

Robinson, B., 535, 560 

Robinson, J. H., 17,55 

Roby, T. B., 89 

Roethlisberger, F. J., 458, 451, 486 

Rogers, C. R., 18, 55, 115, 121-122, 141, 244, 257, 385, 
404, 451, 465-466, 486 

Rogers, Maria, 438, 447 

Rogers, W. C., 515, 521 

Roheim, G., 35, 55, 171-172, 176, 178-180, 520 

Rohrer, J. H., 54, 140-141, 559 

Rokeach, M., 247, 257 

Romney, K., 486 

Roosevelt, F. D., 496, 508, 522 

Roper, E., 457 

Rorschach, H., 178, 249-250, 326, 333, 382, 420, 435, 
447, 452, 460, 499, 521 

Rosander, A. C., 341, 368 

Rose, A. M., 227, 257 

Rosen, E., 95, 140, 183, 222 

Rosen, S., 210, 221 

Rosenbaum, M., 256 

Rosenberg, B. G., 230, 244, 258 

Rosenstock, J., 461, 486 

Rosenzweig, S., 201, 222, 236, 252, 257, 342, 445, 550, 
561 

Roslow, S., 364, 368 

Ross, E. A., 34, 9, 19, 27, 30-31, 48, 55 

Ross, Helen, 179 

Ross, I. C., 418, 447 

Ross, R. T., 88 

Ross, S., 62, 90, 534, 561 

Rossi, P. H., 361, 368, 464 

Rothmann, E., 128, 141 

Rousseau, J. J., 15, 56 

Royce, J., 36, 55 

Rubin, E., 98 

Ruesch, J., 138 

Rundquist, E. A., 365 


D 


573 


Runion, H. L., 498, 512, 521 
Ruskin, J. 11 
Russell, B., 6, 14, 55 


Sabine, G., 6, 55 

Saffir, M. A., 260, 288, 347.348, 364, 368 

Sait, E. M., 208, 220 

Salter, Patricia, 494, 509-511, 515, 517, 519 

Sanders, I. T., 441, 447 

Sanderson, D., 405, 447 

Sanford, F. H., 542, 368, 461, 486 

Sanford, R. N., 176, 485 

Sapir, E., 40, 170-172, 180, 256 

Sarason, S., 236, 257 

Sarbin, Genevieve, 223 

Sarbin, T. R., 223-224, 227, 229-230, 232, 294, 257- 
238, 240, 244, 246-249, 253, 255, 257-258 

Sargent, J., 409, 437, 446-447 

Sargent, S. S., 49, 224, 258, 550, 561 

Schachter, S., 215-217, 220, 222, 418, 430, 445, 485 

Shaeffle, A., 35, 55 

Schanck, R. L., 49 

Schauer, G., 440, 447 

Scheerer, M., 91, 126, 128, 131-152, 139, 141 

Scheffé, H., 304, 334 

Scheidlinger, S., 167, 180 

Scheler, M., 20-21, 55 

Schilder, P., 111, 115, 133, 141, 240, 258 

Schlesinger, H., 113, 140 

Schlipp, P. A., 51 

Schlosberg, H., 64-65, 71, 80, 89-90, 257, 534, 560 

Schmidt, J. K. (See M. Stirner) 

Schmidt, K. P., 536, 559 

Schneider, B. H., 117, 141 

Schneider, D. M., 178 

Schneider, L., 175, 180 

Schneirla, T. C., 124-125, 141, 532, 534-536, 538-539, 
541, 560-561 

Schopenhauer, A., 16, 55 

Schramm, W., 521 

Schuessler, K. F., 248, 258 

Schultz, J. H., 237, 258 

Schutz, W., 515, 521 

Schwartz, M., 351, 367 

Schwartz, S., 256 

Scott, J. P., 541, 561 

Seagoe, Mary V., 433, 447 

Sears, Pauline S., 80, 89, 208, 221, 486 

© Sears, R. R., 69-71, 81-84, 86-89, 145, 170, 176, 180, 

243, 248, 258, 286, 288, 481, 486, 524-525, 531, 540, 
543, 561 

Seeman, M., 416, 447 

Seligman, C. G., 170, 180 

Sellars, R. W., 141 

Semmes, Josephine, 534, 560 

Sengupta, N. N., 49 

Seward, J. P., 76, 89, 128139, 141 

Shakespeare, W., 497-498, 521 


574 AUTHOR INDEX 


Shanas, Ethel, 491, 521 Sorel, G., 17, 55 
Shand, A., 46, 55 Sorokin, P. A., 5, 18, 21, 51, 55 
Shapiro, J., 477-478, 486 Spearman, C. E., 232-233, 339, 392 
Shartle, C. L., 409, 439, 448 Speier, H., 174, 179, 508, 520 
Shaw, G. B., 28 Spence, K. W., 57, 59, 61-62, 69-70, 74, 78, 80, 87, 
Sheats, P., 232, 256 91-93, 96, 124, 130-131, 141, 184, 222, 336, 366 
Sheatsley, P. B., 465, 471, 486 Spencer, H., 4-5, 11, 19-20, 32, 43, 48, 56 
Sheffield, F. D., 76, 88-89, 277, 287, 450, 485 Spenser, E., 497 
Sherif, M., 49.50, 54, 109-110, 140-141, 342, 351, 367- — Speroff, B., 434, 438, 448 
368, 559 Sperry, Margaret, 82-83, 88 
Shils, E. A., 5, 40, 55, 61, 69, 89-90, 142, 172, 179-180, Spier, L., 256 
190, 194, 229, 923.225, 227, 253, 257 Sprott, W. J. H., 49 
Shipley, T. E., 85, 89 Sprowls, J. W., 5, 56 
Shirley, Mary M., 241, 258 Spurgeon, Carolina F. E., 490, 497, 521 
Shneideman, E. S., 499, 521 Stalin, J., 499, 521 
Shoobs, N. E., 437, 447 Stanley, J. C., Jr., 58, 88 
Shor, J., 506, 520 Stanton, A. H., 175, 180 
Shvernik, N. M., 499 Stanton, F. N., 519, 521 
Sickles, W. R., 240, 258 Starbuck, E. D., 48, 56 
Sidis, B., 25-26, 30, 55 Steele, Mary, 408, 428-429, 446 
Siegel, L., 409, 435, 447 Steinthal, H., 3, 35-36, 54 
Sighele, S., 25, 30, 55 Steinzor, B., 380-381, 396, 404 
Silva, A. L. M., 237, 257 Stellar, E., 240, 257-258, 534, 560 
Simmel, E., 169, 174, 180 Stember, H., 469, 471, 486-487 
Simmel, G., 30, 55 Stephenson, R. W., 247, 258 
Simmel, Marianne, 105, 139, 210, 212, 220 Stern, W., 93-94, 106, 111, 132, 134, 141-142, 371 
Simmons, L. W., 523, 531 Stevens, N. E., 491, 521 
Simon, C. W., 287 Stevens, S. S., 61, 87-90, 92, 99, 140-142, 335, 337, 
Simon, H. A., 182 560 
Simpson, G. G., 125, 141 Stevenson, R. L., 26 
Sisson, E. D., 362, 368 Stewart, F. A., 408, 429-430, 441, 448 
Skinner, B. F., 63-66, 71, 73-74, 85, 90, 92, 96, 102, Stewart, Isabel A., 408, 448 
141, 286, 288 Stewart, Jeannie C., 541, 561 
Sletto, R., 365 Stewart, M., 516-517, 521 
Sliosberg, S., 202, 222 Stewart, Naomi, 471, 476-478, 487 
Sloane, E. H., 93, 141 Stine, Margaret E., 174, 179, 435, 447 
Smelser, N., 431, 447 Stirner, M., 14, 56 
Smith, A., 10, 19-21, 55 Stogdill, R. M., 409, 425, 439, 448 
Smith, B. L., 520 Stone, C. P., 87, 89, 287, 560 
Smith, G. H., 342, 368 Stouffer, S. A., 18, 56, 253, 258, 276-277, 288, 329-331, 
Smith, H. L., 478, 486 334-335, 343, 356, 367-369, 450, 465, 476, 482, 487 
Smith, J. J., 49 Stratton, G. M., 56 
Smith, Kate, 507-508 Strauss, A. L., 49, 51, 87-88, 196, 140, 257 
Smith, M., 431, 447 Strodtbeck, F. L., 486 
Smith, M. B., 238, 258 Stroud, J. B., 62, 90 3 
Smith, R. G., Jr., 364, 368 Suchman, F. A., 835, 355, 357-358, 369 i 
Smith, S., 76, 90 Sullivan, H. S., 173, 182 
Smith, W. I., 62, 90, 534, 561 Sullivan, P. L., 228.229, 258 
Smucker, O., 414-415, 447 "Sumner, W. G., 4 
Smythe, D., 493, 521 Sussman, Leila A., 494, 517, 521 
Snedecor, G. W., 268, 285, 288, 326, 334, 345, 868 Sutherland, J. D., 439, 448 
Snyder, W. U., 385-386, 404 Suttie, I. D., 18, 56 
Snygg, D., 101-102, 122, 141 Sutton, F. X., 31, 51 
Socrates, 29 Swahson, G. E., 174, 179-180, 531 


Solomon, R. L., 60, 72-74, 79-80, 85, 88-90, 257, 276- Swed, Frieda S., 289, 322, 334 
277, 288, 420, 422, 425, 427, 431-432, 445-446, 534, Symonds, P. M., 238, 258, 468, 487 


560 è Szekely, L., 109, 142 
E 


AUTHOR INDEX 


Taeuber, Irene B., 490, 511, 515, 517, 522 

Taft, R. A., 494, 519 

Tagiuri, R., 409, 426-427, 429, 435, 445, 448 

Talbot, E., 409, 438, 448 

Tannenbaum, R., 409, 438, 446, 448 

Tarde, G.. 3, 8, 22, 24, 30, 48, 56, 81 

Taylor, D. W., 287 

Taylor, W. S., 145, 180 

Terman, L. M., 331, 334 

Thelen, H., 395-396, 404 

Thibaut, J. W., 215-217, 220, 222, 

Thistlethwaite, D., 118, 142, 342, 

"Thomas, Dorothy S., 371, 398, 404 

"Thomas, W. I., 43-45, 56 

"Thompson, W. R., 532.534, 551, 561 

Thorndike, E. L., 13, 41, 56-57, 61, 63, 76, 90, 114, 
142, 496 

"Thorndike, R. L., 279-282, 288 

Thorpe, W. H., 537, 561 

Thouless, R. H., 49 

"Thrasher, F. M., 437, 448 

"Thurnwald, R., 35, 56 

Thurstone, L. L., 48, 56, 78, 260, 264, 287-288, 335, 
341, 344-345, 347-348, 350-351, 360, 362, 364-369, 
400, 409 

Tinbergen, N., 63, 90, 535, 561 

Tippett, L. H. C., 263, 285, 288 

Titchener, E. B., 10, 12, 26, 44, 56, 144 

Toby, J., 253, 258 

Tocher, K. D., 312, 334 

Tolman, E. C., 61, 72-78, 78, 90, 92-94, 96, 104, 108, 
110-111, 113, 117-118, 122, 124, 129, 142, 145, 169, 
172-173, 182-183, 188, 190, 207, 222, 226, 245, 258, 
286, 288, 538, 541, 561 

Tomkins, S. S., 145, 180 

Torgerson, W. S., 335, 337, 369 

Travers, R. M. W., 362, 369 

Travis, L. E., 47,56 

Triplett, N., 46-47, 56 

Troland, L. T., 12, 56 

Trotter, W., 20, 41, 56 

Tryon, R. C., 533, 561 

Tucker, L. R., 352, 369 

Tukey, J. W., 289, 304, 326-327, 333-334 

Turner, F. J., 493 

Tylor,*E. B., 523, 528, 531 


398, 404, 485 


Urdan, J. A., 434, 446 


Vaihinger, H., 236, 258 

Valentine, C. W., 551, 561 

Vallance, T. R., 85, 90 

Vanatta, E. L., 409, 435, 447 

Van Dyne, Virginia E., 433, 448 
Van Spanckeren, W. J., Jr., 244, 258 
Vaughan, C. E., 6, 56 

Vaughan, M. F., 49 

Veblen, T., 175, 180 


€ 


^ 


575 


Veitch, J.. 53 

Vernon, P. E., 76, 87, 237, 432 
Veroff, J. A., 85, 89 

Vicary, J. M., 461, 487 

Vico, G., 3, 56 

Vigotsky, L. S., 126, 142 

Vogt. E. Z., 530 

von Bertalanffy, L., 95, 111, 142 
von Frisch, K., 537-538, 561 
von Holst, E., 111 

von Humbolt, K. W., 35 

von Senden, M., 100, 142, 584, 553, 561 
Voroshilov, K. E., 499 
Vreeland, F. M., 430, 448 


Wade, M., 150, 140 

Waelder, R., 28, 56, 168, 174, 180 

Wald, A., 290, 321 

Walk, R. D., 85, 90 

Walker, Helen M., 263, 269, 288 

Wallace, D. L., 289, 304, 320, 334 

Wallach, H., 92, 99, 142 

Wallas, G., 18, 41, 56 

Waller, W. W., 172, 180, 223, 249, 258 

Wallin, P., 223, 228, 258 

Wallis, W. A., 289, 320, 328, 330-331, 334 

Wallis, W. D., 39, 56 

Walworth, A., 491-492, 517, 522 

Waples, D., 488-489, 496, 508, 511-512, 519-520, 522 

Wapner, S., 105, 117, 131-132, 142 

Ward, C. H., 17, 56 

Warden, C. J., 125, 142 

Warner, W. L., 228, 235, 258 

Warren, H. C., 27, 45, 56 

Warren, R. L., 252, 258 

Washburn, Margaret F., 44, 56 

Wasmann, E., 536, 561 

Waterhouse, I. K., 87 

Watson, G., 47, 56, 219 

Watson, J. B., 42, 56, 57-58, 73, 90, 92, 94, 550 

Watson, P. O., 60, 85, 88 

Wayne, I., 492, 504, 511, 521 

Weaver, C. L., 289, 291, 333 

Weiss, P., 132, 142 

Weiss, W., 84, 88, 90 

Welles, O., 30 

Wells, F. L., 173 

Werner, H., 105-106, 117, 124, 126, 128, 132-133, 136- 
137, 142, 239-240, 250, 258 

Wertheimer, Max, 98, 108-109, 134, 142 

Wertheimer, Michael, 100, 142 

Weschler, I. R., 342, 369, 409, 412, 438, 446, 448, 461, 
487 

Wever, E. G., 109, 142 

Whaley, F., 222 

Wheeler, G. C., 523, 531 

Wheeler, W. M., 534, 530-537, 561 

Wherry, R. J., 352, 369, 425, 439, 448 


[s € 


576 


White, C. T., 65, 90 

White, L. A., 38, 56, 126, 142, 525, 531 

White, R. K., 48, 54, 207, 209, 214-215, 218, 221-222, 
496, 501, 510, 522 

Whiteman, P. H., 534, 559 

Whiting. Beatrice B., 523, 528, 530-531 


Whiting, J. W. M., 58, 71, 83-84, 86-87, 90, 486, 523- 


526, 528-531, 551, 561 

Whitney, D. R., 290, 312, 315-318, 323, 325, 334 
Whittemore, I. C., 47, 56 

Whitton, J. B., 520 

Wiehe, F., 209, 222 

Wiener, N., 34, 56, 76 

Wigdor, Blossom T., 435, 447 
Wilbur, G., 171-172, 178, 180 
Wilcoxon, F., 289-290, 314-315, 334 
Wild, J., 56 

Wilkinson, N. M., 398, 404 

Wilks, S. S., 308, 334, 352-353, 369 
Willey, M. M., 489, 494, 511, 515,522 
Williams, D. C., 532 

Williams, J. D., 230, 258 

Williams, J. M., 49 

Williams, S. B., 409, 425, 439, 448 
Williams, W., 14, 56 

Willkie, W. L., 496 

Willoughby, R. R., 173 

Wilson, T., 289 

Wilson, W., 498, 521 

Winch, R. F., 172, 180 
-Windelband, W., 6, 56 

Winslow, C. N., 432, 448 

Wise, G. W., 524-525, 531 

Wishner, J., 106, 141 

Wissler, C., 24, 56 

Withall, J., 395, 404 

Witkin, H. A., 109, 117, 142 
Wittreich, W. J., 113, 142 
Wolfenstein, Martha, 504-505, 518, 522 
Wolff, K. H., 55 

Wolfowitz, J., 290, 321 

Wolman, S., 441, 448 

Wood, L., 103, 142 

Woodward, J. L., 469, 487, 505, 508, 522 


AUTHOR INDEX 


Woodworth, D. G., 223 

Woodworth, R. S., 41, 56, 74, 116, 118, 142, 145, 261, 
288, 551, 561 

Wordsworth, W., 497 

Wormhoudt, A., 500, 522 

Wright, Beatrice A., 91, 198, 219 

Wright, G. O., 83, 90, 523, 528, 531 

Wright, H. F., 107, 142, 182-183, 193-194, 219, 221. - 
222, 286, 288, 388-389, 403-404, 484-485 

Wright, Q., 512, 522 

Wright, R., 501 

Wrighter, Joyce, 429, 431, 433, 445 

Wulff, J. J., 89 

Wundt, W., 5, 8, 10, 27, 35-37, 44, 46, 56, 144, 370 

Wyant, Rowena, 509, 522 

Wyatt, D. F., 478, 487 

Wyatt, T., 497 

Wynne, L. C., 72-73, 85, 90 


Yakobson, S., 491, 522 

Yates, F., 262, 273, 287, 314, 333 

Yeats, W. B., 497 

Yerkes, A. W., 554, 561 

Yerkes, R. M., 104, 130, 539, 543-545, 549-551, 554, 
561 

York, R. H., 440, 445 

Yoshioka, J. G., 554, 560 

Young, K., 45, 49, 56, 224, 238, 243, 258 

Young, P. T., 115, 142 

Youtz, Cleo, 289 

Yule, G. U., 498, 522 


Zander, A., 394, 404 

Zeigarnik, Bluma, 112, 200-201, 218, 222 
Zelen, S. L., 244, 256 

Zeleny, L. D., 409, 414, 422, 426-428, 437, 440, 448 
Zeller, A. F., 201, 222 

Zener, K. E., 109, 142, 257 

Zilboorg, G., 178 

Zimmerman, C., 357, 368 

Znaniecki, F., 43-45, 48, 56 

Zubek, J. K., 532 

Zuckerman, S., 541, 561 


J 


SUBJECT INDEX 


Abstraction, 127-128, 132, 186-137 
Accident proneness, 434, 438 
Achievement, and sociometric status, 431 
Acquisition, theories of, 63-65 
See also: learning, conditioning, stimulus-response 
theory 
Acting, 233, 249 
Action, 224-228, 230 
Action research, 212-213 
Adaptation, 133-134 
Additive property, 336-337 
Adjustment, 227 
social, sociometric measures of, 426-427, 435 
Adolescence, 198 
Afferent interaction, 100 
After-only design, 277 
Age, and sociometric status, 430 
Aggression, 71, 83, 159-160, 168-169 
in chimpanzees, 546-550 
Altruism, 541-543 
Ambivalence, 166, 206 
Analogical theory, 18-19, 32-34 
Analysis of variance, 267-268, 290, 304-307, 320-821, 
327.328 
error variance, 327-328 
one-way classification, 304-305 
ranked one-way, 290, 320-321 
short-cut allowances in, 290, 304-807 
two-way classification, 305-307 
Anger, 549-550 
Animal magnetism, 25 
Animal psychology, 124-125, 532-558 
relevance of, 532-534 
studies of intelligence in, 533-534 
Animation, 106 
Annoyance, 245 
Antecedent-to-response analysis, 66 
Anthropology, 86 
and psychoanalysis, 170-171 
use of interviews in, 473-475 
See algo: cross-cultural method 
Anticathexis, 147, 156, 163 
Anticipatory phenomena, 62-63, 116 
Antisocial behavior, 248 
Anxiety, 85, 160-163, 530 
See also: psychoanalytic theory, anxiety 
Anxiety reduction, 65 
Aphasia, 128 
Apparent behavior, 104-107, 212 D 
Applied research, 260 
Approach, constructive vs. classificatory, 187-188 
Arc sin transformation, 345 
Aristotelian thinking, 185 


s 577 


Ascendance, $91-$92 
“As if" behavior, 234, 236.238, 249 
Assignment therapy, 437-438 
Association, 37, 67, 73-74, 152, 199-200 
by similarity, 37 
Associationism, and suggestion, 26 
Associative chain theory, 127 
Attitude, 43-46, 97, 119.120, 238, 335.336, 357-858, 
431-482, 476478, 534, 543-548 
definition of, 43, 45, 335-336 
effect of interviewer's, 477-478 
intensity of, 357-358 
recognition of, 544-545 
Attitude change, 84, 216-217 
Attitude measurement, 48, 335-366 
Attitude scale, zero point of, 357-858 
See also: attitude measurement, scaling 
Attitude universe, 336, 340 
Aufgabe, 43, 45-46, 110, 128 
Aussage test, 342 
Authoritarianism, 209, 247 
Autism, 135, 153 


Babbling, 126 
Balanced Latin square design, 285 
Barrier, 198, 204, 206 
Before-after design, 277 
Behavior, biosocial vs. psychosocial, 535 
in field theory, 191-192 
Behavior disposition, 129 
Behavior setting, 194, 219 
Behavioral contagion, 210, 387-388 
See also: imitation 
Behaviorism, 57, 91-93, 129-130 
See also: conditioning, stimulus-response theory 
Bias, 351 
Binomial distribution, 291-292, 326 
Body image, 111-114, 240 
Boundary, 196 
Brain injury, 128 
Bureaucracy, 249 


Capacity, man vs. animal, 125 
\ Capacity fulfillment, 115-116 
Caste, insect, 535-536 
Castration fear, 161, 166 
Category, discrete vs. continuous, 374-975 
Category sets for observation systems, 372-375 
Category system, Bales', 376-378 
conference research problem solving, 378-880 
Steinzor's social interaction, 380-381 
Cathexis, 146-147, 156, 163. © 


578 


Causation, 103-104, 186-187, 194 
systematic vs. historical, 186-187 
Change, socially induced, 209-210, 217 
Character formation, 165 
Chi-square test, 293-294, 329, 345 
Child development, 83-84 
See also: Cognitive development, development 
Child psychology, 391-393, 471-473 
Child-rearing practices, 239, 243, 524-525 
Chimpanzees, social behavior of, 540, 549-550 
Choice, 207 
Circular reflex, 126 
Co-acting group, 47 
See also: face-to-face group, small group theory 
Codification, 99 
Coefficient of homogeneity, 357 
of reproducibility, 355-356 
Coercion, 194-195 
Cognition, and conation, 114-123 
definition of, 94 
and manifest organization, 103-104 
and reinforcement, 117-118 
role in behavior of, 91-93, 107-108 
and secondary reinforcement, 118-120 
Cognitive control, 117 
Cognitive development, 132-137, 238-244 
Cognitive organization, 18, 23, 28-29, 108-109 
and social perception, 104-107 
Cognitive structure, 197-198, 207, 211 
Cognitive theory, 91-137 
vs. stimulus response theory, 93, 96-97, 113-115 
118-120 
Cohesiveness, 214-216 
Collective, consciousness, 35 
representation, 37-38 
unconscious, 34 
Common enemy, 14 
Common segment theory, 39-40 
Commonality, 101, 194-195 
Communication, 125-127, 216-217, 587-540 
in insect societies, 537 
See also: content analysis, information theory, 
language 
Competition, 202 
Conation, see: motivation 
Concept, 125-180, 539 
in field theory, 187 
Concept formation, 126-127, 130-182, 242 
Concreteness, 127-128, 136-137 
Conditioned response, type R vs. type S, 64 
Conditioning, 23, 26, 59-61, 96, 118-119, 130-131 
higher order, 62 
and imitation, 23 
spontaneous recovery in, 61 
and suggestion, 26 
temporal relations in, 61 
See also: association, learning, stimulus-response 
theory 


, 


SUBJECT INDEX 


Conditions, experimental, arrangement of, 270 
Confidence limits, 305 

Conflict, 197-198, 205-207, 248, 251-253 

Conformity, 21-24, 216-217 

Confounding, in experiments, 269, 276 

Conscience, 153-154 

Consciousness (conscious) , 57, 148, 155-156 j 


of kind, 20 
See also: Psychoanalytic theory, conscious 


Consistency, 121-122 
Consonance, 198 

Constancy, 98-99, 239; 251-252 
Contemporaneity, 186 
Content analysis, 488-518 


audience of content, 503-505 
categories of, 510-512 
formal categories, 511-512 
substantive categories, 510-511 
conditions for exact counting, 512-514 
definition of, 488-489 
effects of content, 505-507 
focus of attention, 505-506 
response to communication, 506-507 
formal characteristics of content, 495-498 
exposure of propaganda, 495-496 
readability, 496-497 
stylistic features, 497-498 
history of, 489-490 
inference in, 516-518 
and personality measurement, 500-501 
producers of content, 498-503 
detection of propaganda, 501-502 
determination of psychological states, 499-501 
intent of communicator, 498-499 
political and military intelligence, 502-503 
and projective tests, 499 
qualitative analysis, 512 
recording unit vs. context unit, 507 
reliability of, 514-515 
sampling in, 515-516 
substantive characteristics of, 490-495 
communication standards, 494 
comparison of communication media, 492-493 
content vs. its objectives, 493-494 
development of scholarship, 491 
international differences in content, 491-492 
as a research tool, 494-495 
trends in content, 490-491 
technical problems of, 512.518 
units of, 507-510 
character, 509 
interrelationship of units, 509 
item, 509 
space and time units, 509 
theme, 508-509 
unit of classification vs, unit of analysis, 507-508 
word, 508 


SUBJECT INDEX 


uses of, 490-507 
validity of, 504-505 
Contiguity theory, 63, 73-77 
Continuity in learning, 75 
Control group, function of, 276-277 
number of subjects in, 277 
Control group design, extension of, 276-277 
Conversation of gestures, 22 
Cooperation, 21, 202, 540-541 
Copying, 23 
See also: imitation 
Correction for continuity, 309 
Covariance techniques, 274 
Craze, 31 
Critical ratio test, 302-303 
Critical research, 260 
Cross-cultural method, 523-531 
and behavior science, 523-525 
history of, 523 
* research in, 528-531 
sampling in, 526-527 
units of, 527-528 
See also: anthropology 
Crowd psychology, 25-26, 28-31 
Cue-drive learning, 72 
Cultural determinism, 38-39 
Cultural relativism, 101 
Culture, 36-37, 85-86, 101, 113, 167-168, 208, 224, 239, 
266, 430-431 
effect of on cognition, 101, 113 
effect of on level of aspiration, 208 
and personality, 239 
and psychological processes, 266 
and sociometric status, 430-431 
Culture trait, 527 
Culturology, 38, 525-526 
Curiosity, 118 
Custom, 526-528 
Cybernetics, 34 


Death instinct, 146, 149, 156-159, 169 

Death wish, 235 

Decalogue, 14-15 

Decentering, 133-134 

Decision making, 207, 217-218 

Defense mechanism, 162-164 
See also: ego defense 

Delinquency, 437 

Democracy, and crowd behavior, 30 

Demographic variables, and sociometric variables, 

429-430 

Derivations, doctrine of, 17 

Descriptive analogy, as theory generalization, 79-80 

Determination, proximal vs. distal, 95-97 
relational, 98-99, 130 

Determining tendency, 27, 110, 199-200 

Development, psychological, 203-204 


579 


See also: child development, cognitive develop- 
ment, differentiation, maturation 
Developmental stage, 133-137 
Deviation, 217 
Differentiation, 132, 192, 196, 203-204, 239-240 
of the life space, 192 
of regions, 196 
Diffusion, 24 
Dimension dominance, 218 
Direction, in field theory, 189, 197, 199 
Directionality of result, 330 
Discontinuity, in learning, 75 
Discreteness, 136 
Discrimination, 62, 130-131 
Displacement, 82-83 
Disposition, behavior, 129 
Dissociation of consciousness, 25-26, 29 
Distal determination, 95-97 
Distance, in field theory, 197 
Distinguished path, 189, 196, 199 
Dominance, 392, 541 
Drive, 60, 68, 70-71, 114-115 
effect of increase in, 68 
learnable, 70-71 
viscerogenic, 114-115 
Drive reduction, 63-64, 72, 116 
Drive-response learning, 72 
Drive strength, 67-68 
Dual theories of acquisition, 64-65, 71-73 
Dyadic interaction, 30, 83, 86-87, 384-387, 540-541 
Dynamic approach, in field theory, 188-189 
Dynamogenesis, 25, 46 


Echo principle, 23 
Ecology, psychological, 185, 193-195, 388-889 
Economic man, 10-11, 558 
Ecstasy, 235 
Education, and sociometry, 430, 436-438 
Ego, 111-114, 117, 120-123, 148, 150-155, 159, 162, 168- 
169, 238, 254 
Freudian, 148, 150-155, 159, 162, 168-169 
vs. self, 238 
See also: body image, psychoanalytic theory, ego, 
self 
Egocentrism, 135 
Ego defense, 14, 162-164, 251-252 
Ego ideal, 153 
Ego instinct, 157 


, Ego involvement, 201 
' Egoism, 13-15 


Electromagnetism, 181-182 
Emergence, 37, 125-126 
Emotion, 65-67, 122-123, 553-558 
and learning, 65-67 
See also: feeling, mental qualities, specific emo- 
tions (i.e., anger) 
Emotional learning, 71 
speech, 126 "o 


580 SUBJECT INDEX 


Empathy, 20, 23, 231-232, 237-238, 246-248, 409, 427, 


539-540 
Energy, 145-147, 153, 156 
Enlightenment, The, 15-16 
Environment, 189-190, 204-212, 533-534 
in field theory, 189-190, 204-212 
Epiphenomenalism, 37 
Equal-appearing intervals, method of, 347 
Equalitarianism, 247 — . 
Equivalence of meaning, 452-453 
Erogenous zone, 158 
Eros, see life instinct 
Eroticism, 158 
Error, choice technique, 342, 461 
in rating, 271-272 
Type I vs. Type II, 262-263, 291 
variance, 278 
Escape behavior, 206 
Estimates of time, 409-410 
Ethics, utilitarian, 10 
Ethnocentrism, 247, 339 
Event system, 40, 85 
Evolution, 11, 19-20 
Ex post facto design, 281-282 
Excitation, 68 
Excitement, 551-552 
Experimental design, types of, 277-285 
Experimental error, 268 
reduction of, 272-274 
Experimental method, 50, 260-261 
beginnings of, 46-48 
Experimental technique, 274-275 
Experiments, planning and execution of, 259-287 
Extinction, 63-65 


F-test, 268, 272-273, 294-302 
Face-to-face group, 47, 376-384 
See also: small group theory 
Face validity, 340, 398, 422-423 
Faces test, 341-342 
Factor analysis, 352 
Factorial design, 282-284 
Fallacy of unitary explanation, 9 
Family structure, 165-167, 429 
Fear, 70-72, 548-552 
as a learnable drive, 70-72 
Feeling, 155, 239 
See also: emotion 
Field, psychological, 185 
See also: life space 
Field study, 213 
Field theory, 181-219 
basic concepts of, 189-207 
changes in psychological environment, 207-212 
definition of, 182 
dynamic concepts in, 199-207 
mathematics in, 189, 195 
metatheory of, 182-18) 


in physics, 181-182 
psychological approach in, 183-185 
structural concepts of, 195-199 
Fixation, 240 
Flexibility, 136, 247, 250 
Fluidity, 201 
Folk mind, 35-36 
psychology, 35-37 
Force, psychological, 188-189, 204-207, 209 
Forced choice technique, 362-363 
Fractional anticipatory goal response, 62 
Frame of reference, 109-111, 194 
Friendship, 541-543 
Frustration, 83, 147, 203-204, 549-551 
and learning, 66-68 
Functional autonomy, 120 
See also: motives, transformation of 


Generalization, 82-83 

of results, 265, 274 

of theory, 77-81 

See also: stimulus generalization 
Generalized other, 377-378 
Genetic reductionism, 124-132 
Geographic environment, 97-98 
Genotype, 187 
Gestalt psychology, 18, 23, 28-29, 40, 97-99, 103-1 

119, 122-123, 130-132 

Gesture, 242 
Gradient of generalization, 61-62 
Gravitation, 181-182 
Gregariousness, 19-20 
Group atmosphere, 48, 209, 380-381 
Group decision, 217-218 
Group, definition of, 214 
Group dynamics, 212-219 
Group fallacy, 39 
Group mind, 18-19, 31-40, 213-214 

analogical theory of, 18-19, 32-34 

as collective representations, 37-38 

as collective unconscious, 34 

common segment theory of, 39-40 

criteria of, 32, 39 

as cultural determinism, 38-39 

as folk psychology, 35-37 

as objective mind, 34-35 n 
Group participation scale, 409 
Group preference record, 409 
Group psychology, 167 

See also: small group theory 
Group structure, 218, 428-429 

sociometric measures of, 428-429 
Guess who technique, 410 
Guilt, 82, 154, 161-162 
Guttman type scale, 353-358, 361-362 


H statistic, 320-321 
H technique, for scaling, 356 


SUBJECT INDEX 


Habit, 17-18, 4245, 67, 96, 115, 258, 526.528 
family hierarchy, 96 
strength, 67 
Hallucination, 149 
Halo effect, 562-363 
Hedonism, 10-15, 114, 120 
Hedonistic calculus, 10, 16 
Heredity, 533-534 
Heteronomous event, 194, 210 
Heteroscedasticity, $25 
Hierarchy, of function, 131-133, 136 
of needs, 116 
of roles, 253 
Hodological space, 189, 196-197, 199 
Homeostasis, 114-115, 238-289 
Homogencity, of a scale, 339-340 
of variance test, 275 
Hostility, 548-552 
H using, 441 
Human nature, 552-558 
Human Relations Area Files, 529-530 
Hunger, 234 
Hypnosis, 25-27, 233-234, 236-237, 249-250 
Hypochondriasis, 247 
Hypothesis, 110, 113 
testing, 261, 332 
Hypothetico-deductive method, 65 
Hysteria, 234-235 


“I” concept, 242-243 
See also: ego, self 
Id, 148-150, 153-154, 163 
Idea psychology, 23 
Identical element, 37, 75, 129 
Identification, 23-24, 28, 84, 121, 146-147, 153, 165- 
167, 227, 236 
Identity of perception, 150 
Ideology, 11, 13, 17 
Ideo-motor theory, 23, 25 
Image, in crowd behavior, 29-30 
Imagery, 237 
Imagination, 226-227, 236 
Imitation, 18-19, 21-24, 81-82, 119, 210, 228, 387-388 
definition of, 24 
forms of, 22 
laws of; 22 
Impulse, 149 
Impunitiveness, 236-237 
Index of social distance, 411-412 
Individual differences, 70, 269 
postulate, 70 
Individualism, 11 : 
Industry, and sociometric techniques, 438 
Inferiority complex, 14 
Information theory, 34, 91, 99 
Inhibition, 64, 68 
Insects, social phenomena among, 534-537 
Insight, 104, 108-109, 131, 247 


Instinct, 18, 24, 4142, 146, 148-149, 156.160, 169 
Freudian, 146, 148-149, 156-160, 109 
of submission, 24 
of thought, 18 
See also: Psychoanalytic theory, instincts 
Institution, social, 165-167 
and group mind, $1 
Instrumental conditioning, and imitation, 28 
Instrumental response, 60 
Integration, $92 
Intelligence, 431, 535.584, 553-555 
and emotion, 553-555 
and sociometric status, 431 
Intention, 116-117, 120-121 
Interaction chronograph, 394-395 
Interaction effects, 276-277, 282, 284 
Interaction recorder, 394 
Interaction theory, 228, 238-239, 376.578, $80.81, 
384-385, 535 
Interests, 120-121, 205 
Interjection, 126 
Internal validity, see: face validity 
Interpersonal mechanisms, 389-891 
Interrupted tasks, 200-202, 218 
Intervening variable, 69, 78, 107, $35 
Interview, 449-484 
as a research tool, 450-451 
content analysis of, 495 
definition of, 449 
funnel, 459 
nondirective probing in, 466, 478 
open us. closed questions in, 457-459 
vs. other techniques, 482-484 
phrasing of questions in, 455-459, 476477 
reliability of, 453-454 
role relationships in, 462-464 
standardized vs. unstandardized, 451-455 
validity of, 454, 479-482 
Interview data, recording of, 467-470, 478-479 
coded vs. verbatim, 469-470 
field vs. office coding, 469 
field vs. office rating, 468-469 
memory vs. mechanical, 468, 470 
Interviewer bias, 462, 475 
characteristics of, 471, 476-478 
error, sources of, 475-479 
Interviewing, of children, 471-473 
creating rapport in, 464-465 
* cross-cultural, 473-475 
direct vs. indirect, 460-462 
field administration of, 471 
historical roots of, 449-450 
techniques of, 465-467 
Introjection, 121 
See also: identification 
Invariance, 99 
Inverse sine transformation, 326-328 
Involvement, in a role, 233-235 
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Irrationalism, 15-18 
Irreality, level of, 192-193, 236 
Isomorphism, 98 
Item, 338, 341-350, 363-364 
form of, 343-344 
monotone vs. nonmonotone, 343-344 
scaling of, 344-350 
selection of, 350, 364 
types of, 341-343 
Item analysis, 351-352 


Judgment, 153 
Justice, 19, 168 


Lability, 136, 250 
Lady tea taster, 270-271 
Laissez-faire, 10-11 
La morale, 7-9 
Language, 125-130, 242-243, 538-540 
See also: communication 
Language differences, in interviewing, 473-474 
Latency period, 158 
Latent, class analysis, 360-361 
distance scale, 361-362 
learning, 117-118 
structure analysis, 352, 359-362 
variable, 385 
Latin square design, 284-285 
balanced, 285 
Laughter, 20 
Law, of affective evolution, 15 
of effect, 11-13, 63, 114, 120-121 
of three stages, 4, 7 
Leadership, 28-30, 209, 232-233, 382, 410-411, 424- 
425, 439, 541 
sociometric measures of, 424-425, 439 
Learnable drive, 70-71 
Learning, 59-77, 108-109, 117-118, 121, 123-125, 207, 
242 
definition of, 67 
of emotions, 123 
us. performance, 69 
processes of, 61-63 
See also: acquisition, cognitive organization, con- 
ditioning, extinction, stimulus-response theory 
Level of aspiration, 208-209 
Leviathan, 13 
Libido, 146, 157, 169 
Life instinct, 146, 157-159 
Life space, 184-185, 189-195 
dimensions of, 192 
Likert type scale, 351-353 
Locomotion, direction of, 189 
in field theory, 196, 204 
“Looking glass” principle, 238, 243 
Love, 158 
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Manifest organization, 103-104 
Mann-Whitney test, 290, 315-317 
Marital relationship, 227 
Matched-dependent behavior, 82 
See also: imitation 
Matched groups design, 278-281 
regression effects in, 279-281 
Matching problem, 290, 807-312 
Maturation, 239 
Meaning, 84-85, 126-127, 452-453 
Measurement, 272, 336-337 
random errors of, 272 
See also: experiments, quantitative techniques 
Mechanism, 188 
Median test, 290, 314 
Mediating reaction, 129 
Membership motive, 215 
Memory, 111-112, 152 
Mental hygiene, 534 
Mental qualities, 155-156 
Mesmerism, 25 
Metaphysical absolute, 14 
Milieu, nonpsychological, 193-195 
Military applications of sociometric techniques, 439- 
440 
Model, 332-333 
Molar vs. molecular unit, 63, 74, 94-109, 183 
Molar unit, criticism of, 95 
varieties of, 94-95 
Monoideism, 25, 27 
Moral anxiety, 161 
Moral realism, 38 
Morale, sociometric measures of, 428, 440 
Motivation, 114-193, 156, 188, 374, 380-381, 397, 543- 
548, 550-552 
inferred from behavior, 374, 380-381, 397 
intrinsic, 122, 551-552 
Motives, transformation of, 15, 70-71 
Multivariate analysis, 332 
Mutually shared field, 106-107 
Mystic writing pad, 152 


Narcissism, 146-147 
National character, 492 
Need, 114-117, 156-157 
cognitive, 114-116 : 
effect of on perception, 117 
primary, 114-115 
See also: drive 
Neurosis, 252 
Neurotic anxiety, 160-161 
"New look" experiments, 115, 117 
Nondirective counseling, 18 
Nonparametric methods, 290-291, $12.32 
Nonpsychological milieu, 193-195 
Normal test, 302-303 
Null hypothesis, 291, 293 


SUBJECT INDEX 


Objective anxiety, 160 
Objective mind, 34-35 
Obligation, 226 
Observation, 48, 261, 376-393, 483-484, 495 
of children, 391-393 
of dyadic situation, 384-387 
of face-to-face groups, 376-384 
in a field setting, 387-391 
in therapy situation, 385-387 
Observation systems, category sets for, 372-375 
context of, 378, 380-381, 384 
dimensions of, 374-375 
orientation of the observer in, 377-383, 387, 389- 
390 
pre vs. post categorization, 399 
problems of, 396-400 
purpose of, 376-382, 384-393 
rating scales for, 372375 
teliability of, 378, 380-382, 384-388, 391-893, 396- 
397 
sampling of behavior in, 398-399 
scores for, 378, 380-382 
typical systems, 375-393 
Bales', 376-378 
Carter's, 382-384 
Chapple's, 384-385 
conference research problem solving, 378-380 
self-oriented need, 381-384 
Steinzor's, 380-381 
unit of observation, 375, 378-385, 387, 391 
validity of, 397-398 
Observational techniques, 370-403 
equipment for, 398-396 
history of, 370-371 
occasions for, 371-872 
optimal conditions for, 400-402 
and psychophysics, 400-402 
Observer, effect of, 399-400 
tasks of, 372-373 
Oedipus complex, 153-154, 166 
Open-ended question, 341 
Operational analogy, 80-81 
extension of theory, 79 
Operationism, 57-58, 91-92, 107-108, 188, 264 
Operating characteristics, 289-303, 343, 350-352, 359- 
360 * 
definition of, 291 
of items, 343, 350-352, 359-360 
Opinion polling, 18, 48, 341 
Opinion statement, 341 
Organismic variable, 263, 269 
Organization, level of, 131-133, 136 
manifest, 103-104 
perceptual, 97-99, 104-107 
silent, 109-114 
See also: cognitive organization 
Out-of-the-field behavior, 206 e 
Overlapping situation, 198, 206-207 
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Paired comparison scaling, 345-347 
Parameter, 266, 332-333 
Parent figure, 243-244 
Participation, 209 
Past experience, 106, 108-109, 208-209 
Path, 189, 196-197, 199 
Perception, 36, 97-99, 104-107, 116, 151-152, 210-211, 
228-231, 241, 253-255, 544-545 
and motivation, 116 
of persons, 106-107, 210-211, 228-231, 241, 253-255, 
544-545 
and psychoanalysis, 151-152 
social, 36, 104-107, 210-211 
Perceptual representation, theories of, 99-103 
Perceptual unit formation, 98-99, 103-107 
Person, as cause, 106-107, 194 
in field theory, 190-191 
Personality, 113, 148-155, 224, 239, 439-436 
and culture, 239 
effect of on cognition, 113 
and sociometric status, 482-436 
subdivisions of, 148-155 
Personality measurement, and content analysis, 500- 
501 
Personality scaling, 250-251 
Phenomenal causality, 103-104 
field, theories of, 99-103 
Phenomenology, 18, 20-21, 28-29, 92-93, 101-102, 122- 
123, 183-185 
definition of, 101-102 
Phenotype, 187 
Phobia, 160-161 
Phrenology, 8 
Physiognomic quality, 123 
Play, 226-227 
Pleasure principle, 149-150 
Poisson distribution, 326 
Position, in field theory, 196 
in a group, 218-219 
in role theory, 223-225 
Positivism, 4, 7, 57-58, 91-93, 107-108 
Postulate system, Hull's, 70 
Posture, and role perception, 230-231 
Potency, 197-198, 207 
Power, 13-15 
Power curve, 289-303, 314 
definition of, 291 
Power field, 209-210 
Pragmatism, 57-58 
Preconscious, 155-156 
Prediction, 266-267 
Prejudice, 342-343, 425-426 
sociometric measures of, 425-426 
Prestige, 47 
Primal horde, 167 
Primary group, 167 
Primary process, 149-150 . 
Primitive passive sympathy, 20-23 
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Principle, of stimulus-response contiguity, 74-75 
of utility, 10 

Privacy, 102-103, 123 

Probabilities, multiplication of, $29 

Probing, nondirective, 466, 478 

Problem solving, 71, 376-379 
by groups, 376-379 

Process analysis, 78 


and conscious-unconscious distinction, 156 
and repression, 163 
conscious, 148, 155-156 
preconscious and unconscious, 155-156 
and subdivisions of personality, 148 
defenses (see ego, defenses) 
id, 148-150, 153-154, 163 
contents, 148-149 


Productive thinking, 109 and ego, 153 
Productivity, of groups, 233 functions, 149-150 
Projection, 164, 237, 342 origin, 148 


Projective questions, 460-462 
Projective techniques, and content analysis, 499 
and sociometric status, 435-436 
Propaganda, 28, 495-496, 501-502, 512 
Propensity, 41 
Propositional language, 125-127 
Proximal determination, 95-97 
Psychic actuality, 36-37 
Psychoanalysis, and anthropology, 170-171 
applications of to other social sciences, 170-175 
origins of, 144 
and psychology, 143-145 
recent trends in, 168-170 
research methods of, 169-170 
and social psychology, 145, 165-168, 172-175 
and sociology, 171-172 
and stimulus-response theory, 58 
Psychoanalytic theory, 148-175 
anxiety, 160-163 
and defenses, 162-163 
definition of, 160 
origins, 161 
in relation to ego, 162 
in relation to superego, 162 
types of, 160-161 
ego, 148, 150-155, 159, 162, 168-169 
defenses, 162-164 
other defenses, 164 
projection, 164 
reaction formation, 164 
regression, 164 
repression, 162-164 
sublimation, 164 


and primary process, 149-150 
and secondary process, 150 
and superego, 154 
instincts, 146, 156-160, 169 
characteristics of, 157 
death, 146, 156-160, 169 
definition of, 156-157 
and the cgo, 159 
and the id, 148-149 
life, 146, 157-159 
number of, 157 
origin of, 156-157 
and the superego, 159-160 
theory, recent trends, 169 
libido, 146, 157, 169 
pleasure principle, 149-150 
preconscious, 155-156 
primary process, 149-150 
reality principle, 150-151, 153 
secondary process, 150-151, 153 
social psychology, 165-168 
culture, 167-168 
family structure and dynamics, 165-167 
group psychology, 167 
socialization of the individual, 165 
society, origins of, 167 
superego, 147, 153-155, 159-160, 161-162 
and anxiety, 161-162 
and ego, 154-155 
functions of, 154 
and id, 154 
and instincts, 159-160 
origins of, 153-154 
unconscious, 148, 151-152, 155-156 


functions of, 150-151 Psychological field, 185, 189-195 

and memory, 151-152 Psychology, social vs. physiological, 8 

origins of, 150 Psychometric function, 343 

and perception, 151-152 ` Psychopathology, and role theory, 233 

in relation to anxiety, 162 Psychopathy, 245-248 

in relation to id, 153 G Psychophysics, and observational techniques, 4 

in relation to instincts, 159 Psychosexual development, 158 

in relation to superego, 154-155 Psychosis, 252 

theory, recent trends, 168-169 Pàychotherapy, 385-386, 440 

and thought and judgment, 152-153 and stimulus-response theory, 82-83 
energy, 145-147, 153 Punishment, 83, 205 

of ego, 153 Ee Purpose, 94, 183. . 
cathexis, 146-147, 156, 163 Purposive behavior, 62, 94, 538-539 
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Quantitative techniques, 289-533 fractional anticipatory, 62 
Quasi-stationary equilibrium, 217 nature of, 73 
Question, direct vs. indirect, 460-462 Response-response theory, 77 
open vx. closed, 457-459 ` Responsibility, 210-211 


phrasing of, 455-459, 476-477 
Questionnaire, 558, $44, 482-483 


Randomization, 269-271 
Randomired blocks design, 278-279 
Randomized groups design, 277-278 
Range, 304, 323-324 
Rank order scaling, 347 
Rankings, multiple, 290, $19-320 
Rapport, 25, 464-465, 474-475 
Rating, 264 
errors in, 271-272 
Rating scales, 272.275 
vs. sociometric measures, 406 
Rationalism, 15.18 
ationalization, 14, 16-17 
Reaction formation, 164 
Readability, 496-497 
Reality, level of, 192-193 
Reality principle, 150-151, 153 
Reality testing, 151 
Recall of interrupted tasks, 200-202, 218 
Reciprocal action, 223 
Redintegration, 26 
Reduction of determining tendencies, 27 
Reductionism, genetic, 124-182 
Reference scale, 208 
Region, in field theory, 196-197 
Regression, 157, 164, 203-204, 249-250 
Regression effects, 279-281 
Reinforcement, 11-13, 60, 114, 117-118, 120-121, 242 
and imitation, 23 
Reinforcement theory, 63-71, 226 
Relational analysis, 409 
Relational determination, 98-99, 130 
Reliability, 264, 338-340, 363, 371, 378, 880-382, 384- 
388, 391-393, 396-397, 420-422, 453-454, 514-515 
of content analysis, 514-515 
of interviews, 453-454 
of observation systems, 378, 380-982, 384-388, 391- 
393, 396-397 
of questions, 462 
rater, 363 
of scales, 338-340 
of sociometric measures, 420-422 
Religious experience, 34 
Repetition compulsion, 157 
Replications, number of, 272-273 
Representations, collective, 37-38 
Repression, 31, 152, 161-164, 236-237 H 
and crowd behavior, 31 
Research, types of, 259-260 
Respondents, scoring of, 350 S 
Response, cognitive definition of, 108-109 


Resumption of interrupted tasks, 201-202 
Right, 226 
Rigidity, 136, 201, 247, 249-250 
Rites de passage, 255 
Role, 223-227, 232.233, 235-238, 248-255, $80, 462-464, 
473 
accessibility of, 255-236 
acquisition of, 226-227 
“as if" dimension of, 236-238 
interaction with self, 248-255 
number of, 232-238 
relationships in interviewing, 462-464, 478 
vs. self, 253-255 
Role conflict, 228 
Role enactment, 232.238 
level of involvement in, 233-235 
organismic dimension of, 233-237 
Role expectation, 225-229 
Role perception, 229-232 
Role-playing technique, 228 
Role-role conflict, 252-253 
Role taking, 232, 237-238, 243, 246-248 
Role theory, 223-255 
origins of, 223-224 
vs. interaction theory, 223 
Rorschach test, 249-250 
Run test, 290, 321-323 


Sampling, 265, 398-399, 515-516, 526-527 
in content analysis, 515-516 
in cross-cultural studies, 526-527 
Satiation, 202-203 
Saturation point, 425 
Scale criteria, 363 
Scale, interval, 364 
Scaling, 264, 336-341, 344-365 
comparison of methods, 363-365 
judgment methods, 344-351 
latent class analysis, 360-361 
latent distance scale, 361-362 
latent structure analysis, 359-362 
maximum variance score, 353, 358 
method of summated ratings, 351-353 
principal components, 358 
rating methods, 362-363 
response methods, 344 
single common factor model, 352-358 
unfolding techniques, 358-359 
Scalogram analysis, 353-358, 361-362 
Schema, 110-111, 113, 116 
Schizophrenia, 247 
Science, classification of, 7-8 
Secondary process, 150-151, 153 
Secondary reinforcement, 6%, 72, 115, 118-120 
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Selective distortion, 342 
Self, 26, 29, 228-224, 238-255 
and cognitive tasks, 248 
empirical methodology for assessing, 244-245 
experimental study of, 245-248 
interaction with role, 248-255 
vs. role, 253-255 
subconscious, 26, 29 
See also: ego 
Self-actualization, 115-116 
Self-concept, 121-122 
Self-consciousness, 236 
Self-oriented need, 381 
Self-role conflict, 251-252 
Sentiment, 41, 45-46 
self-regarding, 46 
vs. attitude, 45-46 
Set, 43-44, 110 
See also: aufgabe, determining tendency 
Sex, and sociometric status, 430 
Sex differences, in role expectation, 228-229 
in role perception, 230 
Sex role, 226 
Sexual instinct, 146, 157-158 
Sibling rivalry, 166-167 
Sign, 128-130 
Sign gestalt, 104 
Sign test, 290, 312-314 
Signal, 128-130 
Significance, tests of, 261-263, 268, 290-303, 331-332 
combining, 290, 328-331 
one-tailed vs. two-tailed, 262 
Significant symbol, 22 
Silent organization, 109-114 
See also: set 
Similarity, association by, 37 
Simple and sovereign theories, 9-29 
Simplicist fallacy, 9 
Simplicity, 125, 132 
Sleeper effect, 84 
Small group, co-acting, 47 
early studies of, 46-48 
face-to-face, 47 
Small group theory, 232-233, 376-384 
Social act, 535, 545-548 
Social class, 86, 228 
Social climate, 48 
Social contract, 13, 15-16 
Social disorganization, 252 
Social distance scale, 353 
Social facilitation, 47 
Social increment, 47 
Social norm, 195, 253 
Social perception, 36, 104-107, 210-211 
Social psychology, and abnormal psychology, 25-26 
Aristotelian, 6 
Comtean, 7-9 > 
definition of, 4-5 o 


early experimentation in, 46-48 
founders of, 3, 7-9 
history of, 3-51 
objective methodology in, 46-48 
origins of, 3-4, 6-9 
Platonic, 6 
and psychoanalysis, 145, 165-168, 172-175 
secondary sources for history of, 5-6 
and sociology, 4-5, 50-51 
textbooks in, 48-51 
units of analysis of, 40-46 
Social sensitivity, 409, 427 
See also: empathy 
Social stimulus value, 392-393 
Social subvaluent, 47 
Socialization, 15, 83-84, 153-154, 165 
and egoism, 15 
Socially induced change, 209-210, 217 
Society, 31-32, 167, 224, 552-558 
conditions for, 31-32 
insect, 535-537 
origins of, 167 
Sociology, 4-5, 7-9, 38, 171-172 
Comte's invention of, 7-9 
of knowledge, 17 
and social psychology, 4-5 
Sociogram, 410-413 
appraisal of, 412-413 
target, 412 
Sociometric choice, intensity of, 414 
patterns of, 410-411 
Sociometric data, analysis of, 410-420 
factor analysis of, 419 
fractionation of groups of, 419-420 
graphic representation of, 410-413 
indices from, 413-414 
machine computation of, 419 
matrix approaches to, 416-419 
profiles for, 415 
scores from, 413-414 
simple quantitative methods for, 413-415 
statistical methods for, 415-416 
tabular methods for, 413 
Sociometric measures, 
applications of, 436-441 
community applications, 440-441 
educational applications, 436-438 
industrial applications, 438 
military applications, 439-440 
psychotherapeutic applications, 440 
data appropriateness of, 423 
definition of, 407-408 
empirical concepts definable by, 424-429 
irlternal consistency of, 421 
problems of, 441-444 
qualities of, 405-406 
and related devices, 409-410 
relation to independent measures of, 423 
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reliability of, 420-422 
interpretive reliability of, 420-421 
and multiple criteria, 421422 
repeat reliability of, 421 
test reliability of, 421 
research utility of, 423-436 
validity of, 422-424 
face validity of, 422-423 
variations of, 408-409 
Sociometric self-rating, 409 
Sociometric survey, multirelational, 409 
Sociometric variables, measures related to, 429-4936 
cultural variables, 480-431 
demographic variables, 429-430 
personality, 432-436 
Sociometry, 48, 405-444 
origins of, 407 
use of the term, 405 
Socius, 242 
Soviet psychology, 34, 47-48 
Spontaneous recovery, 61 
Square root transformation, 326 
Stability, 136, 250 
Standard deviate, 329 
Standard deviation, of a mean, 323 
Statistical techniques, 289-333 
quick methods in, 290, 323-324 
Status, 224, 427-430 
sociometric measures of, 427-428 
and sociometric variables, 429-430 
Stereotype, 17 
Stimulus, equivalence, 452-453 
generalization, 61-62, 75, 130-132 
nature of, 73, 97-99 
vs. need, 156-157 
Stimulus-intervening variable law, 67 
Stimulus-response theory, 57-87 
categorization of behavior sequence, 59-61 
us. cognitive theory, 93, 96-97, 113-115, 118-120 
contiguity theory, 63, 73-77 
dual theories, 64-65, 71-78 
and language, 129-130 
origins of, 57-58 
and other behavior sciences, 85-87 
and reductionism, 124-125 
reinforcement theory, 63-64, 65-71 
and social phenomena, 69-70, 81-85, 119-120 
systems of, 59, 63-77 
Stimulus-stimulus theory, 70-71 
Stimulus variable, 66 
Stochastically smaller scores, 317-319 
Structural analysis, 78 
Subconscious self, 26, 29 


Subjects, number of, 266-267, 277, 292-294, 302-308, 


330-331 
randomization of, 269-270 
selection of, 265-266, 973-974 «© 
Sublimation, 164 


Submission, 24, 591.392 
Substitution, 202 
Successive intervals, method of, 347-349 
Suggestibility, 25-27, 29, 287 
conditions of, 27, 29 
Suggestion, 18-19, 24-50 
definition of, 24, 26-27 
explanations of, 24-29 
Sum of squares, 267-268 
Superego, 147, 153-155, 159-162 
Survey research, 259 
Symbol, 128-130 
significant, 22 
universal, 34 
Symbolic function, 121, 125-130 
Sympathy, 18-23 
empirical studies of, 21 
forms of, 19-21 
primitive passive, 20-23 
proponents of, 19-20 
Symptoms, 163 
Syncretism, 136 
System, personality vs. social, 40 
theoretical, 59-77 
theoretical-empirical, 59, 77-87 


T statistic, 315-317 
t test, 272-278, 303, 323-324 
Taboo, 556-557 
Task orientation, 122, 202 
Technique research, 260 
Tension binding, 239, 245-246 
Tension reduction, 11-13, 114-115, 240-241 
Tension system, 112, 199-205 
Testing, 264 
Textbooks in social psychology, 48-51 
Theory, problem of generalizing, 77-81 

simple and sovereign, 9-29 
Thinking, 125-130, 152-153 
Thurstone type scale, 344-351 
Time binding, 239 
Topological diagram, 67, 198-199 
Topology, 195-199 
Torres Straits Expedition, 35 
Total situation, 185-186, 189-195 
Trace line, 343, 359 

system, 111-113 
Trait, 238 


“Transfer of training, 75, 109, 181-182 


Transformation of motives, 15, 70-71 
Transformations, 290, 324-328 
Transitoriness of instinct, 42 
Transposition, 130-132 

‘Trauma, 160-161 

Triadic hypothesis, 236-237 


U statistic, 290, 317-319 © 
Unconscious, 109-111, 148, 151-152, 155-156 
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Unidimensionality, of a scale, 339-340, 363 
Unitary explanation, fallacy of, 9 
Utility, principle of, 10 


V statistic, 317-319 id 
VTE, 245 
Valence, 198, 200, 204, 206, 208 
Validity, of content analysis, 504-505 

of interviews, 454, 479-482 

of observation systems, 397-398 

of questions, 458, 460-462 

of scales, 338, 340-341 

of sociometric measures, 422-424 
Value, 45, 113-114, 431-432, 510 
Variable, 263-264 

independent vs. dependent, 261 

quantification of, 264 


Variance estimate, 329-330 

stabilizing transformations, 325-326 
Variation between groups, 265, 268-272 
Veridicality of role perception, 229-230 
Vocalization, 126 
Voodoo death, 235 
Vulnerability of a theory, 65 


W-A-Y technique, 244-245 

Wagging dance, 537-538 
Wald-Wolfowitz run test, 290, 321-323 
Weaning, age of, 239, 524-525 

Whitney's U statistic, 290, 317-519 
Wilcoxon's signed rank test, 290, 314-315 
Wish, 149-150 

Würzburg school, 44 
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